Structure-from-Motion (SfM)

Overview

This repository contains a Structure from Motion(SfM) pipeline that resonctructs 3D point clouds from a set of continuous images of a scene. The project aims to demonstrate the fundamentals of camera calibration, feature matching, feature matching, pose estimation, triangulation of 3D points and bundle adjustment(optional) to refine both 3D pointds and camera parameters and point cloud visualization which is a 3D representation of the captured scene.

Key Highlights:

Incremental SfM: Takes pairs of images, estimates relative pose, and gradually adds more images to improve the reconstruction.
Feature Matching: Uses SIFT + FLANN-based matching with ratio test for robust keypoint correspondences.
3D Reconstruction: Triangulates matched points across multiple views to produce a 3D point cloud.
Bundle Adjustment (optional): Refinement of camera poses and 3D structure via iterative least squares.
PLY Output: Saves the final 3D point cloud (with color) in .ply format for visualization in tools like Open3D or MeshLab.

Features

Camera Intrinsic Handling
- Loads a custom K.txt file containing the 3×3 camera intrinsic matrix.
- Automatically scales intrinsics if downsampling the input images
Robust Feature Matching
- SIFT descriptors (up to 10,000 features if needed).
- FLANN-based nearest neighbor search with ratio test to remove ambiguous matches(outliers).
Pose Estimation
- Essential matrix estimation + recoverPose for the initial two images
- solvePnPRansac for subsequent images to localize new cameras.
Triangulation
- Converts matched points into 3D coordinates with cv2.triangulatePoints.
Bundle Adjustment(Optional)
- Minimizes reprojection error across cameras and 3D points for a more accurate reconstruction if enabled.
PLY Export
- Saves final 3D points + color in ASCII or binary PLY format.
- Includes optional outlier removal and user-defined scaling factor.

Methodology

The Structure from Motion(SfM) pipeline reconstructs a 3D representation of an object from a continuous series of continuos 2D images of an object from different angles by estimating camera poses and triangulating the matched feature points. Here is the stp-by-step implementation of the pipeline:

Image & Intrinsics Loading
- Read all images from a dataset directory capturing the object from different viewpoints.
- Read K.txt for the 3×3 camera intrinsics.
- Downsamples images and scales the intrinsic matrix accordingly.
Feature Detection and Matching
- Feature Detection: Use SIFT to detect and compute descriptors for the iamges.
- Feature Matching: Utilize a FLANN-based matcher to finds matches between consecutive images.
- Lowe's Ratio Test: filters ambiguous(outliers) matches.
Initial Pose Estimation
- Essential Matrix estimation: Compute essential matrix using th matched feature points and recover pose for the first two images.
- Pose Recovery: Recover fundamental camera motion (rotation, translation) under the pinhole camera model.
Triangulation:
- Project Matrices: Construct projection matrices for each camera pose.
- 3D points Triangulation: Convert 2D matched points into 3D coordinates. Maintain a growing set of 3D points as more images are added.
Incremental SfM
- For each new image:
  - Match features to the previous image.
  - SolvePnPRansac to get the new camera pose.
  - Triangulate points between the new view and an existing view.
Bundle Adjustment(Optional)
- If enabled, refine all camera parameters and 3D points by minimizing reprojection error.
Point Cloud Export
- Accumulate all 3D points plus colors from each iteration.
- Save to PLY with potential outlier removal, scaling, and color normalization.

Installation

Prerequisites

Python 3.7+

Install Additional Dependencies listed in the 'Requirements.txt' file.

Setup Instructions

Clone the Repository:

git clone https://github.com/StarkGoku10/Multiview-Structure-From-Motion.git
cd Multiview-Structure-From-Motion

Create a Virtual Environment(Optional but recommended):

python3 -m venv venv
source venv/bin/activate #on windows: venv\Scripts\activate

Install Dependencies:
```
pip install -r Requirements.txt
```
Visualization(Optional)

For visualizing the saved point cloud, python's Open3D library is used.
```
pip install open3d
```
Important Note:

Images:
- Ensure there is sufficient overlap of the object/scene between consecutive images.
- Capture images with different orientations and perspectives of the scene.
Calibration File(K.txt):
- This file contains the camera intrinsic parameters(matrix). The format expected for this file is nine numerical values in a single line or representating 3x3 matrix seperated by spaces or new lines.
- Ensure that the matrix is accurate and corresponds to the camera used to capture the images.
- Example K.txt matrix:
```
2759.48 0 1520.69
0 2764.16 1006.81
0 0 1
```

Usage

Executing the SfM pipeline

Execute the sfm.py script to run the SfM pipeline. The script processes the first two images and triangulates the points between the two images. Then the pipeline adds one image sequentially to reconstruct the D structure of the scene. Optionally, Bundle Adjustment can be enabled to refine the 3D points and reduce the reprojection errors.

Steps:

Navigate to the Project Directory

Ensure you are in the root directory where the 'SfM.py` file resides.
```
cd Multiview-Structure-from_Motion
```
Run the Pipeline

Note: Adjust the dataset path as needed(in the __main__ block):
```
if __name__ == '__main__':
  sfm= StructurefromMotion("Datasets/YourDataset")
  sfm()
```
Ensure you have specified the proper directory paths for the dataset. Execute the following command:
```
python SfM.py
```
Monitor the Output:

The script will process first two first and display messages about the processing step.
For each subsequent step, it will perform featue matching,pose estimation, pose recovery, trianglation, bundle adjustment(if enabled) and display the iamge and reprojection error graph for all the images.
The script will save the final 3D reconstruction in the form of a point cloud(.ply) and the pose array in the the respective folders.

Visualize the Point Cloud:
- The point cloud can be visualized using visualized tools like MeshLab, Open3D or CloudCompare.
- For this project, Open3D is used for visualzation of the point cloud. To install open3d, Execute the following command:
```
pip install open3d 
```
- Run the script with the appropriate path to the point cloud to generate the 3D visualization:
```
python visualize.py
```

Output

Console:

  Camera Intrinsic Matrix: [[ 1.1969761e+03 -3.4106051e-13  4.6619110e+02]
  [ 0.0000000e+00  1.1990593e+03  3.1413251e+02]
  [ 0.0000000e+00  0.0000000e+00  1.0000000e+00]]
  Reprojection error for first two images: 0.00487721240747486
  total_images 55
    0%|                                                                                                                                                                             | 0/55 [00:00<?, ?it/s] Shape of New Array (468, 2) (468, 2)
  Reprojection error: 0.0508255796928927
    2%|███                                                                                                                                                                  | 1/55 [00:01<00:58,  1.08s/it] Shape of New Array (295, 2) (295, 2)
  Reprojection error: 0.5636247818285103
    4%|██████                                                                                                                                                               | 2/55 [00:01<00:37,  1.40it/s]
    ...

Reprojection Error Plot:
- Saved in /Results directory.
Point Cloud(.ply):
- Point cloud will be saved in directory /Datasets/<DatasetName.ply>.
- Contains the 3D points plus per-point color(in ASCII or binary).
Pose Array:
- A CSV file genereated will stored in /Results Array/<DatasetName>_pose_array.csv with all the camera parameters.

Results

Results(without Bundle Adjustment):

Herz-Jesus:

Reprojection Error plot

Reconstruction

Fountain-P11

Reprojection Error plot

Reconstruction

Gustavll-Adolf

Reprojection Error plot

Reconstruction

Entry-P10

Reprojection Error plot

Reconstruction

Results(with Bundle Adjustment):

Herz-Jesus:

Reprojection Error plot

Reconstruction

Fountain-P11

Reprojection Error plot

Reconstruction

Entry-P10

Reprojection Error plot

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Structure-from-Motion (SfM)

Overview

Features

Methodology

Installation

Prerequisites

Setup Instructions

Usage

Executing the SfM pipeline

Output

Results

Results(without Bundle Adjustment):

Results(with Bundle Adjustment):

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 38 Commits
Dataset		Dataset
Results		Results
LICENSE		LICENSE
README.md		README.md
Requirements.txt		Requirements.txt
SfM.py		SfM.py
Visualize.py		Visualize.py

License

StarkGoku10/Multiview-Structure-From-Motion

Folders and files

Latest commit

History

Repository files navigation

Structure-from-Motion (SfM)

Overview

Features

Methodology

Installation

Prerequisites

Setup Instructions

Usage

Executing the SfM pipeline

Output

Results

Results(without Bundle Adjustment):

Results(with Bundle Adjustment):

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages