Automated aircraft inspection through image-to-CAD model registration
Bharath Somayajula
Niviru Wijayaratne
1
Sponsor : Near Earth Autonomy
The Team
2
Motivation
3
Problem Statement
Build an automated system for aircraft inspection through image-to-CAD model registration
4
Problem Statement: Pose Estimation
5
RGB Image(s)
Untextured CAD Model in Canonical Pose
Untextured CAD Model in Correct Pose
Problem Statement: Texture Mapping
6
RGB Image(s)
Untextured CAD Model in Correct Pose
Textured CAD Model in Correct Pose
Proposed Solution: Original
7
Keypoint
Prediction
Network
Post
Processing
Narrow
FOV
Wide
FOV
Image Capture
Proposed Solution: Updated
8
Post
Processing
[q, C]
Proposed Solution: Approach Comparison
9
[q,C]
Dataset: Choices
10
Background Options | High Fidelity | Low Cost | Scalable |
None | | | |
| | | |
| | | |
Dataset: Choices
11
Background Options | High Fidelity | Low Cost | Scalable |
None | | | |
3D | | | |
| | | |
Dataset: Choices
12
Background Options | High Fidelity | Low Cost | Scalable |
None | | | |
3D | | | |
Random 2D | ? | | |
Dataset: Pipeline
13
Input: CAD Model + Random Background
Render RGB, Depth, Mask
Generate Extrinsic Matrix
Sample Camera Position
Sample point on mesh
Compose with Random Background
Dataset: Background Image Examples
14
Fig: Examples of background images used during training
Dataset: Examples
15
Fig: Sample images produced by dataloader
Model: Architecture
16
Fig: Model architecture [1]
Quaternions
17
Fig: Gimbal lock
Model: Training Details
18
Parameter | Value |
Architecture | ResNet-34 backbone with a new FC layer |
Learning Rate | 0.001 |
Batch Size | 16 |
Epochs | 50 |
Resolution | 288 x 512 (9:16) |
Loss Function | L1 Loss |
Table: Hyper-parameters used to train pose estimation network
Results: Qualitative Evaluation
19
Fig: Top row shows the input images for the pose prediction network and the bottom row shows the rendered RGB images of 3D model from predicted poses
Results: Quantitative Evaluation
20
Metric | Value |
Position Error | 1.44 meters |
Angular Error | 5.38 degrees |
Table: Evaluation metrics on held-out data
Results: Inference
21
Metric | Value |
Number of Parameters | 21.3 million |
Inference Time | 2.7 ms �(at full precision on RTX 3090Ti) |
FLOPS | 38.4 billion |
Table: Inference metrics for pose estimation network
Sim-to-Real Gap
22
Timeline and Future Work
23
October 1st - October 22nd
Improve data generation pipeline (Bharath & Niv)
October 23rd - October 31st
Experiment with loss functions (Bharath)
November 1st - November 15th
Iterative Pose Refinement[2] (Bharath & Niv)
November 15th - November 30th
Texture Mapping (Niv)
References
24
25
Thank you!