1 of 19

Dynamic Depth: Disentangling Object Motion and Occlusion for Unsupervised Multi-frame Monocular Depth

2 of 19

Monocular Depth Prediction

3 of 19

Unsupervised Monocular Depth Prediction

Re-projection loss is the key to unsupervised monocular depth prediction.

Pose Net

6-DOF Pose

Depth Net

Re-Projection Loss

4 of 19

Multi-frame Monocular Depth Prediction

The cost volume has proven to be an effective way to leverage temporal frames to improve overall depth quality. It is also based on re-projection geometry.

Pose Net

Depth Encoder

Re-Projection

Depth Decoder

Cost Volume
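The cost volume idea above can be illustrated with a minimal sketch. It assumes a much simpler setting than the slides describe: raw intensities instead of encoder features, and a purely lateral camera translation, so that each depth hypothesis implies a horizontal pixel shift of focal * baseline / depth. The function name `build_cost_volume` and all parameter values are hypothetical.

```python
import numpy as np

def build_cost_volume(target, source, depths, focal, baseline):
    """Return cost[d, y, x] = |target - source warped under depth hypothesis d|."""
    height, width = target.shape
    xs = np.arange(width)
    cost = np.empty((len(depths), height, width))
    for i, depth in enumerate(depths):
        # Assumption: lateral translation only, so depth maps to a horizontal shift.
        shift = focal * baseline / depth
        src_x = np.clip(np.rint(xs + shift).astype(int), 0, width - 1)
        # Nearest-neighbour warp of the source frame, then absolute difference.
        cost[i] = np.abs(target - source[:, src_x])
    return cost

rng = np.random.default_rng(0)
source = rng.random((4, 32))

# Synthesize a target frame whose true shift is 4 px (depth 5.0 with the values below).
true_shift = 4
cols = np.clip(np.arange(32) + true_shift, 0, 31)
target = source[:, cols]

depths = [2.5, 5.0, 10.0, 20.0]
cost = build_cost_volume(target, source, depths, focal=100.0, baseline=0.2)
best_index = cost.argmin(axis=0)  # lowest-cost depth hypothesis per pixel
```

Away from the image border, the lowest cost falls on the hypothesis that matches the true depth, which is the signal a depth decoder can exploit.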

5 of 19

Re-projection Geometry

Re-Projection

Supposed to match
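As a rough illustration of the re-projection geometry on this slide, the sketch below back-projects a target pixel with a given depth, moves it into the source camera with a rigid transform, and projects it again. The intrinsics `K`, the pose `(R, t)`, and the function name `reproject` are illustrative assumptions, not values from the talk.

```python
import numpy as np

def reproject(u, v, depth, K, R, t):
    """Map target pixel (u, v) with the given depth to source-frame pixel coordinates."""
    pixel = np.array([u, v, 1.0])
    point_target = depth * (np.linalg.inv(K) @ pixel)  # back-project to 3D
    point_source = R @ point_target + t                # rigid transform into source camera
    projected = K @ point_source                       # project onto the source image plane
    return projected[:2] / projected[2]

# Hypothetical pinhole intrinsics and a pure translation along x.
K = np.array([[500.0, 0.0, 320.0],
              [0.0, 500.0, 240.0],
              [0.0, 0.0, 1.0]])
R = np.eye(3)
t = np.array([0.5, 0.0, 0.0])

uv_near = reproject(320.0, 240.0, 5.0, K, R, t)   # shifts by 50 px
uv_far = reproject(320.0, 240.0, 50.0, K, R, t)   # shifts by 5 px
```

With a static scene and correct depth and pose, the re-projected location is where the same surface point is supposed to appear in the source frame; nearer points shift more than distant ones.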

6 of 19

Dynamic Object Mismatch Problem

Dynamic objects will cause the ‘Mismatch’ problem.

Re-Projection

Obj Motion

Mismatch!
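A small worked example may help show why object motion causes a mismatch. It assumes a lateral camera translation, so a static point at depth d shifts by focal * baseline / d pixels; all numbers are hypothetical.

```python
# Hypothetical camera parameters: focal length in pixels, translation in metres.
focal, baseline = 500.0, 0.5
true_depth = 10.0

# Pixel shift expected for a static point at the true depth.
static_shift = focal * baseline / true_depth       # 25 px

# The object's own motion adds extra image displacement between frames.
object_shift = 5.0
observed_shift = static_shift + object_shift       # 30 px

# Depth that a static-scene model would infer from the observed shift.
implied_depth = focal * baseline / observed_shift  # about 8.33 m instead of 10 m
```

The static-scene assumption explains the extra displacement as a change in depth, so both the re-projection loss and the cost volume favour a wrong depth on the moving object.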

7 of 19

Dynamic Object Occlusion Problem

Dynamic objects will cause the ‘Occlusion’ problem.

Re-Projection

Obj Motion

Mismatch!

Occlusion!

Occluded!

Visible!

8 of 19

Motivation:

‘Mismatch’ and ‘Occlusion’ affect:

Re-projection loss (self-supervision).

Existing solutions rely on object motion prediction.

Cost volume (temporal-frame inference).

No existing solution.

We propose to alleviate these problems on BOTH the loss function side and the cost volume side, to enable temporal reasoning in dynamic object areas.

9 of 19

We Propose DynamicDepth:

Depth Prior Net

Pose Net

Depth Encoder

Occlusion-aware Cost Volume

Depth Decoder

Dynamic Object Motion Disentanglement (DOMD)

Dynamic Object Cycle Consistency Loss

Our Contributions:

Novel Dynamic Object Motion Disentanglement (DOMD) module.
Dynamic Object Cycle Consistent training scheme.
Occlusion-aware Cost Volume and Re-projection Loss.

10 of 19

DOMD Module

Re-project the dynamic object patch using the ‘depth prior’ prediction.

Depth Prior Prediction

11 of 19

DOMD Module

Replace dynamic object patch with re-projected image patch.

DOMD
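The replacement step can be sketched as a simple masked composite. This is only an illustration under stated assumptions: the re-projected patch and both object masks are taken as given, the function name `replace_dynamic_patch` is hypothetical, and zeroing the vacated pixels and returning them as an occlusion mask is a guess at how the uncovered region might be handled.

```python
import numpy as np

def replace_dynamic_patch(frame, reprojected, old_mask, new_mask):
    """Paste the re-projected object patch into the frame.

    old_mask marks where the object originally appears in the frame;
    new_mask marks where the re-projected patch lands.
    """
    out = frame.copy()
    # Assumption: pixels vacated by the object have no valid content.
    occluded = old_mask & ~new_mask
    out[occluded] = 0.0
    out[new_mask] = reprojected[new_mask]
    return out, occluded

frame = np.array([[1.0, 2.0, 3.0, 4.0, 5.0, 6.0]])
reprojected = np.full_like(frame, 9.0)
old_mask = np.array([[False, True, True, False, False, False]])
new_mask = np.array([[False, False, True, True, False, False]])

replaced, occluded = replace_dynamic_patch(frame, reprojected, old_mask, new_mask)
```

In this toy example the object moves one pixel to the right, leaving one vacated pixel that is flagged as occluded.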

12 of 19

DOMD Module

This replacement will alleviate the ‘Mismatch’ problem.

Re-Projection

Obj Motion

Match

Occlusion

13 of 19

Occlusion-aware Cost Volume

Sharing Weights

Warp by All Depth Hypotheses

Occlusion Filling

Occlusion-aware Cost Volume

The occluded areas are filled with non-occluded cost values.
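One possible reading of the filling step is sketched here. The slide does not say which non-occluded values are used, so this sketch assumes the nearest non-occluded pixel in the same image row; the function name `fill_occluded` is hypothetical.

```python
import numpy as np

def fill_occluded(cost, occluded):
    """Fill occluded pixels of a (D, H, W) cost volume from visible neighbours.

    occluded is an (H, W) boolean mask. Assumption: each occluded pixel
    copies the cost vector of the nearest non-occluded pixel in its row.
    """
    filled = cost.copy()
    height, _ = occluded.shape
    for y in range(height):
        visible_x = np.flatnonzero(~occluded[y])
        if visible_x.size == 0:
            continue  # nothing visible in this row to copy from
        for x in np.flatnonzero(occluded[y]):
            nearest = visible_x[np.argmin(np.abs(visible_x - x))]
            filled[:, y, x] = cost[:, y, nearest]
    return filled

cost = np.array([[[1.0, 2.0, 3.0, 4.0, 5.0]],
                 [[10.0, 20.0, 30.0, 40.0, 50.0]]])  # shape (2, 1, 5)
occluded = np.array([[False, False, True, True, False]])

filled = fill_occluded(cost, occluded)
```

The occluded columns take on the cost vectors of their closest visible neighbours, so the decoder never sees costs computed from invalid pixels.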

14 of 19

Occlusion-aware Re-projection Loss

Re-proj Error at t-1

The occluded areas are filled with non-occluded cost values.

15 of 19

Occlusion-aware Re-projection Loss

Source Frame

: From visible frame

: From occluded frame

Widely Used Per-pixel min Loss

Re-proj Error at t-1

16 of 19

Occlusion-aware Re-projection Loss

Source Frame

Our Occlusion-aware Loss

: From visible frame

: From occluded frame

Re-proj Error at t-1
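The difference between the two losses can be sketched as follows. The per-pixel minimum takes the smaller re-projection error across source frames. The occlusion-aware variant shown here is an assumption about the mechanism: where frame t-1 is known to be occluded, it ignores that frame's error and uses the error from the other source frame. Function names and error values are hypothetical.

```python
import numpy as np

def per_pixel_min_loss(err_prev, err_next):
    """Widely used per-pixel minimum over source-frame re-projection errors."""
    return np.minimum(err_prev, err_next)

def occlusion_aware_loss(err_prev, err_next, occluded_prev):
    """Assumed variant: never take the error from a frame flagged as occluded."""
    return np.where(occluded_prev, err_next, np.minimum(err_prev, err_next))

# Hypothetical errors for three pixels; the middle pixel is occluded at t-1
# but happens to have a low (spurious) error there.
err_prev = np.array([0.1, 0.05, 0.3])
err_next = np.array([0.2, 0.4, 0.1])
occluded_prev = np.array([False, True, False])

min_loss = per_pixel_min_loss(err_prev, err_next)
aware_loss = occlusion_aware_loss(err_prev, err_next, occluded_prev)
```

For the middle pixel, the per-pixel minimum selects the error from the occluded frame, while the occlusion-aware variant selects it from the visible frame.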

17 of 19

Conclusion: Our method outperformed all the other methods on the Cityscapes and KITTI datasets.

18 of 19

Conclusion: Our method significantly outperformed all the other methods, especially in dynamic object areas.
