Inferring Distributions Over Depth
from a Single Image
Gengshan Yang, Peiyun Hu and Deva Ramanan
Carnegie Mellon University
Why monocular images?
2
LiDAR depth image
Why estimating depth distributions?
3
[1] Li, Zhengqi, and Noah Snavely. "MegaDepth: Learning single-view depth prediction from internet photos." CVPR 2018.
[2] Fu, Huan, et al. "Deep ordinal regression network for monocular depth estimation." CVPR. 2018.
[3] Godard, Clément, et alGodard, Clément, et al. "Digging into self-supervised monocular depth estimation." ICCV. 2019.
MegaDepth [1]
DORN [2]
MonoDepth2 [3]
Why estimating depth distributions?
4
input image
depth estimation
(yellow->near)
Why estimating depth distributions?
5
Our Probabilistic Solution
6
d=80m
d=40m
d=20m
d=10m
Our Probabilistic Solution
7
d=80m
d=40m
d=20m
d=10m
Our Probabilistic Solution
8
d=80m
d=40m
d=20m
d=10m
d*=41m
Our Probabilistic Solution
9
d=80m
d=40m
d=20m
d=10m
d*=41m
Our Probabilistic Solution
10
d=80m
d=40m
d=20m
d=10m
Network Architecture
11
A lightweight architecture estimating distributions over depth given a single image.
Network Architecture
12
A lightweight architecture estimating distributions over depth given a single image.
Network Architecture
13
A lightweight architecture estimating distributions over depth given a single image.
Experiments
14
Baselines: Unimodal Gaussian
15
d=80m
d=40m
d=20m
d=10m
Kendall, Alex, and Yarin Gal. "What uncertainties do we need in bayesian deep learning for computer vision?." NeurIPS. 2017.
Baselines: Unimodal Gaussian
16
d=80m
d=40m
d=20m
d=10m
Kendall, Alex, and Yarin Gal. "What uncertainties do we need in bayesian deep learning for computer vision?." NeurIPS. 2017.
Baselines: Softmax
17
d=80m
d=40m
d=20m
d=10m
Cao, Yuanzhouhan, Zifeng Wu, and Chunhua Shen. "Estimating depth from monocular images as classification using deep fully convolutional residual networks." IEEE Transactions on Circuits and Systems for Video Technology. 2017.
Qualitative: Unimodal v.s. Multimodal
18
Qualitative: Unimodal v.s. Multimodal
19
Evaluating Depth Distribution
20
Hu, Xiaoyan, and Philippos Mordohai. "A quantitative evaluation of confidence measures for stereo vision." TPAMI, 2012.
Evaluating Depth Distribution
21
Hu, Xiaoyan, and Philippos Mordohai. "A quantitative evaluation of confidence measures for stereo vision." TPAMI, 2012.
Evaluating Depth Distribution
22
Hu, Xiaoyan, and Philippos Mordohai. "A quantitative evaluation of confidence measures for stereo vision." TPAMI, 2012.
Evaluating Depth Distribution
23
Hu, Xiaoyan, and Philippos Mordohai. "A quantitative evaluation of confidence measures for stereo vision." TPAMI, 2012.
24
Sorted with error
25
Evaluating Depth Distribution
26
[1] Kendall, Alex, and Yarin Gal. "What uncertainties do we need in bayesian deep learning for computer vision?." NeurIPS. 2017.
[2] Cao, Yuanzhouhan, Zifeng Wu, and Chunhua Shen. "Estimating depth from monocular images as classification using deep fully convolutional residual networks." IEEE Transactions on Circuits and Systems for Video Technology. 2017.
Standard Depth Estimation
27
[1] Kendall, Alex, and Yarin Gal. "What uncertainties do we need in bayesian deep learning for computer vision?." NeurIPS. 2017.
[2] Cao, Yuanzhouhan, Zifeng Wu, and Chunhua Shen. "Estimating depth from monocular images as classification using deep fully convolutional residual networks." IEEE Transactions on Circuits and Systems for Video Technology. 2017.
[3] Fu, Huan, et al. "Deep ordinal regression network for monocular depth estimation." CVPR. 2018.
Application: Dense Monocular Mapping
28
Bârsan, Ioan Andrei, et al. "Robust dense mapping for large-scale dynamic environments." ICRA, 2018.
Depth estimation (blue-> far away)
29
image coordinate
Assuming we know the camera intrinsics and extrinsics,
depth
Depth estimation (blue-> far away)
Uncertainty estimation (blue-> small uncertainty)
30
depth
image coordinate
Assuming we know the camera intrinsics and extrinsics,
entropy
we further remove unreliable points with high entropy
Depth estimation (blue-> far away)
Uncertainty estimation (blue-> small uncertainty)
31
depth
image coordinate
Assuming we know the camera intrinsics and extrinsics,
entropy
we further remove unreliable points with high entropy
Octomap
Application: Online Monocular Mapping
32
33
34
188 MB memory
LiDAR
35
243 MB memory
Ours
36
182 MB memory
Ours-uncertainty
37
LiDAR
Ours
Ours-uncertainty
Summary
38
Thanks!
39