From BEV to Scene-as-Occupancy:
An Overview of Camera 3D Perception
Chonghao Sima
Shanghai AI Laboratory | 上海人工智能实验室
Why we need 3D perception
Autonomous Driving: RoboTaxi, Trunk, ADAS System, Delivery
Robotics & Embodied AI: Housework, Agriculture, Logistics system, Industry
Why camera-based? Low cost, easy to deploy, long range, and rich in semantic appearance.
Core Issues in Camera-only 3D Perception
Accurate depth: bridging the gap between camera-based and LiDAR-based methods
[1] Depth Estimation Matters Most: Improving Per-Object Depth Estimation for Monocular 3D Detection and Tracking, arXiv:2206.03666
How do we solve it?
Trending in BEV Perception
2021.7
2021.10-12
2022.3
2022.5
Core Question: How to model the View Transformation from perspective view to BEV more effectively?
View Transformation
Issues:
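Whichever approach is taken, the transformation is ill-posed: a single image pixel corresponds to a whole ray in 3D, so depth cannot be recovered from one view without additional priors.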
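The ill-posedness can be made concrete with a small sketch. It assumes a plain pinhole camera; the intrinsic matrix `K`, the pixel, and the candidate depths are hypothetical values chosen for illustration only. Several 3D points at very different depths all project to the same pixel, so the image alone cannot tell them apart.

```python
import numpy as np

# Hypothetical pinhole intrinsics (focal lengths and principal point, in pixels).
K = np.array([[1000.0, 0.0, 640.0],
              [0.0, 1000.0, 360.0],
              [0.0, 0.0, 1.0]])

def backproject(pixel_uv, depth, K):
    """Return the camera-frame 3D point at `depth` along the ray through `pixel_uv`."""
    u, v = pixel_uv
    ray = np.linalg.inv(K) @ np.array([u, v, 1.0])  # ray direction with z = 1
    return ray * depth

def project(point_xyz, K):
    """Project a camera-frame 3D point to pixel coordinates."""
    uvw = K @ point_xyz
    return uvw[:2] / uvw[2]

pixel = (800.0, 400.0)
# Three candidate 3D points on the same ray, at 5 m, 20 m and 60 m.
candidates = [backproject(pixel, d, K) for d in (5.0, 20.0, 60.0)]
# All of them land on the original pixel again.
reprojected = [project(p, K) for p in candidates]
```

Any view transformation therefore has to bring in extra information, such as a depth estimate or a geometric prior, to pick one point on the ray.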
Two Ways to Address View Transformation
Way 1: From-2D-to-3D prior
Way 2: From-3D-to-2D prior
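A minimal sketch of the two directions, under stated assumptions: Way 1 is illustrated by weighting each pixel feature with a per-pixel depth distribution to fill a camera frustum, and Way 2 by projecting predefined 3D points into the image and reading features there. The function names, tensor shapes, toy intrinsics, and nearest-neighbor sampling are all illustrative choices, not the method of any particular paper.

```python
import numpy as np

def lift_2d_to_3d(image_feat, depth_prob):
    """Way 1 sketch: image_feat (C, H, W) and depth_prob (D, H, W)
    give frustum features (C, D, H, W) via a per-pixel outer product."""
    return image_feat[:, None, :, :] * depth_prob[None, :, :, :]

def sample_3d_to_2d(image_feat, points_xyz, K):
    """Way 2 sketch: project camera-frame points (N, 3) into the image and
    read the nearest pixel's feature. Returns (N, C) features and a validity mask."""
    C, H, W = image_feat.shape
    uvw = points_xyz @ K.T
    z = uvw[:, 2]
    in_front = z > 1e-6
    safe_z = np.where(in_front, z, 1.0)  # avoid dividing by zero or negative depth
    u = np.round(uvw[:, 0] / safe_z).astype(int)
    v = np.round(uvw[:, 1] / safe_z).astype(int)
    valid = in_front & (u >= 0) & (u < W) & (v >= 0) & (v < H)
    out = np.zeros((len(points_xyz), C), dtype=image_feat.dtype)
    out[valid] = image_feat[:, v[valid], u[valid]].T
    return out, valid

rng = np.random.default_rng(0)
feat = rng.standard_normal((4, 6, 8))           # C=4, H=6, W=8
depth_logits = rng.standard_normal((5, 6, 8))   # D=5 depth bins per pixel
depth_prob = np.exp(depth_logits) / np.exp(depth_logits).sum(axis=0, keepdims=True)
frustum = lift_2d_to_3d(feat, depth_prob)

K = np.array([[4.0, 0.0, 4.0],
              [0.0, 4.0, 3.0],
              [0.0, 0.0, 1.0]])                 # toy intrinsics for an 8x6 feature map
points = np.array([[0.0, 0.0, 2.0],             # in front of the camera
                   [0.5, -0.25, 1.0],           # in front of the camera
                   [0.0, 0.0, -1.0]])           # behind the camera, so invalid
sampled, valid = sample_3d_to_2d(feat, points, K)
```

In Way 1 the quality of the result depends on the depth distribution; in Way 2 it depends on where the 3D points are placed, since the image is only queried at their projections.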
BEVFormer
Fuses multi-camera and temporal features using deformable attention.
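The core sampling step of deformable attention can be sketched as follows. This is a heavily simplified, single-head, single-scale illustration and not the BEVFormer implementation: in a real model the sampling offsets and attention logits are predicted from the query by learned layers, whereas here they are passed in directly, and the function names are hypothetical.

```python
import numpy as np

def bilinear_sample(feat, x, y):
    """Sample feat (C, H, W) at continuous pixel coordinates (x, y); zero outside the map."""
    C, H, W = feat.shape
    x0, y0 = int(np.floor(x)), int(np.floor(y))
    out = np.zeros(C)
    for yy, wy in ((y0, 1 - (y - y0)), (y0 + 1, y - y0)):
        for xx, wx in ((x0, 1 - (x - x0)), (x0 + 1, x - x0)):
            if 0 <= xx < W and 0 <= yy < H:
                out += wx * wy * feat[:, yy, xx]
    return out

def deformable_attention(feat, ref_xy, offsets, logits):
    """Attend to a few sampled locations around a reference point.

    feat:    (C, H, W) feature map (an image feature map or a previous BEV map)
    ref_xy:  (2,) reference point in pixel coordinates
    offsets: (P, 2) sampling offsets, assumed already predicted from the query
    logits:  (P,) attention logits, assumed already predicted from the query
    """
    weights = np.exp(logits - logits.max())
    weights /= weights.sum()                      # softmax over the P sampling points
    samples = np.stack([bilinear_sample(feat, *(ref_xy + off)) for off in offsets])
    return weights @ samples                      # (C,) weighted sum of sampled features

# Toy feature map whose two channels store the x and y pixel coordinates.
ys, xs = np.mgrid[0:5, 0:5].astype(float)
feat = np.stack([xs, ys])
ref = np.array([2.0, 2.0])
offsets = np.array([[0.5, 0.0], [-0.5, 0.0], [0.0, 0.5], [0.0, -0.5]])
out = deformable_attention(feat, ref, offsets, np.zeros(4))
```

Because each query only looks at a handful of sampled locations instead of the whole feature map, the cost stays low even with several cameras and past frames.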
PersFormer
Input: Image in perspective view
Output: Lane lines in 3D space
Conventional 2D lane detection
Segmentation
Anchor-based Detection
[1] SCNN, AAAI 2018
[2] LaneAF, RA-L 2021
[3] LaneATT, CVPR 2021
[4] CondLaneNet, CVPR 2021
Problem
Inverse perspective mapping (IPM): the flat-ground assumption does not always hold
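A small numeric sketch of why the flat-ground assumption matters. It assumes a forward-facing pinhole camera with a horizontal optical axis, a camera frame with x right, y down and z forward, and hypothetical values for the intrinsics and camera height. IPM intersects each pixel ray with the plane y = camera height; a lane point on an uphill section violates that assumption and is mapped too far away.

```python
import numpy as np

# Hypothetical intrinsics and mounting height (metres); illustrative values only.
K = np.array([[1000.0, 0.0, 640.0],
              [0.0, 1000.0, 360.0],
              [0.0, 0.0, 1.0]])
CAMERA_HEIGHT = 1.5

def project(point_xyz, K):
    """Project a camera-frame 3D point to pixel coordinates."""
    uvw = K @ point_xyz
    return uvw[:2] / uvw[2]

def ipm_flat_ground(pixel_uv, K, camera_height):
    """Intersect the pixel ray with the flat ground plane y = camera_height."""
    ray = np.linalg.inv(K) @ np.array([pixel_uv[0], pixel_uv[1], 1.0])
    if ray[1] <= 0:
        raise ValueError("pixel is at or above the horizon; the ray never hits the ground")
    return ray * (camera_height / ray[1])

# A lane point 30 m ahead on flat ground, and one 30 m ahead but 0.5 m higher (uphill).
flat_point = np.array([1.0, CAMERA_HEIGHT, 30.0])
uphill_point = np.array([1.0, CAMERA_HEIGHT - 0.5, 30.0])

flat_est = ipm_flat_ground(project(flat_point, K), K, CAMERA_HEIGHT)
uphill_est = ipm_flat_ground(project(uphill_point, K), K, CAMERA_HEIGHT)

print(flat_est[2])    # about 30 m: the flat-ground point is recovered correctly
print(uphill_est[2])  # about 45 m: the uphill point is placed 15 m too far
```

In this setup a 0.5 m rise at 30 m turns into a 15 m range error, and the lateral position is distorted as well.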
OpenLane
BEVFormer & PersFormer
BEV perception has been prevalent in academia since 2022.
3D Occupancy Prediction
What is the problem with current 3D perception representations?
[Figure: panels (a) to (d)]
3D Occupancy Prediction
How does 3D perception evolve into 3D occupancy?
Scene as Occupancy
Challenge Stats