History of Neural Radiance Fields
CS 598 LAZ
Albert Zhai, Yilin Yang, Tianhang Cheng
Popularity of Neural Radiance Field (NeRF)
Popularity of Neural Radiance Field (NeRF)
NeRF: a method for multi-view 3D reconstruction
https://miro.medium.com/v2/resize:fit:1400/0*l233ieenj80ogEOa.png
History (before NeRF)
3D Reconstruction From Multiple Views —— Early Photography and Photosculpture 1850 �
Photosculpture, a mechanical 19th century NeRF
The resulting raw sculpture made of wood slices
Image: https://neuralradiancefields.io/history-of-neural-radiance-fields/
3D Reconstruction From Multiple Views—— Early Photography and Photosculpture 1850 �
3D Reconstruction From Multiple Views —— Triangulation
Figure source: https://towardsai.net/p/machine-learning/introduction-14
3D Reconstruction From Multiple Views —— visual hull
Figure source: JC Perez-Cortes et al. A System for In-Line 3D Inspection without Hidden Surfaces Sensors, 2018
A. Laurentini, The visual hull concept for silhouette-based image understanding TPAMI 1994
Silhouette cone
Silhouette
The intersection is the visual hull.
3D Reconstruction From Multiple Views —— space carving
K. N. Kutulakos and S. M. Seitz, A Theory of Shape by Space Carving, ICCV 1999
Imagine the object is inside a volume….
Algorithm:
Lightfield�
Image: https://medium.com/@dc.aihub/3d-reconstruction-with-stereo-images-part-1-camera-calibration-d86f750a1ade
Maybe we don’t actually need to do the reconstruction…
Lightfield —— The Plenoptic Function�
The 8 dimensional full plenoptic describes light transport as light as waves
Proposed by: Gabriel Lippmann
Image: https://neuralradiancefields.io/history-of-neural-radiance-fields/
Cons:
Lightfield —— The Plenoptic Function�
5D Plenoptic Function
Figure by Leonard McMillan
Lightfield —— Two-plane light fields��
4D light field representation
M. Levoy and P. Hanrahan. Light field rendering. SIGGRAPH 1996
Nonplanar planes
camera plane
Image plane
Lightfield —— Light field Rendering��
M. Levoy and P. Hanrahan. Light field rendering. SIGGRAPH 1996
Camera plane
Image plane
Lightfield —— Light field Rendering��
Novel view rendering
Lightfield —— Light field Rendering��
Lightfield —— Lightfield Camera��
Lightfield Camera
Figure source: M. Levoy https://graphics.stanford.edu/courses/cs178-13/lectures/lightfields-02may13.pdf
Conventional Camera
Lightfield —— Lightfield Camera��
Figure source: M. Levoy https://graphics.stanford.edu/courses/cs178-13/lectures/lightfields-02may13.pdf
Lightfield —— Lightfield Camera��
Figures source: M. Levoy https://graphics.stanford.edu/courses/cs178-13/lectures/lightfields-02may13.pdf
Lightfield —— Lightfield Camera��
What is a Neural Radiance Field (NeRF)?
What is a Neural Radiance Field (NeRF)?
Fitting a NeRF to images via differentiable rendering
3D Scene Representation (NeRF)
Rendering
Fitting a NeRF to images via differentiable rendering
3D Scene Representation (NeRF)
Rendering
Rendering Loss Minimization
Fitting a NeRF to images via differentiable rendering
Mildenhall et al., NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis, ECCV 2020
Volume Rendering
𝛼𝑖 is the probability that there is a particle in segment i
𝑇𝑖 is the probability that there are no blocking particles
Volume Rendering
𝛼𝑖 is the probability that there is a particle in segment i
𝑇𝑖 is the probability that there are no blocking particles
Results: Novel View Synthesis and Depth Estimation
Mildenhall et al., NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis, ECCV 2020
Results: Novel View Synthesis and Depth Estimation
Mildenhall et al., NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis, ECCV 2020
Summary: What is NeRF?
Was NeRF (ECCV ‘20) the first to use these ideas?
Answer: No
Occupancy Networks
(CVPR ‘19)
DeepSDF (CVPR ‘19)
Differentiable Volumetric Rendering (CVPR ‘19)
Neural Implicit Representations
Differentiable Rendering of Implicit Representations
Scene Representation Networks (NeurIPS ‘19)
Mescheder et al., Occupancy networks: Learning 3D reconstruction in function space, CVPR 2019
Park et al., DeepSDF: Learning continuous signed distance functions for shape representation, CVPR 2019
Niemeyer et al., Differentiable volumetric rendering: Learning implicit 3D representations without 3D supervision, CVPR 2019
Sitzmann et al., Scene Representation Networks: Continuous 3D-Structure-Aware Neural Scene Representation, NeurIPS 2019
NeRF is slow
Training: ~1 day
Inference: ~1 minutes
NeRF is ambiguous
Improve Speed and Quality
Different Representation —— SDF
NeRF: MV-images -> density, color
NeuS: MV-images -> SDF -> density, color
SDF gives
shortest distance to surface
Different Representation —— SDF
Different Representation —— Voxel
NeRF: MV-images -> density, color
DVGO: MV-images -> query feature from voxel -> density, color
Different Representation —— Voxel
NeRF: MV-images -> density, color
DVGO: MV-images -> query feature from voxel -> density, color
Different Representation —— Grid
NeRF: MV-images -> density, color
TensoRF: MV-images -> query feature from grid -> density, color
Different Representation —— Grid
NeRF: coordinate -> density, color
TensoRF: MV-images -> query feature from grid -> density, color
Different Representation —— LoD
NeRF: MV-images -> density, color
Neuralangelo: MV-images -> query feature from different level -> density, color
Different Representation —— LoD
Different Representation —— LoD
Different Representation —— LoD
Pretrained Network —— Depth / Normal
NeRF: MV-images -> density, color
MonoSDF: MV-images -> density, color, depth, normal
https://niujinshuchong.github.io/monosdf/
Pretrained Network —— Diffusion model
Single Image + relative pose (R, T) -> Novel view Image
Pretrained Network —— Diffusion model
Expand the application of vanilla NeRF
New application —— Dynamic/deformable NeRF
NeRF: MV-images -> density, color
D-NeRF: MV-video + time -> displacement -> density, color
Learn a time-dependent displacement field
Pumarola et al., D-NeRF: Neural Radiance Fields for Dynamic Scenes, CVPR 2021, https://arxiv.org/pdf/2011.13961.pdf
New application —— Dynamic/deformable NeRF
Learn a time-dependent displacement field
Pumarola et al., D-NeRF: Neural Radiance Fields for Dynamic Scenes, CVPR 2021, https://arxiv.org/pdf/2011.13961.pdf
New application —— Semantics
NeRF: MV-images-> RGBA
Semantics NeRF: MV-semantics -> semantics vector -> semantics
Zhi, S., Laidlow, T., Leutenegger, S., & Davison, A. J. (2021). In-place scene labelling and understanding with implicit scene representation. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 15838-15847).
New application —— Anime Head
NeRF: MV-images-> RGBA
PAniC-3D: Single image -> anime head model
Chen, S., Zhang, K., Shi, Y., Wang, H., Zhu, Y., Song, G., ... & Zwicker, M. (2023). PAniC-3D: Stylized Single-view 3D Reconstruction from Portraits of Anime Characters. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 21068-21077).
New application —— complex texture
NeRF: MV-images -> RGBA
InvRender: MV-images -> diffuse, roughness, metallic
Zhang, Y., Sun, J., He, X., Fu, H., Jia, R., & Zhou, X. (2022). Modeling indirect illumination for inverse rendering. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 18643-18652),
New application —— Edit with Language
Clip-NeRF: Text + MV-images -> edited NeRF
Wang, C., Chai, M., He, M., Chen, D., & Liao, J. (2022). Clip-nerf: Text-and-image driven manipulation of neural radiance fields. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 3835-3844), https://arxiv.org/abs/2112.05139
New application —— Physical Property
NeRF: MV-images -> RGBA
PAC-NeRF: MV-images -> Elasticity, hardness, viscosity, etc.
Li, X., Qiao, Y. L., Chen, P. Y., Jatavallabhula, K. M., Lin, M., Jiang, C., & Gan, C. (2023). PAC-neRF: Physics augmented continuum neural radiance fields for geometry-agnostic system identification. arXiv preprint arXiv:2303.05512.
New application —— Robot Action
Multiview-Video -> state feature -> robot action
Li, Y., Li, S., Sitzmann, V., Agrawal, P., & Torralba, A. (2022, January). 3d neural scene representations for visuomotor control. In Conference on Robot Learning (pp. 112-123). PMLR.
New application —— Hide message
CopyRNeRF: Multiview-Image + message -> NeRF
Luo, Z., Guo, Q., Cheung, K. C., See, S., & Wan, R. (2023). CopyRNeRF: Protecting the CopyRight of Neural Radiance Fields. arXiv preprint arXiv:2307.11526.
Discussion: will NeRF stand the test of time?
Advantages:
Disadvantages:
References