1 of 30

DreamFusion�Text-to-3D using 2D diffusion

POOLE, B., JAIN, A., BARRON, J. T., & MILDENHALL, B. (2022)

2 of 30

Key Contributions

  • A NeRF-based text-to-3d model. Without explicitly using reference images.
  • A novel way of using diffusion models as a loss function for images.

3 of 30

Example generations

4 of 30

Image Diffusion Refresher

Forward process

Reverse process

 

 

 

 

 

 

 

5 of 30

Image Diffusion Refresher

Forward process

Reverse process

 

 

 

 

 

 

 

6 of 30

At each step

 

 

 

Instead of predicting denoised images, diffusion models predict the noise content

7 of 30

At each step

Conditioning

 

 

 

They can be conditioned on things like text, other images, or other embeddings

8 of 30

Imagen

Text Embeddings

 

 

 

Imagen

Saharia, C. et al. (2022) ‘Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding’, arXiv [cs.CV]. Available at: http://arxiv.org/abs/2205.11487.

9 of 30

The DreamFusion Model

10 of 30

The rendering

11 of 30

The rendering

12 of 30

Score Distillation Sampling Loss (SDS)

 

 

13 of 30

Score Distillation Sampling Loss (SDS)

 

“Overhead view/ front view/ side view”

+

 

14 of 30

Score Distillation Sampling Loss (SDS)

 

 

15 of 30

Score Distillation Sampling Loss (SDS)

 

 

16 of 30

Score Distillation Sampling Loss (SDS)

 

 

17 of 30

Results

18 of 30

Results

The two other models were trained using CLIP, so using CLIP here is not the best evaluation metric

19 of 30

Common Failure

Prompt: a DSLR photo of a toy cow

20 of 30

Prolific DreamerHigh-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation

WANG, Z., LU, C., WANG, Y., BAO, F., LI, C., SU, H., & ZHU, J. (2023)

21 of 30

Key Contribution

  • Variational Score Distillation loss to replace Score Distillation Sampling.

22 of 30

Variational Score Distillation

  •  

 

 

 

23 of 30

In practice

Use several NeRF instead of 1. (up to 4 in practice)

24 of 30

In Practice

25 of 30

In Practice

  •  

26 of 30

In Practice

VDS

SDS

 

 

27 of 30

How Does it Compare

28 of 30

How Does it Compare

29 of 30

How Does it Compare

30 of 30

Question time