CSE 5524: �Generative models - 1
1
HW 3
Final project (30%)
Today (32)
4
Recap: Popular CNN “architectures”
5
[Figure credit: A. Torralba, P. Isola, and W. T. Freeman, Foundations of Computer Vision.]
Recognition models vs. generative models
[Figure credit: A. Torralba, P. Isola, and W. T. Freeman, Foundations of Computer Vision.]
Critical difference
Generative models achieve this “ambiguity” by making g a “stochastic function”
Illustration
8
Big-GAN
Diffusion models
Real
[Diffusion Models Beat GANs on Image Synthesis, NeurIPS 2021]
How can we make a neural network stochastic?
Solution: “explicitly” introduce a stochastic input
[Figure credit: A. Torralba, P. Isola, and W. T. Freeman, Foundations of Computer Vision.]
Terminology
[Figure credit: A. Torralba, P. Isola, and W. T. Freeman, Foundations of Computer Vision.]
Unconditional generative models
[Figure credit: A. Torralba, P. Isola, and W. T. Freeman, Foundations of Computer Vision.]
Gray: visible; White: latent
Learning generative model
[Figure credit: A. Torralba, P. Isola, and W. T. Freeman, Foundations of Computer Vision.]
What is the objective?
[Figure credit: A. Torralba, P. Isola, and W. T. Freeman, Foundations of Computer Vision.]
What is the objective?
Direct and indirect approaches
Direct
Indirect
Synthetic data have high probability under a density model fit to real data
Direct and indirect approaches
What will you see today?
Density model
[Figure credit: A. Torralba, P. Isola, and W. T. Freeman, Foundations of Computer Vision.]
Example: Gaussian
Learning density function
Learning density function
Learning density function
Learning density function
Learning density function
Maximum likelihood estimation (MLE)
= Minimum KL divergence
Illustration
[Figure credit: A. Torralba, P. Isola, and W. T. Freeman, Foundations of Computer Vision.]
Density model for images
Autoregressive density model
Illustration
[Figure credit: A. Torralba, P. Isola, and W. T. Freeman, Foundations of Computer Vision.]
Predict the next pixel based on local context!
Softmax for categorical distribution modeling
Training
[Figure credit: A. Torralba, P. Isola, and W. T. Freeman, Foundations of Computer Vision.]
The training process is entirely supervised!
Training
[Figure credit: A. Torralba, P. Isola, and W. T. Freeman, Foundations of Computer Vision.]
Dive into autoregressive model training
Diffusion models
Training
Training
Neural network model
What is the objective?
Generative adversarial networks (GAN)
Generative adversarial net (GAN)
39
Generator
Discriminator
REAL
FAKE
[Credits: Mengdi Fan and Xinyu Zhou, CSE 5539 course presentation]
Example results (by Style-GAN)
40
[A Style-Based Generator Architecture for Generative Adversarial Networks, CVPR 2019]