Stochastic Conditional Diffusion Models�for Robust Semantic Image Synthesis
Juyeon Ko*, Inho Kong*, Dogyun Park, Hyunwoo J. Kim
�Department of Computer Science and Engineering, Korea University
Korea University
MLV Lab
ICML 2024
Korea University
MLV Lab
Semantic Image Synthesis (SIS)
Semantic map y
(Label)
Semantic Image Synthesis with Spatially-Adaptive Normalization, Park et al., CVPR 2019 (Oral)
ICML 2024
Image
approximate
1
1
1
1
1
1
1
1
1
1
1
5
5
5
5
5
5
5
5
5
5
5
5
5
…
…
…
1: sky
2: tree
…
5: grass
…
Semantic Image Synthesis (SIS)
generate
Korea University
MLV Lab
Motivation
Image credit: https://tech.hindustantimes.com/tech/news/this-photoshop-ai-feature-will-change-the-way-you-edit-photos-know-what-is-generative-fill-71686747616850.html
car
water
grass
tree
sky
Photo editing
Content creation
Model
Train
Clean labels
from the dataset
Inference
Noisy labels
from users
gap
ICML 2024
Korea University
MLV Lab
Stochastic Conditional Diffusion Models (SCDM)
ICML 2024
…
…
Stochastic Conditional Diffusion Model (SCDM)
Stochastic conditioning via Label Diffusion
(Noisy)
Label
…
…
Existing Conditional Diffusion Models
(Noisy)
Label
Korea University
MLV Lab
ICML 2024
Label Diffusion
…
…
…
…
clean
label
noisy
label
They get similar!
identical
Korea University
MLV Lab
ICML 2024
Label Diffusion
Korea University
MLV Lab
ICML 2024
Forward process and Generation process
Label Diffusion
continuous
discrete
Label Diffusion
Korea University
MLV Lab
ICML 2024
Class-wise Noise Schedule
Slowly diffuse small/rare classes
Korea University
MLV Lab
ICML 2024
Noisy SIS Benchmark
1. [DS] downsampled semantic maps
2. [Edge] masking the edges of instances
3. [Random] randomly adding unlabeled class to the semantic maps (10%)
[DS]
[Edge]
[Random]
Korea University
MLV Lab
ICML 2024
Experiments - Noisy SIS
Label
OASIS
SAFM
SDM
Ours
LDM
Korea University
MLV Lab
ICML 2024
Analysis – Label Diffusion and Robustness
Label
Ours
Baseline
Clean
DS
Edge
Random
Korea University
MLV Lab
Conclusion
ICML 2024
Korea University
MLV Lab
Thank You
Paper
GitHub
ICML 2024