Text-to-Image Generation with Mamba
Qian Yang, Kanishk Jain
Overview
Reminder of Projects
Updated Timeline
Q&A
Progress Until Now
2
1
3
4
2
Problem Statement Reminder
MUSE: Text-to-Image Generation via Masked Generative Transformers
Mamba: Selective State Space model
VMamba
VMamba
Our Approach: M-Mamba for Image Generation
Mamba: Selective State Space model
Problem Statement Reminder
Stable Diffusion
Our Approach: M-Mamba for Image Generation
a little mamba
Text
Image Dir-1
Text
Image Dir-2
Text
Image Dir-3
Text
Image Dir-4
Text
Image Dir-1
Text
Image Dir-2
Text
Image Dir-3
Text
Image Dir-4
Text Merge
Text
Progress Until Now
12
Loss Curve
Loss Curve for M-Mamba 22 layers
Loss Curve for UViT 22 layers
13
Speed Comparison
14
Model | 22-Layer Time | Single Layer Time | Attention v.s. SSM |
UViT 513M | 0.137 s | 5.94e-3 s | Self-attention: 1.19e-3 Cross-attention: 1.16e-3 |
M-Mamba 448M | 0.217 s | 8.92e-3 s | 2.33e-4 s |
Experiments
Generated by M-Mamba 22 layers at 22k steps
Generated by UViT 22 layers at 22k steps
15
Experiments
Generated by M-Mamba 22 layers at 22k steps
Generated by UViT 22 layers at 22k steps
16
Experiments
1k steps
17k steps
22k steps
17
Experiments
Stages of Learning in Image Generation:
18
Experiments
Stages of Learning in Image Generation:
19
Experiments
Stages of Learning in Image Generation:
20
Experiments
Stages of Learning in Image Generation:
21
Experiments
1k steps
17k steps
22k steps
22
Experiments
1k steps
17k steps
22k steps
23
Experiments
1k steps
17k steps
22k steps
24
Experiments
1k steps
17k steps
22k steps
25
Experiments
1k steps
17k steps
22k steps
26
Is Mamba worth exploring?
27
Is Mamba worth exploring?
28
Is Mamba worth exploring?
29
Is Mamba worth exploring?
30
Experiments
1k steps
17k steps
22k steps
31
Updated Timeline
State Space Model
Mamba’s Contributions
Selection Mechanism
Parallel Scan
Similar to Mamba’s recurrence relation:
Selection Mechanism
Selective Copying
Induction Heads
Parallel Scan
Similar to Mamba’s recurrence relation:
Hardware-Aware Algorithm
Our Approach: M-Mamba for Image Generation
Training Plan
Evaluation Metrics
Evaluation Metrics
Research Objectives
References