ScAi Reading Group
Planning via Diffusion
Xiusi Chen
March 9, 2023
Motivation
Motivation
Motivation
Motivation
Motivation
Motivation
Motivation
Motivation
Planning as generative modeling
A generative model of trajectories
A generative model of trajectories
A generative model of trajectories
Compositionality via local consistency
Variable-length predictions
Non-autoregressive prediction
Training
in which i ∼ U{1,2,...,N} is the diffusion timestep, ε ∼ N(0,I) is the noise target, and τi is the trajectory τ 0 corrupted with noise ε .
From trajectory modeling to planning
Planning
Planning
Offline RL through Value Guidance
Offline RL through Value Guidance
Experiments
Connections with Guided Diffusion
guidance
Connections with Guided Diffusion
guidance
Diffusing over states
Acting with Inverse-Dynamics
Decision Diffuser
Experiments
Experiments
Takeaways
Thank you!
Q & A
Related Readings
Training of Decision Diffuser
Architecture