Replay Buffer with Local Forgetting for Adapting to Local Environment Changes in Deep MBRL
Ali Rahimi-Kalahroudi, Janarthanan Rajendran, Ida Momennejad, Harm van Seijen, Sarath Chandar
Adaptivity is a Key Feature of Model-Based Learning
The Local Change Adaptation (LoCA) Setup
Current Deep Model-Based RL Methods Are Not Adaptive!
Replay Buffer Against Catastrophic Forgetting
Stale Data
Interference-Forgetting Dilemma
Local Forgetting (LoFo) Replay Buffer
LoFo Buffer
Experiments
Dreamer w/ the LoFo Buffer
Dreamer’s Reward Estimates
Limitations and Future Work
Thank you!
Contact Info: ali-rahimi.kalahroudi@mila.quebec