Lecture 9.๏ฟฝ Actor-Critic Design Decisions๏ฟฝ๏ฟฝ
Sookyung Kim๏ฟฝ
1
Taxonomy of RL algorithm
Actor-Critic
2
Value Function Fitting
ย
3
From Evaluation to Actor Critic
4
Actor-critic algorithm (with discount)
5
Actor-critic Design Decisions
6
Architecture Design
7
Online Actor-critic in practice
8
Can we remove the on-policy assumption entirely?
9
Letโs see what that looks like
10
11
Fixing the policy update
Remember
12
13
Some implementation details
DDPG in next chapter
14