A | B | C | D | E | |
---|---|---|---|---|---|
1 | CMSC 818B Fall 2023 | ||||
2 | |||||
3 | Date | Lecture | Primary Reading | Additional Reading | Important Dates |
4 | Tue, Aug 29 | Introduction | |||
5 | Thu, Aug 31 | MDPs | Chapter 3 of Sutton & Barto RL book | Lecture Notes by Jamieson | |
6 | Tue, Sep 5 | MDPs - Value Iteration | Chapter 6.5, 13 of Sutton & Barto RL book | ||
7 | Thu, Sep 7 | Q-Learning | |||
8 | Tue, Sep 12 | Q-Learning, Policy Gradient, Actor Critic | Lilian Weng Notes | ||
9 | Thu, Sep 14 | Q-Learning, Policy Gradient, Actor Critic | |||
10 | Tue, Sep 19 | Q-Learning, Policy Gradient, Actor Critic | List of papers released | ||
11 | Thu, Sep 21 | Learning from human feedback (IRL, LFD, BC) | |||
12 | Tue, Sep 26 | IRL, IL, BC papers | Paper selections due on Tue | ||
13 | Thu, Sep 28 | IRL, IL, BC papers | |||
14 | Tue, Oct 3 | IRL, IL, BC papers | |||
15 | Thu, Oct 5 | Vision+Language+Action papers | Project pitch due | ||
16 | Tue, Oct 10 | Multi-Agent RL | Mini-project 1 due | ||
17 | Thu, Oct 12 | MARL papers | |||
18 | Tue, Oct 17 | MARL papers | |||
19 | Thu, Oct 19 | Multi-Robot Coordination | |||
20 | Tue, Oct 24 | 1. https://arxiv.org/abs/2003.06709 2. https://arxiv.org/abs/2211.09019 3. https://arxiv.org/abs/2210.15185 | |||
21 | Thu, Oct 26 | 1. https://arxiv.org/abs/2305.14992 2. https://arxiv.org/abs/2308.01399 3. https://arxiv.org/abs/2307.12981 4. https://arxiv.org/abs/2303.03378 | |||
22 | Tue, Oct 31 | 1. https://arxiv.org/abs/2307.15818 2. https://arxiv.org/abs/2310.10103 3. https://arxiv.org/abs/2208.02918 4. https://arxiv.org/abs/2309.02721 | |||
23 | Thu, Nov 2 | 1. https://openreview.net/forum?id=N3VbFUpwaa 2. https://arxiv.org/abs/2305.15288 3. https://openreview.net/forum?id=5VCT-DptDTs 4. https://ieeexplore.ieee.org/document/10160864 | |||
24 | Tue, Nov 7 | 1. https://arxiv.org/abs/2204.12568 2. https://openreview.net/forum?id=VscdYkKgwdH 3. https://arxiv.org/abs/2111.06974 4. https://arxiv.org/abs/2306.15724 | |||
25 | Thu, Nov 9 | 1. https://arxiv.org/abs/2309.05665 2. https://arxiv.org/abs/2208.07860 3. https://arxiv.org/abs/2308.16185 4. https://ieeexplore.ieee.org/document/10160725 | |||
26 | Tue, Nov 14 | 1. https://arxiv.org/abs/2209.07793 2. https://arxiv.org/abs/2305.01870 3. https://arxiv.org/abs/2309.05131 4. https://arxiv.org/abs/2211.12181 | |||
27 | Thu, Nov 16 | 1. https://arxiv.org/abs/2206.03004 2. https://arxiv.org/abs/2306.09523 3. https://www.nature.com/articles/s41586-023-06419-4 4. https://arxiv.org/abs/2207.04429 | |||
28 | Tue, Nov 21 | Exam -- no class | Take-home exam | ||
29 | Thu, Nov 23 | Thanksgiving | |||
30 | Tue, Nov 28 | 1. https://arxiv.org/abs/2309.03185 2. https://arxiv.org/abs/2210.06575 | |||
31 | Thu, Nov 30 | 1. https://arxiv.org/abs/2006.16908 2. https://arxiv.org/abs/2301.06864 3. https://rss2023.github.io/rss2023-website/program/papers/104/ | |||
32 | Tue, Dec 5 | ||||
33 | Thu, Dec 7 | Project presentations | |||
34 | |||||
35 | |||||
36 | |||||
37 | |||||
38 | |||||
39 | |||||
40 | |||||
41 | |||||
42 | |||||
43 | |||||
44 | Paper list | ||||
45 | |||||
46 | |||||
47 | |||||
48 | |||||
49 | |||||
50 | |||||
51 | |||||
52 | |||||
53 | |||||
54 | |||||
55 | |||||
56 | |||||
57 | |||||
58 | |||||
59 | |||||
60 | |||||
61 | |||||
62 | |||||
63 | |||||
64 | |||||
65 | |||||
66 | |||||
67 | |||||
68 | |||||
69 | |||||
70 | |||||
71 | |||||
72 | |||||
73 | |||||
74 | |||||
75 | |||||
76 | |||||
77 | |||||
78 | |||||
79 | |||||
80 | |||||
81 | |||||
82 | |||||
83 | |||||
84 | |||||
85 | |||||
86 | |||||
87 | |||||
88 | |||||
89 | |||||
90 | |||||
91 | |||||
92 | |||||
93 | |||||
94 | |||||
95 | |||||
96 | |||||
97 | |||||
98 | |||||
99 | |||||
100 |