| A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | # | Date | Title | Presenters | Instructor Mentor | |||||||||||||
2 | 1 | 3/29/2016 | Intro | YY | ||||||||||||||
3 | 2 | 3/31/2016 | MW / OL with experts | SZ | ||||||||||||||
4 | 3 | 4/5/2016 | Online Convex Optimization | Ellen Feldman, Gautam Goel, Milan Cvitkovic | Yisong | |||||||||||||
5 | 4 | 4/7/2016 | Multi-armed Bandits & UCB1 Algorithm | Hoang Le, Connor Lee, Ritvik Mishra | Hoang | |||||||||||||
6 | 5 | 4/12/2016 | Linear Bandits & Applications | Pengchuan Zheng, Feng Bi, Leiya Ma, Joon Sik Kim | Yisong | |||||||||||||
7 | 6 | 4/14/2016 | Monte Carlo Tree Search, Go | Suraj Nair, Peter Kundzicz, Vansh Kumar, Kevin An | Stephan | |||||||||||||
8 | 7 | 4/19/2016 | Reinforcement Learning, (Atari or Memory Controller) | Timothy Chou, Charlie Tong, Vincent Zhuang | Stephan | |||||||||||||
9 | 8 | 4/21/2016 | Reinforcement Learning via Apprenticeship Learning, Helicopter | Nick Haliday, Audrey Huang, Dryden Bouamalay, Ritwik Anand | Hoang | |||||||||||||
10 | 9 | 4/26/2016 | Imitation Learning | Richard Zhu, Andrew Kang, Dimitar Ho | Hoang | |||||||||||||
11 | 10 | 4/28/2016 | Active Learning for Supervised Learning | Daniel Gu, Matthew Morgan, Keegan Ryan, Matthew Clark | Hoang | |||||||||||||
12 | 11 | 5/3/2016 | Active Learning for Decision Making | Joe Marino, Grant Van Horn, Alvita Tran | Yisong | |||||||||||||
13 | 12 | 5/5/2016 | Crowdsourcing | Sreenivas Appasani, Madhav Mohandas, Ajay Mandlekar | Yisong | |||||||||||||
14 | 13 | 5/10/2016 | Machine Teaching | Justin Leong, Kevin Tang, Zilong Chen, Kaikai Sheng | Yisong | |||||||||||||
15 | 14 | 5/12/2016 | Machine Teaching for Crowdsourcing | Nancy Cao, Andrew Chico, Betsy Fu | Yisong | |||||||||||||
16 | 15 | 5/17/2016 | Modeling Human Decision Making | Zachary Fein, Eric Gorlin, Emily Mazo | Hoang | |||||||||||||
17 | 16 | 5/19/2016 | Combinatorial Action Spaces, Adaptive Routing | Luciana Cendon, Tobias Bischoff, Jiyun Ivy Xiao, Brennan Young | Yisong | |||||||||||||
18 | 17 | 5/24/2016 | Dueling Bandits | Fabian Boemer, Kushal Agarwal, Jialin Song, Aman Agarwal | Yisong | |||||||||||||
19 | 18 | 5/26/2016 | Coactive Learning | Rohan Batra, Avishek Dutta, Nand Kishore, Siddharth Murching | Hoang | |||||||||||||
20 | 19 | 5/31/2016 | Bayesian Optimization | Erya Yu, Danni Ma | Hoang | |||||||||||||
21 | 20 | 6/2/2016 | Off-Policy Evaluation | Miguel Aroca-Ouellette, Akshata Athawale, Mannat Singh | Stephan | |||||||||||||
22 | ||||||||||||||||||
23 | Available topics | References (also see website for more) | ||||||||||||||||
24 | (Dueling Bandits) | The K-armed Dueling Bandits Problem, by Yisong Yue, Josef Broder, Robert Kleinberg, and Thorsten Joachims. Journal of Computer and System Sciences, DOI:10.1016/j.jcss.2011.12.028, 2012. | ||||||||||||||||
25 | (Coactive Learning) | Online Structured Prediction via Coactive Learning, by Pannaga Shivaswamy and Thorsten Joachims. International Conference on Machine Learning, 2012. [journal version] | ||||||||||||||||
26 | (Active Learning for Decision Making) | Near Optimal Bayesian Active Learning for Decision Making, by Shervin Javdani, Yuxin Chen, Amin Karbasi, Andreas Krause, Drew Bagnell, Siddhartha Srinivasa. International Conference on Artificial International and Statistics, 2014. | ||||||||||||||||
27 | (Bayesian Optimization) | Practical Bayesian Optimization of Machine Learning Algorithms, by Jasper Snoek, Hugo Larochelle, and Ryan Adams. Neural Information Processing Systems, 2012. | ||||||||||||||||
28 | (Off-Policy Evaluation) | Exploration Scavenging, by John Langford, Alexander Strehl, and Jenn Wortman Vaughan. International Conference on Machine Learning, 2008. | ||||||||||||||||
29 | (Imitation Learning) | A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning, by Stephane Ross, Geoff Gordon, and Drew Bagnell. International Conference on Artificial Intelligence and Statistics, 2011. | ||||||||||||||||
30 | (Reinforcement Learning, Monte Carlo Tree Search) | A Survey of Monte Carlo Tree Search Methods by Cameron Browne, Edward Powley, Daniel Whitehouse, Simon Lucas, Peter I. Cowling, Philipp Rohlfshagen, Stephen Tavener, Diego Perez, Spyridon Samothrakis and Simon Colton. IEEE Transactions on Computational Intelligence and AI in Games, 4(1), 2012. | ||||||||||||||||
31 | (Crowdsourcing) | Optimistic Knowledge Gradient Policy for Optimal Budget Allocation in Crowdsourcing, by Xi Chen, Qihang Lin, and Denny Zhou. International Conference on Machine Learning, 2013. [appendix][journal version] | ||||||||||||||||
32 | (Machine Teaching) | How Do Humans Teach: On Curriculum Learning and Teaching Dimension, by Faisal Khan, Xiaojin Zhu, and Bilge Mutlu. Neural Information Processing Systems, 2011. | ||||||||||||||||
33 | (Modeling Human Decision Making) | Forgetful Bayes and myopic planning: Human learning and decision-making in a bandit setting, by Shunan Zhang and Angela Yu. Neural Information Processing Systems, 2013. | ||||||||||||||||
34 | (Combinatorial Action Spaces, Adaptive Routing) | Non-Myopic Adaptive Route Planning in Uncertain Congestion Environments, by Siyuan Liu, Yisong Yue, and Ramayya Krishnan. ACM Transactions on Knowledge Discovery and Engineering, DOI 10.1109/TKDE.2015.2411278, 2015. | ||||||||||||||||
35 | (Reinforcement Learning from via Apprenticeship Learning) | |||||||||||||||||
36 | (Active Learning for Supervised Learning) | Importance Weighted Active Learning | ||||||||||||||||
37 | (Machine Teaching for Crowdsourcing) | Near-Optimally Teaching the Crowd to Classify | ||||||||||||||||
38 | ||||||||||||||||||
39 | ||||||||||||||||||
40 | ||||||||||||||||||
41 | ||||||||||||||||||
42 | ||||||||||||||||||
43 | ||||||||||||||||||
44 | ||||||||||||||||||
45 | ||||||||||||||||||
46 | ||||||||||||||||||
47 | ||||||||||||||||||
48 | ||||||||||||||||||
49 | ||||||||||||||||||
50 | ||||||||||||||||||
51 | ||||||||||||||||||
52 | ||||||||||||||||||
53 | ||||||||||||||||||
54 | ||||||||||||||||||
55 | ||||||||||||||||||
56 | ||||||||||||||||||
57 | ||||||||||||||||||
58 | ||||||||||||||||||
59 | ||||||||||||||||||
60 | ||||||||||||||||||
61 | ||||||||||||||||||
62 | ||||||||||||||||||
63 | ||||||||||||||||||
64 | ||||||||||||||||||
65 | ||||||||||||||||||
66 | ||||||||||||||||||
67 | ||||||||||||||||||
68 | ||||||||||||||||||
69 | ||||||||||||||||||
70 | ||||||||||||||||||
71 | ||||||||||||||||||
72 | ||||||||||||||||||
73 | ||||||||||||||||||
74 | ||||||||||||||||||
75 | ||||||||||||||||||
76 | ||||||||||||||||||
77 | ||||||||||||||||||
78 | ||||||||||||||||||
79 | ||||||||||||||||||
80 | ||||||||||||||||||
81 | ||||||||||||||||||
82 | ||||||||||||||||||
83 | ||||||||||||||||||
84 | ||||||||||||||||||
85 | ||||||||||||||||||
86 | ||||||||||||||||||
87 | ||||||||||||||||||
88 | ||||||||||||||||||
89 | ||||||||||||||||||
90 | ||||||||||||||||||
91 | ||||||||||||||||||
92 | ||||||||||||||||||
93 | ||||||||||||||||||
94 | ||||||||||||||||||
95 | ||||||||||||||||||
96 | ||||||||||||||||||
97 | ||||||||||||||||||
98 | ||||||||||||||||||
99 | ||||||||||||||||||
100 |