CS 159 presentation schedule

	A	B	C	D	E
1	#	Date	Title	Presenters	Instructor Mentor
2	1	3/29/2016	Intro	YY
3	2	3/31/2016	MW / OL with experts	SZ
4	3	4/5/2016	Online Convex Optimization	Ellen Feldman, Gautam Goel, Milan Cvitkovic	Yisong
5	4	4/7/2016	Multi-armed Bandits & UCB1 Algorithm	Hoang Le, Connor Lee, Ritvik Mishra	Hoang
6	5	4/12/2016	Linear Bandits & Applications	Pengchuan Zheng, Feng Bi, Leiya Ma, Joon Sik Kim	Yisong
7	6	4/14/2016	Monte Carlo Tree Search, Go	Suraj Nair, Peter Kundzicz, Vansh Kumar, Kevin An	Stephan
8	7	4/19/2016	Reinforcement Learning, (Atari or Memory Controller)	Timothy Chou, Charlie Tong, Vincent Zhuang	Stephan
9	8	4/21/2016	Reinforcement Learning via Apprenticeship Learning, Helicopter	Nick Haliday, Audrey Huang, Dryden Bouamalay, Ritwik Anand	Hoang
10	9	4/26/2016	Imitation Learning	Richard Zhu, Andrew Kang, Dimitar Ho	Hoang
11	10	4/28/2016	Active Learning for Supervised Learning	Daniel Gu, Matthew Morgan, Keegan Ryan, Matthew Clark	Hoang
12	11	5/3/2016	Active Learning for Decision Making	Joe Marino, Grant Van Horn, Alvita Tran	Yisong
13	12	5/5/2016	Crowdsourcing	Sreenivas Appasani, Madhav Mohandas, Ajay Mandlekar	Yisong
14	13	5/10/2016	Machine Teaching	Justin Leong, Kevin Tang, Zilong Chen, Kaikai Sheng	Yisong
15	14	5/12/2016	Machine Teaching for Crowdsourcing	Nancy Cao, Andrew Chico, Betsy Fu	Yisong
16	15	5/17/2016	Modeling Human Decision Making	Zachary Fein, Eric Gorlin, Emily Mazo	Hoang
17	16	5/19/2016	Combinatorial Action Spaces, Adaptive Routing	Luciana Cendon, Tobias Bischoff, Jiyun Ivy Xiao, Brennan Young	Yisong
18	17	5/24/2016	Dueling Bandits	Fabian Boemer, Kushal Agarwal, Jialin Song, Aman Agarwal	Yisong
19	18	5/26/2016	Coactive Learning	Rohan Batra, Avishek Dutta, Nand Kishore, Siddharth Murching	Hoang
20	19	5/31/2016	Bayesian Optimization	Erya Yu, Danni Ma	Hoang
21	20	6/2/2016	Off-Policy Evaluation	Miguel Aroca-Ouellette, Akshata Athawale, Mannat Singh	Stephan
22
23			Available topics	References (also see website for more)
24			(Dueling Bandits)	The K-armed Dueling Bandits Problem, by Yisong Yue, Josef Broder, Robert Kleinberg, and Thorsten Joachims. Journal of Computer and System Sciences, DOI:10.1016/j.jcss.2011.12.028, 2012.
25			(Coactive Learning)	Online Structured Prediction via Coactive Learning, by Pannaga Shivaswamy and Thorsten Joachims. International Conference on Machine Learning, 2012. [journal version]
26			(Active Learning for Decision Making)	Near Optimal Bayesian Active Learning for Decision Making, by Shervin Javdani, Yuxin Chen, Amin Karbasi, Andreas Krause, Drew Bagnell, Siddhartha Srinivasa. International Conference on Artificial International and Statistics, 2014.
27			(Bayesian Optimization)	Practical Bayesian Optimization of Machine Learning Algorithms, by Jasper Snoek, Hugo Larochelle, and Ryan Adams. Neural Information Processing Systems, 2012.
28			(Off-Policy Evaluation)	Exploration Scavenging, by John Langford, Alexander Strehl, and Jenn Wortman Vaughan. International Conference on Machine Learning, 2008.
29			(Imitation Learning)	A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning, by Stephane Ross, Geoff Gordon, and Drew Bagnell. International Conference on Artificial Intelligence and Statistics, 2011.
30			(Reinforcement Learning, Monte Carlo Tree Search)	A Survey of Monte Carlo Tree Search Methods by Cameron Browne, Edward Powley, Daniel Whitehouse, Simon Lucas, Peter I. Cowling, Philipp Rohlfshagen, Stephen Tavener, Diego Perez, Spyridon Samothrakis and Simon Colton. IEEE Transactions on Computational Intelligence and AI in Games, 4(1), 2012.
31			(Crowdsourcing)	Optimistic Knowledge Gradient Policy for Optimal Budget Allocation in Crowdsourcing, by Xi Chen, Qihang Lin, and Denny Zhou. International Conference on Machine Learning, 2013. [appendix][journal version]
32			(Machine Teaching)	How Do Humans Teach: On Curriculum Learning and Teaching Dimension, by Faisal Khan, Xiaojin Zhu, and Bilge Mutlu. Neural Information Processing Systems, 2011.
33			(Modeling Human Decision Making)	Forgetful Bayes and myopic planning: Human learning and decision-making in a bandit setting, by Shunan Zhang and Angela Yu. Neural Information Processing Systems, 2013.
34			(Combinatorial Action Spaces, Adaptive Routing)	Non-Myopic Adaptive Route Planning in Uncertain Congestion Environments, by Siyuan Liu, Yisong Yue, and Ramayya Krishnan. ACM Transactions on Knowledge Discovery and Engineering, DOI 10.1109/TKDE.2015.2411278, 2015.
35			(Reinforcement Learning from via Apprenticeship Learning)
36			(Active Learning for Supervised Learning)	Importance Weighted Active Learning
37			(Machine Teaching for Crowdsourcing)	Near-Optimally Teaching the Crowd to Classify
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100