CS 159 presentation schedule
 Share
The version of the browser you are using is no longer supported. Please upgrade to a supported browser.Dismiss

 
$
%
123
 
 
 
 
 
 
 
 
 
ABCDEFGHIJKLMNOPQR
1
#DateTitlePresentersInstructor Mentor
2
13/29/2016IntroYY
3
23/31/2016MW / OL with expertsSZ
4
34/5/2016Online Convex OptimizationEllen Feldman, Gautam Goel, Milan CvitkovicYisong
5
44/7/2016Multi-armed Bandits & UCB1 AlgorithmHoang Le, Connor Lee, Ritvik MishraHoang
6
54/12/2016Linear Bandits & ApplicationsPengchuan Zheng, Feng Bi, Leiya Ma, Joon Sik KimYisong
7
64/14/2016Monte Carlo Tree Search, GoSuraj Nair, Peter Kundzicz, Vansh Kumar, Kevin AnStephan
8
74/19/2016
Reinforcement Learning, (Atari or Memory Controller)
Timothy Chou, Charlie Tong, Vincent ZhuangStephan
9
84/21/2016
Reinforcement Learning via Apprenticeship Learning, Helicopter
Nick Haliday, Audrey Huang, Dryden Bouamalay, Ritwik Anand
Hoang
10
94/26/2016Imitation LearningRichard Zhu, Andrew Kang, Dimitar HoHoang
11
104/28/2016Active Learning for Supervised LearningDaniel Gu, Matthew Morgan, Keegan Ryan, Matthew ClarkHoang
12
115/3/2016Active Learning for Decision MakingJoe Marino, Grant Van Horn, Alvita TranYisong
13
125/5/2016CrowdsourcingSreenivas Appasani, Madhav Mohandas, Ajay MandlekarYisong
14
135/10/2016Machine TeachingJustin Leong, Kevin Tang, Zilong Chen, Kaikai ShengYisong
15
145/12/2016Machine Teaching for CrowdsourcingNancy Cao, Andrew Chico, Betsy FuYisong
16
155/17/2016Modeling Human Decision MakingZachary Fein, Eric Gorlin, Emily MazoHoang
17
165/19/2016Combinatorial Action Spaces, Adaptive Routing
Luciana Cendon, Tobias Bischoff, Jiyun Ivy Xiao, Brennan Young
Yisong
18
175/24/2016Dueling BanditsFabian Boemer, Kushal Agarwal, Jialin Song, Aman AgarwalYisong
19
185/26/2016Coactive Learning
Rohan Batra, Avishek Dutta, Nand Kishore, Siddharth Murching
Hoang
20
195/31/2016Bayesian OptimizationErya Yu, Danni MaHoang
21
206/2/2016Off-Policy EvaluationMiguel Aroca-Ouellette, Akshata Athawale, Mannat SinghStephan
22
23
Available topicsReferences (also see website for more)
24
(Dueling Bandits)
The K-armed Dueling Bandits Problem, by Yisong Yue, Josef Broder, Robert Kleinberg, and Thorsten Joachims. Journal of Computer and System Sciences, DOI:10.1016/j.jcss.2011.12.028, 2012.
25
(Coactive Learning)
Online Structured Prediction via Coactive Learning, by Pannaga Shivaswamy and Thorsten Joachims. International Conference on Machine Learning, 2012. [journal version]
26
(Active Learning for Decision Making)
Near Optimal Bayesian Active Learning for Decision Making, by Shervin Javdani, Yuxin Chen, Amin Karbasi, Andreas Krause, Drew Bagnell, Siddhartha Srinivasa. International Conference on Artificial International and Statistics, 2014.
27
(Bayesian Optimization)
Practical Bayesian Optimization of Machine Learning Algorithms, by Jasper Snoek, Hugo Larochelle, and Ryan Adams. Neural Information Processing Systems, 2012.
28
(Off-Policy Evaluation)
Exploration Scavenging, by John Langford, Alexander Strehl, and Jenn Wortman Vaughan. International Conference on Machine Learning, 2008.
29
(Imitation Learning)
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning, by Stephane Ross, Geoff Gordon, and Drew Bagnell. International Conference on Artificial Intelligence and Statistics, 2011.
30
(Reinforcement Learning, Monte Carlo Tree Search)
A Survey of Monte Carlo Tree Search Methods by Cameron Browne, Edward Powley, Daniel Whitehouse, Simon Lucas, Peter I. Cowling, Philipp Rohlfshagen, Stephen Tavener, Diego Perez, Spyridon Samothrakis and Simon Colton. IEEE Transactions on Computational Intelligence and AI in Games, 4(1), 2012.
31
(Crowdsourcing)
Optimistic Knowledge Gradient Policy for Optimal Budget Allocation in Crowdsourcing, by Xi Chen, Qihang Lin, and Denny Zhou. International Conference on Machine Learning, 2013. [appendix][journal version]
32
(Machine Teaching)
How Do Humans Teach: On Curriculum Learning and Teaching Dimension, by Faisal Khan, Xiaojin Zhu, and Bilge Mutlu. Neural Information Processing Systems, 2011.
33
(Modeling Human Decision Making)
Forgetful Bayes and myopic planning: Human learning and decision-making in a bandit setting, by Shunan Zhang and Angela Yu. Neural Information Processing Systems, 2013.
34
(Combinatorial Action Spaces, Adaptive Routing)
Non-Myopic Adaptive Route Planning in Uncertain Congestion Environments, by Siyuan Liu, Yisong Yue, and Ramayya Krishnan. ACM Transactions on Knowledge Discovery and Engineering, DOI 10.1109/TKDE.2015.2411278, 2015.
35
(Reinforcement Learning from via Apprenticeship Learning)
36
(Active Learning for Supervised Learning)Importance Weighted Active Learning
37
(Machine Teaching for Crowdsourcing)Near-Optimally Teaching the Crowd to Classify
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
Loading...
Main menu