Published using Google Docs
EECS 598-006 Course Schedule
Updated automatically every 5 minutes

Date

Schedule of Topics

Links + Notes

9/4/2013

Introduction

Learning theory discussion

Prediction with expert advice

The halving algorithm

Example: halving on permutations for “predictive sorting”

The Multiplicative Weights Update Method by Arora, Hazan and Kale

Notes from Peter Bartlett’s Course

9/9/2013

Review of halving algorithm on permutations

The weighted majority algorithm

Proving a mistake bound on Weighted Majority

Scribe notes posted from Lecture 1

An open problem I posed regarding “learning rankings” from COLT 2010

Apparently a solution to this open problem by Elad Hazan, Satyen Kale, and Shai Shalev-Shwartz

Homework to be released soon! It will be due 9/25/2013.

9/11/2013

WMA generalized: Exponential Weights Algorithm

The “Hedge Setting”

Homework 1 is available, it’s due 9/25/2013.

Notes from Yoav Freund’s course

9/16/2013

More the the action setting vs prediction setting

A new “additive” bound

Variants of weighted majority (different priors, fixed share forecaster, changing learning rates)

Hannan-consistency and the no-regret property

Orig “Hedge Setting” introduced by Freund and Schapire’s original Adaboost paper

I showed an “additive bound” and a “multiplicative bound” in class on 9/16. These can be found in the PLG book, Theorems 2.2 and 2.4.

I needed an inequality in class (regarding log E[exp(sX)]) which I didn’t prove, but it’s pretty classical and you can find the result in Lemma A.1 in the PLG book.

9/18/2013

Fixed-share forecaster

This result is actually reasonably well-written up in the PLG book, section 5.2. Theorem 5.2 proves the bound on the hyperexperts. Theorem 5.1 makes the connection between the complex algorithm making hyperexpert weight updates, and the efficient version.

9/23/2013

Lower bounds for 2 and N experts

Game Theory I

Two-player games

The lower bound I proved in class, with the important details, can roughly be found in PLG section 3.7. The key lemmas are in Appendix A.1.6.

9/25/2013

Game Theory II

Nash’s Theorem via Brouwer Fixed-point Theorem

Guest Lecturer: Satyen Kale

Zero-sum games

von Neumann Minimax Theorem

Here are the notes proving Nash’s Theorem, which includes the proof of Sperner’s Lemma that can be used to establish the Brouwer Fixed-point Theorem.

9/30/2013

Game Theory III

Proof of Minimax Thm using Hedge Alg

Computing approximate equilibria

More on this trick can be found in the Arora/Hazan/Kale survey, where I first saw this technique. There’s a more general version proven in PLG section 7.2.

10/2/2013

Introduction to Boosting

Boost by Majority

Boosting as sequential duality optimization

There a new Boosting book out by Rob Schapire and Yoav Freund which is excellent. The relationship between online learning, the minimax theorem, and boosting, is in Chapter 6.

10/7/2013

More on Boosting

The Perceptron Algorithm

There are a number of good writeups of the perceptron regret bound. Here is one.

10/9/2013

Recap for perceptron algorithm

Log loss and gambling

Constant Rebalanced Portfolios

Universal portfolio strategies

Cover’s UCRP algorithm and analysis

Tom Cover’s original paper on Universal Portfolios

The source of the easier analysis of the UCRP algorithm, with discussions of transaction costs

10/14/2013

FALL BREAK - NO CLASS!!!!!

10/16/2013

Guest Lecturer: Ambuj Tewari

Introduction to Online Convex Optimization

Be The Leader etc.

10/21/2013

Review of Universal Portfolios

Review of Online Convex Optimization model

Examples applications of OCO

Online Gradient Descent algorithm

Original OCO + gradient descent paper by Zinkevich

Several good survey papers on OCO now, see links on course website

10/23/2013

Regret bound for Online Gradient Descent

Guest Lecturer: Rafael Frongillo

Game-theoretic Probability in Finance

Lecture based on ideas from this book

10/28/2013

More on algorithms for OCO

Follow the Leader

Application: Online Density Estimation

Earlly work by Azoury and Warmuth on density estimation

General results for logarithmic regret algorithms by Hazan, Agarwal, and Kale

10/30/2013

Follow the Regularized Leader (FTRL)

Generic regret bound for FTRL

Guest visit from CRLT by Tershia

11/4/2013

Online Convex Optimization III

OCO Application 1: Convex Optimization

OCO Application 2: risk bound in IID setting

The “online to batch conversion” idea started with this nips paper from 2004

11/6/2013

Student HW2 presentations

The multi-armed bandit setting

Greedy algorithm

Nearly all results from the next couple of lectures can be found in Bubeck and Cesa-Bianchi’s survey of regret analysis for bandit problems

11/11/2013

The UCB Algorithm

The adversarial bandit setting

Definition of EXP3 Algorithm

Peter Auer was the first to propose the UCB algorithm for a finite-time analysis of the multi-armed bandit setting

11/13/2013

EXP3 Regret Bound and Proof

Online linear optimization in the bandit setting

Bandit slides from class presentation

11/18/2013

Bandit OLO and the use of self-concordant regularization

Introduction to Blackwell Approachability

My 2011 COLT paper does a reasonably good job describing the Approachability setting

11/20/2013

Blackwell Approachability Algorithm and Convergence

Reductions between Approachability and No-Regret

The COLT paper describes the reduction using cone duality, but there has been some recent work that showed how to avoid the use of cones and instead uses an OCO reduction. But the paper doesn’t appear to be available yet, I’ll contact the authors for more.

11/25/2013

OFFICE HOUR - NO CLASS

11/27/2013

PRE-THANKSGIVING - NO CLASS

12/2/2013

Approachability, Calibrated Forecasting, and No Internal Regret

12/4/2013

Reductions between learning problems

Correlated Equilibria

12/9/2013

STUDENT PRESENTATIONS

12/11/2013

POSTER SESSION