Advanced Topics:

Algorithmic game Theory and Machine Learning

Tentative Schedule:

Lecture 1: Oct 31, 2011

Introduction. Regret Minimization, Online Convex Optimization. Regularized Follow The Leader Prime- Dual using Berman Divergence

source: Elad Hazan, The convex optimization approach to regret minimization (sections 1.2 and 1.3)

Lecture 2: Nov 5, 2011

Stochastic sub-gradient decent

Logarithmic regret for strict convex function

sources:

Boyd, Xiao and Mutapcic, Subgradient Methods also here

Shai Shalev Shwatz, Class notes, lecture 6

Lecture 3.

Regret in Large Action spaces.

sources:

A. Kalai and S. Vempala, Efficient algorithms for online decision problems

S. Kakade, A. Kalai and K. Ligett, Playing Games with Approximation Algorithms

Lecture Notes

Lecture 4.

Multi-Arm Bandit,

Adversarial Model,

P. Auer, N. Cesa-Bianchi, Y. Freund and R. E. Schapire, The Nonstochastic Multi-Armed Bandit Problem (Section 3 and 7)

Lecture 5.

Lower Bound

Multi-Arm Bandit & Supervised learning (using KL-divergence)

P. Auer, N. Cesa-Bianchi, Y. Freund and R. E. Schapire, The Nonstochastic Multi-Armed Bandit Problem (section 5)

Lecture 6. Dec. 5, 2011

MAB Gradient Descent

Abraham Flaxman, Adam Tauman Kalai, H. Brendan McMahan Online convex optimization in the bandit setting: gradient descent without a gradient

Oblivious versus Adaptive adversary

Varsha Dan and Thomas P. Hayes, How to Beat the Adaptive Multi-Armed Bandit(section 8)

Lecture 7. Dec. 12, 2011

Bayesian approach to MAB: Gittins index

J. Gittins, K. Glazerbrook and R. Weber, Multi-Arm Bandit Allocation Indices

John N. Tsitsiklis, A short proof of the Gittins index theorem

Lecture 8. Dec. 19, 2011

Information Cascading [sources]

Lecture 9. Dec. 26, 2011

Sponsored search optimization.

Nikhil Devanur Online Algorithms with Stochastic Input

Nikhil Devanur and Tom Hayes The Adwords Problem: Online Keyword Matching with Budgeted Bidders under Random Permutations

Lecture 10. Jan. 2, 2012

Market Equilibrium and Convex Programming

Combinatorial Algorithms for Market Equilibria, Vijai Vazirani, Chapter 5 sections 5.2 and 5.11 in Algorithmic Game Theory, editors: Nisan, Roughgarden, Tardos, and Vazirani.

A Polynomial Time Algorithm for Computing an Arrow–Debreu Market Equilibrium for Linear Utilities, Kamal Jain (Sections 3 and 5)

Lecture 11, Jan 9, 2012

Indivisible items and market equilibrium

Avinatan Hassidim, Haim Kaplan, Yishay Mansour and Noam Nisan, Non-Price Equilibria in Markets of Discrete Goods

Lecture 12, Jan 16, 2012

Uriel Feige, On Maximizing Welfare when Utility Functions are Subadditive

Lecture 13, Jan 23, 2012

Regret Minimization of Global Cost functions

Lecture 14, Jan 30, 2012

Nash Equilibrium and communication complexity

Home Work: