Learning - Open Problems
A systematic understanding of the online updates.
TD(λ) - understand better the tradeoff in λ.
Sample complexity bounds for TD and MC.
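To make the λ tradeoff concrete, here is a minimal sketch of tabular TD(λ) with accumulating eligibility traces. It assumes a small random-walk chain that is not part of the slides; the function names, step size, and episode count are all illustrative choices. With λ = 0 each update bootstraps from the next state's estimate (TD(0)); as λ approaches 1 the updates behave more like Monte Carlo returns.

```python
import random


def td_lambda(lam, alpha=0.1, gamma=1.0, n_states=5, episodes=200, seed=0):
    """Tabular TD(lambda) on a hypothetical random-walk chain.

    Assumed environment: states 0..n_states-1, start in the middle,
    step left or right with equal probability, reward 1 only when
    leaving the chain on the right, 0 otherwise.
    """
    rng = random.Random(seed)
    values = [0.0] * n_states              # value estimate per state
    for _ in range(episodes):
        traces = [0.0] * n_states          # eligibility traces, reset per episode
        state = n_states // 2
        while True:
            next_state = state + rng.choice((-1, 1))
            done = next_state < 0 or next_state >= n_states
            reward = 1.0 if next_state >= n_states else 0.0
            next_value = 0.0 if done else values[next_state]
            delta = reward + gamma * next_value - values[state]  # TD error
            # Decay all traces, then mark the state just visited.
            traces = [gamma * lam * e for e in traces]
            traces[state] += 1.0
            # Online update: every state moves in proportion to its trace.
            for s in range(n_states):
                values[s] += alpha * delta * traces[s]
            if done:
                break
            state = next_state
    return values


def rms_error(values):
    """RMS distance to the true values (s + 1) / (n + 1) of this chain."""
    n = len(values)
    true_values = [(s + 1) / (n + 1) for s in range(n)]
    return (sum((v - t) ** 2 for v, t in zip(values, true_values)) / n) ** 0.5


if __name__ == "__main__":
    for lam in (0.0, 0.5, 0.9, 1.0):
        print(f"lambda={lam:.1f}  rms_error={rms_error(td_lambda(lam)):.3f}")
```

Running the script prints the error for a few values of λ under one fixed seed. Which λ comes out best depends on the step size, the number of episodes, and the chain length, and that sensitivity is the tradeoff the open problem asks about.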