Contrast with Supervised Learning
Supervised Learning:
Fixed distribution on examples.
Reinforcement Learning:
The distribution is policy dependent!!!
A small local change in the policy can make a huge global change in the return.
Previous slide
Next slide
Back to first slide
View graphic version