Contrast with Supervised Learning
The system has a “state”.
The algorithm has a great influence on the state.
Tradeoff lower immediate reward for higher future rewards.
GOAL oriented: present versus future rewards
Previous slide
Next slide
Back to first slide
View graphic version