Learning Algorithms
Given access to the environment only through actions, perform:
1. Policy evaluation.
2. Control: find an optimal policy.
Two approaches:
1. Model-based (Dynamic Programming).
2. Model-free (Temporal Differences and Monte Carlo).
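
As an illustration of the model-free approach applied to policy evaluation, here is a minimal TD(0) sketch. It is not taken from the slides: the 4-state chain environment, the function names `step` and `td0_evaluate`, the always-move-right policy, and the values of `alpha` and `gamma` are all assumptions chosen to keep the example small and deterministic.

```python
def step(state, action):
    """Hypothetical environment: a 4-state chain (0..3), state 3 is terminal.

    The learner never reads this function's internals; it only observes
    the (next_state, reward, done) result of taking an action.
    """
    next_state = min(state + action, 3)
    reward = 1.0 if next_state == 3 else 0.0
    return next_state, reward, next_state == 3


def td0_evaluate(policy, episodes=200, alpha=0.5, gamma=0.9):
    """Estimate the state values of `policy` with TD(0) updates."""
    values = [0.0] * 4
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            next_state, reward, done = step(state, policy(state))
            # Bootstrapped target: observed reward plus discounted estimate
            # of the next state (zero if the episode has ended).
            target = reward + (0.0 if done else gamma * values[next_state])
            values[state] += alpha * (target - values[state])
            state = next_state
    return values


# Assumed policy for illustration: always move one step to the right.
values = td0_evaluate(lambda s: 1)
```

Under these assumptions the estimates approach 0.81, 0.9 and 1.0 for states 0, 1 and 2. A model-based method would instead first estimate the transition and reward functions from the same samples and then apply dynamic programming to the estimated model.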