MDP model - Episodic tasks
Terminal state
Under any policy, a terminal state is reached with probability 1.
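
As a rough illustration of the statement above, the sketch below tracks how much probability mass sits in the terminal state after a given number of steps. The 3-state transition matrix `P`, the index `TERMINAL`, and the helper `terminal_probability` are all hypothetical and are not taken from the slides. `P` stands for the Markov chain induced by one fixed policy, with the terminal state made absorbing.

```python
# Hypothetical chain induced by one fixed policy (assumed for illustration).
# Rows are current states, columns are next states; state 2 is terminal
# and absorbing (it transitions to itself with probability 1).
P = [
    [0.50, 0.50, 0.00],
    [0.25, 0.25, 0.50],
    [0.00, 0.00, 1.00],
]
TERMINAL = 2


def terminal_probability(P, start, terminal, steps):
    """Return the probability of being in `terminal` after `steps` transitions."""
    n = len(P)
    # Start with all probability mass on the start state.
    dist = [0.0] * n
    dist[start] = 1.0
    for _ in range(steps):
        # One step of the chain: new_dist[t] = sum_s dist[s] * P[s][t].
        dist = [sum(dist[s] * P[s][t] for s in range(n)) for t in range(n)]
    return dist[terminal]


if __name__ == "__main__":
    for steps in (0, 2, 10, 50):
        print(steps, terminal_probability(P, 0, TERMINAL, steps))
```

In this example the probability of having terminated grows with the number of steps and approaches 1, which is what the episodic assumption requires. The assumption itself is stronger: it asks for this behaviour under every policy, not just the single chain shown here.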