MDP model - return functions
Finite Horizon - parameter H
Infinite Horizon
discounted - parameter gə.
undiscounted
Episodic:
Total reward
Previous slide
Next slide
Back to first slide
View graphic version