Algorithms - Policy Evaluation Example

g = 1/2

d(si,a)= si+a

p random

Previous slide Next slide Back to first slide View graphic version