Using the same example we calculate the
optimal return value to be:
If we examine different values of the discount factor, we get different
optimal actions in S1.
Note that as the discount factor increases, the optimal policy at S1
changes from a12 to a11.
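
The dependence of the optimal action at S1 on the discount factor can be illustrated with a small value-iteration sketch. The rewards and transition probabilities below are assumptions chosen for illustration (they are not taken from the example in these notes), and the names MDP, q_value and value_iteration are hypothetical helpers. In this assumed model, a12 pays more immediately but moves to a state with negative reward, so it should be preferred only for small discount factors.

```python
# Hypothetical two-state MDP; all numbers are assumptions for illustration.
# MDP[state][action] = (immediate_reward, {next_state: probability})
MDP = {
    "S1": {
        "a11": (5.0, {"S1": 0.5, "S2": 0.5}),
        "a12": (10.0, {"S2": 1.0}),
    },
    "S2": {
        "a21": (-1.0, {"S2": 1.0}),
    },
}


def q_value(mdp, values, state, action, discount):
    """One-step lookahead: immediate reward plus discounted expected value."""
    reward, next_states = mdp[state][action]
    return reward + discount * sum(p * values[s] for s, p in next_states.items())


def value_iteration(mdp, discount, tol=1e-10, max_iter=100_000):
    """Iterate the Bellman optimality operator until successive iterates differ by less than tol."""
    values = {s: 0.0 for s in mdp}
    for _ in range(max_iter):
        new_values = {
            s: max(q_value(mdp, values, s, a, discount) for a in mdp[s])
            for s in mdp
        }
        delta = max(abs(new_values[s] - values[s]) for s in mdp)
        values = new_values
        if delta < tol:
            break
    # Greedy policy with respect to the (approximately) converged values.
    policy = {
        s: max(mdp[s], key=lambda a: q_value(mdp, values, s, a, discount))
        for s in mdp
    }
    return values, policy


if __name__ == "__main__":
    for discount in (0.5, 0.8, 0.9, 0.95):
        values, policy = value_iteration(MDP, discount)
        print(discount, policy["S1"], round(values["S1"], 3))
```

For these assumed numbers, comparing the two stationary policies in closed form gives a switch from a12 to a11 at a discount factor of 10/11; a different model would give a different threshold.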