Up: Large State Space
Previous: Evaluation Of Approximate Policy
We will make the following assumption.
Approximate Value Iteration
This implies that using L operator on the inequality
We have also the next inequality
Using both inequalities
Thus for each k we have
If we look at
Although calculations are much simpler than in PI. The method is less natural.