Large scale MDP - Restricted Value Function

Use a limited class of functions to estimate the value function.

estimate the optimal policy.