Policy and Decision Rules

**Figure:** model diagram (source file: model.ps)

A decision rule may need memory of the whole history to determine the most profitable course of action, or it may need only the current state. It may also be deterministic (selecting a single operation) or stochastic (selecting a distribution over a set of operations). We will define the following set of rules:

MD - a deterministic Markovian rule (having no memory):

$d_{t}: S \rightarrow A_{s}$, $d_{t}(s) \in A_{s}$
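
As a rough illustration of the MD definition above, a deterministic Markovian rule can be sketched as a lookup table from states to actions, checked so that $d_{t}(s)\in A_{s}$ holds for every state. The helper `make_md_rule`, the state names, and the action names are hypothetical and chosen only for this example; they do not come from the text.

```python
from typing import Callable, Dict, Hashable, Set

State = Hashable
Action = Hashable


def make_md_rule(
    table: Dict[State, Action],
    admissible: Dict[State, Set[Action]],
) -> Callable[[State], Action]:
    """Build a deterministic Markovian decision rule d_t from a lookup table.

    `admissible[s]` plays the role of A_s, the set of actions allowed in state s.
    The rule depends only on the current state, not on any history.
    """
    for s, actions in admissible.items():
        if s not in table:
            raise ValueError(f"no action chosen for state {s!r}")
        if table[s] not in actions:
            raise ValueError(f"action {table[s]!r} is not in A_s for state {s!r}")

    def d_t(s: State) -> Action:
        # Deterministic: exactly one action is returned for each state.
        return table[s]

    return d_t


# Hypothetical two-state example (names are assumptions for illustration only).
admissible = {
    "low": {"wait", "recharge"},
    "high": {"wait", "search"},
}
d_t = make_md_rule({"low": "recharge", "high": "search"}, admissible)
```

Calling `d_t("low")` returns the single action selected for that state. The validation step rejects a table that picks an action outside $A_{s}$, which is the only constraint the definition imposes.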