Usually the discussion concerns a sequence of t steps, and the quantity
of interest is the probability of moving from state s to state j
after t steps.
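As a minimal sketch of this idea, assume the process is a time-homogeneous Markov chain under a fixed policy, so that a single one-step matrix P applies at every step. Under that assumption the t-step probabilities are the entries of the t-th power of P. The matrix values and the helper names `mat_mul` and `t_step_matrix` below are hypothetical, chosen only for illustration.

```python
def mat_mul(a, b):
    """Multiply two matrices given as lists of rows."""
    inner = len(b)
    cols = len(b[0])
    return [
        [sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
        for row in a
    ]


def t_step_matrix(P, t):
    """Return P raised to the power t (assumes P is the same at every step)."""
    n = len(P)
    # Start from the identity matrix: after 0 steps the chain is still in state s.
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for _ in range(t):
        result = mat_mul(result, P)
    return result


# Hypothetical two-state chain; each row is a probability distribution.
P = [
    [0.9, 0.1],
    [0.5, 0.5],
]

P2 = t_step_matrix(P, 2)
# P2[s][j] is the probability of being in state j two steps after starting in s.
print(P2[0][1])
```

Here `P2[0][1]` sums over both intermediate states: 0.9 * 0.1 + 0.1 * 0.5. If the policy or the transition probabilities change from step to step, the product would instead run over a different matrix at each step.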
Transition Probability Matrix
The transition matrix after t steps, starting
from state s, using policy