Mathematical Model - MDP
Markov decision processes
S- set of states
A- set of actions
d - Transition probability
R - Reward function
Similar to DFA!
Previous slide
Next slide
Back to first slide
View graphic version