POMDP - Belief State Algorithm
Given a history of actions and observations,
we compute a posterior distribution over the state
we are in; this distribution is the belief state.
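The belief update itself is the standard Bayes-filter step: predict the next-state distribution under the chosen action, then reweight by the likelihood of the received observation. A minimal sketch for a discrete POMDP, assuming transition and observation probabilities stored as NumPy arrays (the array layout and the function name are illustrative, not from the source):

```python
import numpy as np

def belief_update(b, a, o, T, O):
    """Bayes-filter belief update for a discrete POMDP (illustrative sketch).

    b: current belief over states, shape (S,)
    a: action index
    o: observation index
    T: transition model, shape (A, S, S), T[a, s, s2] = P(s2 | s, a)
    O: observation model, shape (A, S, O), O[a, s2, o] = P(o | s2, a)
    """
    predicted = b @ T[a]                 # predicted next-state distribution P(s2 | b, a)
    unnormalized = O[a][:, o] * predicted  # reweight by observation likelihood
    return unnormalized / unnormalized.sum()  # renormalize to a distribution
```

For example, with a two-state POMDP, a "listen" action that leaves the state unchanged, and an observation that is correct 85% of the time, a uniform belief sharpens to (0.85, 0.15) after one matching observation.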
States: distributions over S (the states of the POMDP).
Actions: as in the POMDP.
Transitions: the belief update, i.e., the posterior distribution over the next state given the action and observation.
We can then perform planning and learning on the belief-state MDP.
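As a small illustration of acting on the belief-state MDP, one can evaluate actions directly against the belief, e.g. a one-step greedy rule that maximizes expected immediate reward under the current belief (full planning would also account for future beliefs; the reward layout and function name here are assumptions for the sketch):

```python
import numpy as np

def greedy_action(b, R):
    """One-step greedy rule on the belief-state MDP (illustrative sketch).

    b: belief over states, shape (S,)
    R: reward model, shape (A, S), R[a, s] = r(s, a)

    Returns the action maximizing the expected immediate reward
    sum_s b(s) * r(s, a).
    """
    expected_rewards = R @ b   # one expected reward per action
    return int(np.argmax(expected_rewards))
```

This is only the myopic special case; proper POMDP planning (e.g. value iteration over beliefs) trades off immediate reward against information gathered for future decisions.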