- at time \(t\), agent received observation \(o_{t}\)
- agent then chooses \(a_{t}\) based on what it knows through some kind of process in order to affect a possibly nondeterministic change on the environment
- the agent choose an \(a_{t}\) under the existance of many types of Uncertainty.