Glossary#
- Trajectory#
- : A sequence of states and actions.#
- Environment#
- : The world that the agent interact with.#
- Policy#
- : A policy is a mapping from states to probabilities of each possible action.#
- Value Function#
- : The value function of a state \(s\) under a policy \(\pi\) is the expected return when starting in \(s\) and following \(\pi\) thereafter.#