Go back to the [[AI Glossary]]
#rl
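In reinforcement learning, the Bellman equation is the following identity satisfied by the optimal Q-function $Q^*$:

$$Q^*(s, a) = \mathbb{E}_{s'}\left[ r + \gamma \max_{a'} Q^*(s', a') \right]$$

Here $s'$ is the state reached after taking action $a$ in state $s$, $r$ is the reward received for that transition, and $\gamma \in [0, 1)$ is the discount factor.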
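Q-learning turns this identity into an iterative update rule. After observing a transition $(s, a, r, s')$, it moves the current estimate $Q(s, a)$ toward the sampled right-hand side of the identity, with learning rate $\alpha \in (0, 1]$ and discount factor $\gamma$:

$$Q(s, a) \leftarrow Q(s, a) + \alpha \left[ r + \gamma \max_{a'} Q(s', a') - Q(s, a) \right]$$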
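As a rough illustration, a tabular version of the Q-learning update might look like the sketch below. The two-state MDP in `TRANSITIONS`, the function name `q_learning`, the epsilon-greedy exploration, and all hyperparameter values are assumptions chosen for the example, not part of the definition above.

```python
import random

# Hypothetical 2-state, 2-action deterministic MDP, for illustration only.
# TRANSITIONS[state][action] = (next_state, reward)
TRANSITIONS = {
    0: {0: (0, 0.0), 1: (1, 1.0)},
    1: {0: (0, 0.0), 1: (1, 2.0)},
}


def q_learning(steps=20000, alpha=0.1, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration (a minimal sketch)."""
    rng = random.Random(seed)
    # Start every Q(s, a) estimate at zero.
    q = {s: {a: 0.0 for a in acts} for s, acts in TRANSITIONS.items()}
    state = 0
    for _ in range(steps):
        actions = list(q[state])
        if rng.random() < epsilon:
            # Explore: pick a uniformly random action.
            action = rng.choice(actions)
        else:
            # Exploit: pick the action with the highest current estimate.
            action = max(actions, key=lambda a: q[state][a])
        next_state, reward = TRANSITIONS[state][action]
        # Sampled right-hand side of the Bellman equation.
        td_target = reward + gamma * max(q[next_state].values())
        # Move the estimate a fraction alpha toward the target.
        q[state][action] += alpha * (td_target - q[state][action])
        state = next_state
    return q
```

In this toy MDP, always taking action 1 in state 1 earns reward 2 forever, so the optimal value $Q^*(1, 1) = 2 / (1 - 0.9) = 20$; calling `q_learning()` should return an estimate close to that.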
Beyond reinforcement learning, the Bellman equation has applications to dynamic programming. See the Wikipedia entry for Bellman Equation.
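One way to see the dynamic programming connection is Q-value iteration, which applies the Bellman equation as an exact backup when the transition model is known, instead of sampling transitions. The sketch below assumes a small hypothetical deterministic MDP (`TRANSITIONS`) and an illustrative helper name `q_value_iteration`; the discount factor and tolerance are arbitrary choices.

```python
# Hypothetical 2-state, 2-action deterministic MDP, for illustration only.
# TRANSITIONS[state][action] = (next_state, reward)
TRANSITIONS = {
    0: {0: (0, 0.0), 1: (1, 1.0)},
    1: {0: (0, 0.0), 1: (1, 2.0)},
}


def q_value_iteration(gamma=0.9, tol=1e-10, max_sweeps=10_000):
    """Repeatedly apply the Bellman backup until the Q-table stops changing."""
    q = {s: {a: 0.0 for a in acts} for s, acts in TRANSITIONS.items()}
    for _ in range(max_sweeps):
        # One full sweep: recompute every Q(s, a) from the previous table.
        new_q = {
            s: {
                a: reward + gamma * max(q[next_state].values())
                for a, (next_state, reward) in acts.items()
            }
            for s, acts in TRANSITIONS.items()
        }
        # Largest change across all entries in this sweep.
        change = max(abs(new_q[s][a] - q[s][a]) for s in q for a in q[s])
        q = new_q
        if change < tol:
            break
    return q
```

Because the backup is a contraction for a discount factor below 1, the table converges to the fixed point of the Bellman equation. For this toy MDP that fixed point can be worked out by hand: $Q^*(1, 1) = 20$, $Q^*(0, 1) = 19$, and $Q^*(0, 0) = Q^*(1, 0) = 17.1$.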