📕 Node [[bellman_equation]]
↳ 📓 Resource @KGBicheno/bellman_equation

📄 Bellman_Equation.md by @KGBicheno

Go back to the [[AI Glossary]]

In reinforcement learning, the following identity satisfied by the optimal Q-function:

The Q-function in reinforcement learning

Reinforcement learning algorithms apply this identity to create Q-learning via the following update rule:

The Bellman equation

Beyond reinforcement learning, the Bellman equation has applications to dynamic programming. See the Wikipedia entry for Bellman Equation.

Loading pushes...

Rendering context...