📕 Node [[learning_rate]]
📄 Learning_Rate.md by @KGBicheno

learning rate

Go back to the [[AI Glossary]]

A scalar used to train a model via gradient descent. During each iteration, the gradient descent algorithm multiplies the learning rate by the gradient. The resulting product is called the gradient step.
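As a rough illustration of the definition above, here is a minimal sketch in plain Python. The function names (`gradient_step`, `gradient_descent`) and the toy loss f(w) = (w - 3)^2 are assumptions made for this example and do not come from the glossary. The sketch also assumes the usual update rule, in which the gradient step is subtracted from the current parameter value.

```python
def gradient_step(learning_rate, gradient):
    """Return the gradient step: the learning rate multiplied by the gradient."""
    return learning_rate * gradient


def gradient_descent(grad_fn, w0, learning_rate, iterations):
    """Run plain gradient descent on a single scalar parameter.

    grad_fn is a hypothetical callable that returns the gradient of the
    loss at w. Each iteration subtracts the gradient step from w.
    """
    w = w0
    for _ in range(iterations):
        w = w - gradient_step(learning_rate, grad_fn(w))
    return w


# Toy loss (an assumption for illustration): f(w) = (w - 3)^2,
# whose gradient is 2 * (w - 3) and whose minimum is at w = 3.
def toy_gradient(w):
    return 2.0 * (w - 3.0)


w_final = gradient_descent(toy_gradient, w0=0.0, learning_rate=0.1, iterations=100)
print(w_final)  # ends up very close to the minimum at w = 3
```

Here the learning rate is the only value that scales the gradient before it is applied, which is why it is described as a scalar.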

Learning rate is a key hyperparameter.
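One way to see why the choice matters is to run the same descent with several learning rates. The sketch below is only an illustration under assumed conditions: the toy loss f(w) = (w - 3)^2, the starting point, the step count, and the three candidate rates are all made up for this example.

```python
def run_descent(learning_rate, steps=20, w0=0.0):
    """Gradient descent on the toy loss f(w) = (w - 3)^2 (an assumed example)."""
    w = w0
    for _ in range(steps):
        gradient = 2.0 * (w - 3.0)        # derivative of (w - 3)^2
        w = w - learning_rate * gradient  # subtract the gradient step
    return w


# Distance from the minimum at w = 3 after 20 steps, per learning rate.
errors = {}
for lr in (0.01, 0.1, 1.1):
    errors[lr] = abs(run_descent(lr) - 3.0)
    print(f"learning_rate={lr}: distance from minimum = {errors[lr]:.4f}")
```

On this toy problem, the smallest rate moves toward the minimum slowly, the middle rate gets close within the 20 steps, and the largest rate overshoots further on every step and diverges. Real models behave less predictably, so suitable values are usually found by experiment.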
