Dyna-Q Learning
Model generalization
Ability to extrapolate the model's predictions to unseen state-actions, often implemented using neural networks or other function approximators.
← ZurückAbility to extrapolate the model's predictions to unseen state-actions, often implemented using neural networks or other function approximators.
← Zurück