Policy Gradient Methods
Generalized Advantage Estimation (GAE)
An advantage estimation method that combines bias and variance through a weighted average of multi-step estimators, offering an optimal trade-off for learning.
← Terug