Proximal Policy Optimization (PPO)
Surrogate Objective
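The objective function that PPO maximizes in place of the true expected-return objective. It approximates the return of the updated policy using data collected under the old policy, weighting each advantage estimate by the probability ratio between the new and old policies. A stability constraint, most commonly clipping that ratio to a narrow interval around 1, keeps any single update from moving the policy far enough to degrade performance.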
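The clipped form of the objective can be sketched in a few lines of plain Python. This is a minimal illustration rather than a reference implementation: the function name `clipped_surrogate`, its argument names, and the default `clip_eps=0.2` are assumptions chosen for the example, and the advantages are taken as already computed.

```python
import math

def clipped_surrogate(logp_new, logp_old, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective over a batch (a quantity to maximize).

    logp_new / logp_old: log-probabilities of the taken actions under the
    updated and the data-collecting policy. All names here are illustrative.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(logp_new, logp_old, advantages):
        # Probability ratio pi_new(a|s) / pi_old(a|s), computed from log-probs.
        ratio = math.exp(lp_new - lp_old)
        # Restrict the ratio to [1 - clip_eps, 1 + clip_eps].
        clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Keep the smaller (more pessimistic) of the two candidate terms.
        total += min(ratio * adv, clipped_ratio * adv)
    return total / len(advantages)

# Ratio 1.5 with a positive advantage: the gain is capped at 1.2 * advantage.
capped = clipped_surrogate([math.log(1.5)], [0.0], [1.0])
```

Taking the minimum means the clip only removes the incentive to push the ratio further in the direction that already improves the objective. When a move makes the objective worse (for example ratio 1.5 with a negative advantage), the unclipped term is kept, so the penalty is not hidden.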