Policy Gradient Methods
Policy Gradient Theorem
The fundamental theorem that gives an analytical expression for the gradient of the expected return with respect to the policy parameters. It forms the theoretical basis of policy gradient methods.
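To make the statement concrete, here is a minimal sketch of the simplest special case: a one-step problem (a bandit) with a softmax policy. In that case the theorem reduces to grad J(theta) = sum over a of pi(a) * grad log pi(a) * r(a). The parameter values, the rewards, and all function names below are illustrative assumptions, not part of the original text. The code compares this analytical expression against a finite-difference estimate.

```python
import math

def softmax(theta):
    """Softmax policy: pi(a) = exp(theta[a]) / sum_b exp(theta[b])."""
    m = max(theta)  # subtract the max for numerical stability
    exps = [math.exp(t - m) for t in theta]
    total = sum(exps)
    return [e / total for e in exps]

def expected_return(theta, rewards):
    """J(theta) = sum_a pi(a) * r(a) for a one-step problem."""
    pi = softmax(theta)
    return sum(p * r for p, r in zip(pi, rewards))

def policy_gradient(theta, rewards):
    """Score-function form: sum_a pi(a) * grad log pi(a) * r(a).

    For a softmax policy, d log pi(a) / d theta[k] = 1[a == k] - pi(k).
    """
    pi = softmax(theta)
    n = len(theta)
    grad = [0.0] * n
    for a in range(n):
        for k in range(n):
            score = (1.0 if a == k else 0.0) - pi[k]
            grad[k] += pi[a] * score * rewards[a]
    return grad

def finite_difference_gradient(theta, rewards, eps=1e-6):
    """Central-difference estimate of dJ/dtheta, used only as a check."""
    grad = []
    for k in range(len(theta)):
        up = list(theta)
        up[k] += eps
        down = list(theta)
        down[k] -= eps
        diff = expected_return(up, rewards) - expected_return(down, rewards)
        grad.append(diff / (2 * eps))
    return grad

theta = [0.2, -0.5, 1.0]    # hypothetical policy parameters
rewards = [1.0, 0.0, 2.0]   # hypothetical per-action rewards

analytic = policy_gradient(theta, rewards)
numeric = finite_difference_gradient(theta, rewards)
max_error = max(abs(a - n) for a, n in zip(analytic, numeric))
print("analytic:", analytic)
print("numeric: ", numeric)
print("max abs difference:", max_error)
```

The two gradients should agree up to finite-difference error. The general theorem covers multi-step problems, where the reward r(a) is replaced by the action value under the current policy and the sum is weighted by the state distribution; this sketch does not cover that case.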