Deep Deterministic Policy Gradient (DDPG)
Deterministic Policy
Policy that associates a specific action with each state, unlike stochastic policies that return probability distributions.
← Quay lại