Deep Deterministic Policy Gradient (DDPG)
Neural Network Function Approximation
Use of neural networks to approximate complex functions like policies or value functions in reinforcement learning.
← Wstecz