Deep Deterministic Policy Gradient (DDPG)
Actor Network
A neural network that learns a deterministic policy: it maps each state directly to the action it estimates to be best in a continuous action space.
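As a rough illustration of that state-to-action mapping, here is a minimal forward-pass sketch in plain Python. The class name `Actor`, the single hidden layer, its size, the ReLU/tanh activations, and the `max_action` bound are all assumptions chosen for illustration, not details given above; training (the policy-gradient update driven by a critic) is omitted.

```python
import math
import random


class Actor:
    """Hypothetical tiny actor: state -> action in [-max_action, max_action]."""

    def __init__(self, state_dim, action_dim, hidden_dim=16, max_action=1.0, seed=0):
        rng = random.Random(seed)
        self.max_action = max_action
        # Weights are stored as lists of rows with small random initial values.
        self.w1 = [[rng.uniform(-0.1, 0.1) for _ in range(state_dim)]
                   for _ in range(hidden_dim)]
        self.b1 = [0.0] * hidden_dim
        self.w2 = [[rng.uniform(-0.1, 0.1) for _ in range(hidden_dim)]
                   for _ in range(action_dim)]
        self.b2 = [0.0] * action_dim

    def act(self, state):
        # Hidden layer with ReLU activation.
        hidden = [max(0.0, sum(w * s for w, s in zip(row, state)) + b)
                  for row, b in zip(self.w1, self.b1)]
        # tanh squashes each output to (-1, 1); scaling sets the action bounds.
        return [self.max_action * math.tanh(sum(w * h for w, h in zip(row, hidden)) + b)
                for row, b in zip(self.w2, self.b2)]


actor = Actor(state_dim=3, action_dim=1, max_action=2.0)
action = actor.act([0.5, -0.2, 0.1])
```

The output is deterministic: the same state always yields the same action, with no sampling from a distribution. In practice exploration noise is typically added to this output during training.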