Deep Deterministic Policy Gradient (DDPG)
Actor Network
Neural network learning to directly map states to optimal actions in a continuous action space.
← KembaliNeural network learning to directly map states to optimal actions in a continuous action space.
← Kembali