Deep Deterministic Policy Gradient (DDPG)
Actor Network
Neural network learning to directly map states to optimal actions in a continuous action space.
← TerugNeural network learning to directly map states to optimal actions in a continuous action space.
← Terug