Actor-Critic Methods
Deep Deterministic Policy Gradient
Actor-Critic algorithm for continuous action spaces using deep neural networks with deterministic policy and replay buffer for stable off-policy learning.
← 뒤로