Deep Deterministic Policy Gradient (DDPG)
Soft Update
Method of gradually updating target networks using a tau coefficient (τ) to slowly mix the weights of main and target networks.
← TerugMethod of gradually updating target networks using a tau coefficient (τ) to slowly mix the weights of main and target networks.
← Terug