Actor-Critic Methods
Value Function
Mathematical function estimating the expected cumulative return from a state or state-action pair, serving as the learning signal for the critic in the Actor-Critic architecture.
← Geri