DAgger Data Aggregation
Trajectory distribution
Set of state and action sequences that the agent generates by following its current policy. DAgger aims to align this distribution with that produced by the optimal expert policy.
← Back