Observation-based Imitation
Dataset Aggregation
Iterative algorithm that collects new expert data based on the errors of the current agent, progressively aggregating a more robust dataset.
← Indietro