DAgger Data Aggregation
Feedback loop
Continuous cycle where the performance of the current policy generates new states, which in turn require expert corrections. This iterative loop is the fundamental improvement mechanism in DAgger.
← Indietro