DAgger (Dataset Aggregation)
Adaptive aggregation
A variant of DAgger that dynamically adjusts the proportion of expert actions versus current-policy actions used while collecting training data. Adapting this ratio helps balance exploration and exploitation during learning.
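As a rough illustration, the mixing step can be sketched in Python as below. The names `choose_action` and `adapt_beta`, the `target` and `step` parameters, and the disagreement-based update rule are all assumptions made for this example. They are not taken from any specific published variant. Here `beta` is the probability of executing the expert's action.

```python
import random


def choose_action(expert_action, policy_action, beta, rng=random):
    """Execute the expert's action with probability beta, else the learner's."""
    return expert_action if rng.random() < beta else policy_action


def adapt_beta(beta, disagreement_rate, target=0.1, step=0.1):
    """Hypothetical adaptation rule (an assumption, not a standard formula).

    Raise beta when the learner disagrees with the expert more often than
    `target`, lower it otherwise, and clamp the result to [0, 1].
    """
    if disagreement_rate > target:
        beta += step
    else:
        beta -= step
    return min(1.0, max(0.0, beta))
```

In a training loop, one might measure the fraction of visited states where the learner's action differs from the expert's label after each iteration, pass it to `adapt_beta`, and use the returned value for the next round of data collection. Other signals, such as validation loss, could drive the adjustment equally well.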