DAgger Data Aggregation
Error correction
Process by which an expert provides the correct actions when the current agent policy makes mistakes. These corrections serve as new training data to improve the policy.
← 뒤로