Inverse Reinforcement Learning
Expert Trajectory
Sequence of states and actions observed in an expert, representing an optimal or near-optimal solution to the problem.
← ZurückSequence of states and actions observed in an expert, representing an optimal or near-optimal solution to the problem.
← Zurück