Inverse Reinforcement Learning
Expert Trajectory
Sequence of states and actions observed in an expert, representing an optimal or near-optimal solution to the problem.
← Quay lạiSequence of states and actions observed in an expert, representing an optimal or near-optimal solution to the problem.
← Quay lại