Model-Based Offline RL
Trajectory Transformers
Transformer architecture that models trajectories as sequences of states, actions, and rewards to predict future transitions in offline learning.
← Terug