Decision Transformer
Policy Extraction
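Process of deriving a decision policy from a trained sequence model: at inference time, the transformer generates actions autoregressively, conditioned on the observed states, the past actions, and a desired return (the return-to-go), which is reduced by each reward received.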
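A minimal sketch of how such a policy might be rolled out, assuming a model that maps recent returns-to-go, states, and actions to the next action. The names toy_model, toy_env_step, rollout, and context_len are hypothetical and chosen for illustration only; toy_model is a hand-written stand-in, not a trained transformer.

```python
from typing import Callable, List, Tuple

# Hypothetical model signature: (returns_to_go, states, actions) -> next action.
Model = Callable[[List[float], List[float], List[int]], int]
EnvStep = Callable[[float, int], Tuple[float, float]]


def toy_model(returns_to_go: List[float], states: List[float], actions: List[int]) -> int:
    """Stand-in for a trained sequence model (assumption, not a real network).

    It acts (+1) while some desired return is still outstanding, else idles (0).
    """
    return 1 if returns_to_go[-1] > 0 else 0


def toy_env_step(state: float, action: int) -> Tuple[float, float]:
    """Hypothetical environment: the action shifts the state and is also the reward."""
    return state + action, float(action)


def rollout(
    model: Model,
    env_step: EnvStep,
    initial_state: float,
    target_return: float,
    horizon: int,
    context_len: int = 20,
) -> Tuple[List[int], float]:
    """Extract a policy by repeatedly querying the model with the desired return."""
    returns_to_go = [target_return]
    states = [initial_state]
    actions: List[int] = []
    total_reward = 0.0
    for _ in range(horizon):
        # Condition on the most recent context window only.
        action = model(
            returns_to_go[-context_len:],
            states[-context_len:],
            actions[-context_len:],
        )
        next_state, reward = env_step(states[-1], action)
        actions.append(action)
        total_reward += reward
        # The return still to be achieved shrinks by the reward just received.
        returns_to_go.append(returns_to_go[-1] - reward)
        states.append(next_state)
    return actions, total_reward
```

Calling rollout(toy_model, toy_env_step, 0.0, 3.0, 5) asks the stand-in policy for a return of 3 over 5 steps. With a real trained model, the same loop would typically feed tensors and sample or take the argmax of the predicted action.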