Offline Multi-Task Reinforcement Learning
Task-Specific Policy Heads
Network architecture with shared common trunk and distinct output heads for each task in offline multi-task learning.
← TerugNetwork architecture with shared common trunk and distinct output heads for each task in offline multi-task learning.
← Terug