Offline Multi-Task Reinforcement Learning
Shared Dataset Policy Optimization
Multiple policy optimization technique using a common pool of experience data to improve learning efficiency across tasks.
← Back