Offline Multi-Task Reinforcement Learning
Multi-Task Offline Exploration-Exploitation
Dilemma adapted to the offline context where the balance between using existing data and controlled extrapolation is managed for multiple tasks.
← Zurück