Active Reinforcement Learning
Off-Policy Reinforcement Learning
Method enabling the learning of an optimal policy while following a different behavior policy, useful for flexible exploration.
← Zurück