Monte Carlo Methods in RL
GLIE Algorithm
Exploration strategy that is Greedy In the Limit with Infinite Exploration, guaranteeing asymptotic convergence to the optimal policy. Exploration gradually decreases while exploitation increases over time.
← Tillbaka