Multi-Armed Bandits
Convergence Rate
Speed at which a bandit algorithm approaches the optimal policy, usually assessed by how quickly its regret per round shrinks; it measures the asymptotic efficiency of the learning strategy.
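As a rough illustration, the sketch below tracks how fast one particular strategy closes in on the best arm. The entry names no algorithm, so the choice of UCB1, the Bernoulli rewards, the arm means, the horizon, and the function name `ucb1_average_regret` are all assumptions made for the example. Average pseudo-regret per round is used here as a stand-in for distance from the optimal policy.

```python
import math
import random


def ucb1_average_regret(means, horizon, seed=0):
    """Run UCB1 on Bernoulli arms (hypothetical setup) and return the
    average pseudo-regret after each round. Requires horizon >= len(means)."""
    rng = random.Random(seed)
    k = len(means)
    best = max(means)
    counts = [0] * k      # number of pulls per arm
    totals = [0.0] * k    # summed rewards per arm
    cumulative = 0.0      # cumulative pseudo-regret (uses the true means)
    averages = []

    for t in range(1, horizon + 1):
        if t <= k:
            arm = t - 1   # pull every arm once before using the index
        else:
            # UCB1 index: empirical mean plus an exploration bonus
            arm = max(
                range(k),
                key=lambda a: totals[a] / counts[a]
                + math.sqrt(2 * math.log(t) / counts[a]),
            )
        reward = 1.0 if rng.random() < means[arm] else 0.0
        counts[arm] += 1
        totals[arm] += reward
        cumulative += best - means[arm]
        averages.append(cumulative / t)
    return averages


# Assumed arm means, chosen only for illustration.
curve = ucb1_average_regret([0.2, 0.5, 0.8], horizon=5000)
for t in (10, 100, 1000, 5000):
    print(t, round(curve[t - 1], 4))
```

A faster-converging strategy would show the printed averages falling toward zero in fewer rounds. Comparing such curves across algorithms on the same arms is one simple way to compare convergence rates empirically.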