UCB Algorithms
Asymptotic Optimality
Theoretical property guaranteeing that a UCB algorithm asymptotically achieves the lowest possible regret bound, characterizing its long-term efficiency.
← Indietro