Epsilon-Greedy Algorithms
Algorithm convergence
Property guaranteeing that the epsilon-greedy algorithm converges to the optimal policy under certain conditions. Convergence depends on appropriate epsilon decay and a sufficient number of iterations.
← 뒤로