Bootstrap Methods in RL
C51 (Categorical 51)
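Distributional RL algorithm that represents the return distribution as a categorical distribution over 51 fixed, evenly spaced atoms. It learns by bootstrapping: the target is built from its own predicted next-state distribution, shifted by the Bellman update and projected back onto the fixed atoms, so the learned probabilities capture the spread of possible returns rather than only their mean.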
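The projection step that turns a bootstrapped next-state distribution into a target over the 51 atoms can be sketched as follows. This is a minimal, dependency-free illustration and not a reference implementation: the function name `project_distribution`, its parameters, and the support bounds `v_min=-10`, `v_max=10` are assumptions chosen for the example.

```python
import math


def project_distribution(next_probs, reward, gamma,
                         v_min=-10.0, v_max=10.0, done=False):
    """Project the Bellman-shifted return distribution onto fixed atoms.

    next_probs: probabilities over the atoms for the next state-action pair.
    All names and default bounds here are illustrative assumptions.
    """
    n_atoms = len(next_probs)
    delta_z = (v_max - v_min) / (n_atoms - 1)  # spacing between atoms
    target = [0.0] * n_atoms

    for j, p in enumerate(next_probs):
        z_j = v_min + j * delta_z
        # Bootstrapped target value for this atom, clipped to the support.
        tz = reward if done else reward + gamma * z_j
        tz = min(max(tz, v_min), v_max)

        # Fractional index of tz on the atom grid (clamped against rounding).
        b = (tz - v_min) / delta_z
        b = min(max(b, 0.0), float(n_atoms - 1))
        lower, upper = math.floor(b), math.ceil(b)

        if lower == upper:
            # tz lands exactly on an atom: assign all mass there.
            target[lower] += p
        else:
            # Split the mass between the two neighbouring atoms.
            target[lower] += p * (upper - b)
            target[upper] += p * (b - lower)

    return target


if __name__ == "__main__":
    uniform = [1.0 / 51] * 51
    target = project_distribution(uniform, reward=1.0, gamma=0.99)
    print(round(sum(target), 6))
```

In a full agent the projected target would typically be compared to the predicted distribution with a cross-entropy loss; that part is omitted here.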