Bootstrap Methods in RL
Q-learning with Bootstrap
Extension of classic Q-learning using multiple Q-value heads trained on different bootstrap samples to capture uncertainty and improve exploration.
← Indietro