Bootstrap Methods in RL
Sequential Bootstrap
Variant adapted to temporal RL data preserving sequential dependency structure during resampling to avoid underestimation of uncertainty.
← ZurückVariant adapted to temporal RL data preserving sequential dependency structure during resampling to avoid underestimation of uncertainty.
← Zurück