Ensemble Learning
Posterior Predictive Distribution
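The full distribution over future states or rewards, accounting for both uncertainty about the model parameters (epistemic uncertainty) and process noise (aleatoric uncertainty). In practice it is approximated by combining the predictions of an ensemble of models. It is fundamental for robust planning in reinforcement learning (RL).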
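As a rough illustration of the ensemble approximation, the sketch below treats each ensemble member as one plausible parameter setting and mixes the members' predictive distributions with equal weight. It assumes, purely for illustration, that every member outputs a Gaussian (a mean and a noise standard deviation) for a scalar next state. The function names and the example numbers are hypothetical and do not come from any particular library or paper.

```python
import random
from statistics import fmean


def mixture_moments(means, stds):
    """Mean and variance split of an equal-weight Gaussian mixture.

    Assumption: each ensemble member predicts a Gaussian N(mean, std^2).
    """
    mean = fmean(means)
    # Average of the members' own noise variances: process (aleatoric) noise.
    aleatoric = fmean([s * s for s in stds])
    # Spread of the member means around the mixture mean: parameter
    # (epistemic) uncertainty, visible as disagreement between members.
    epistemic = fmean([(m - mean) ** 2 for m in means])
    return mean, aleatoric, epistemic


def sample_posterior_predictive(means, stds, n_samples, rng):
    """Draw samples from the equal-weight mixture of ensemble members."""
    samples = []
    for _ in range(n_samples):
        # Picking a member stands in for sampling parameters from the posterior.
        k = rng.randrange(len(means))
        # Sampling from that member's Gaussian adds the process noise.
        samples.append(rng.gauss(means[k], stds[k]))
    return samples


# Hypothetical predictions of a 5-member ensemble for one state-action pair.
member_means = [0.9, 1.0, 1.2, 1.1, 0.8]
member_stds = [0.1, 0.1, 0.2, 0.1, 0.15]

mean, aleatoric, epistemic = mixture_moments(member_means, member_stds)
total_variance = aleatoric + epistemic

rng = random.Random(0)
samples = sample_posterior_predictive(member_means, member_stds, 10_000, rng)
```

The total predictive variance is the sum of the two terms, so a planner can tell whether a prediction is uncertain because the members disagree (more data might help) or because the process itself is noisy (more data will not help).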