🏠 Home
Benchmark Hub
📊 All Benchmarks 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List Applications 🎨 Creative Free Pages 🎯 FSACB - Ultimate Showcase 🌍 Translation Benchmark
Models
🏆 Top 10 Models 🆓 Free Models 📋 All Models ⚙️ Kilo Code
Resources
💬 Prompts Library 📖 AI Glossary 🔗 Useful Links

AI Glossary

The complete dictionary of Artificial Intelligence

162
categories
2,032
subcategories
23,060
terms
📖
terms

Bootstrap in RL

Resampling technique used in reinforcement learning to estimate value function uncertainty by creating multiple estimations from the same data sample.

📖
terms

Bootstrap Value Distribution

Probabilistic representation of the value function obtained by aggregating multiple bootstrap estimations, allowing quantification of uncertainty on value predictions.

📖
terms

Weighted Bootstrap

Technique assigning weights to bootstrap samples based on their relevance or recency to give more importance to more informative experiences in value estimation.

📖
terms

Q-learning with Bootstrap

Extension of classic Q-learning using multiple Q-value heads trained on different bootstrap samples to capture uncertainty and improve exploration.

📖
terms

C51 (Categorical 51)

Distributional algorithm discretizing the return distribution into 51 probability atoms, using bootstrap techniques to estimate uncertainty on this representation.

📖
terms

IQN (Implicit Quantile Networks)

Network architecture directly learning the quantile distribution of returns, integrating bootstrap mechanisms to quantify uncertainty of quantile predictions.

📖
terms

QR-DQN (Quantile Regression DQN)

DQN variant using quantile regression on bootstrap samples to learn the complete distribution of action values with uncertainty quantification.

📖
terms

Bootstrap Head Networks

Architecture comprising multiple independent output heads trained on different bootstrap samples to capture uncertainty in value predictions.

📖
terms

Uncertainty-based Exploration

Exploration strategy using bootstrap estimates to quantify uncertainty and guide the agent toward the least known states of the environment.

📖
terms

Bootstrap Ensembles

Method training multiple models on different bootstrap samples to form a predictive ensemble capturing the variability and uncertainty of the learning process.

📖
terms

Dropout as Bootstrap Approximation

Technique using dropout during inference as an efficient approximation of bootstrap to quickly estimate uncertainty without training multiple models.

📖
terms

Credible Intervals

Statistical intervals derived from bootstrap distributions quantifying uncertainty on value estimates with a specified confidence probability.

📖
terms

Bootstrap Variance

Metric quantifying the dispersion of bootstrap estimates among themselves, serving as a direct indicator of epistemic uncertainty in value predictions.

📖
terms

Bootstrap Bias

Systematic deviation potentially introduced by bootstrap methods, requiring correction techniques such as double bootstrap for unbiased estimates.

📖
terms

Sequential Bootstrap

Variant adapted to temporal RL data preserving sequential dependency structure during resampling to avoid underestimation of uncertainty.

🔍

No results found