AI Glossary

Complete Glossary of Artificial Intelligence

162 categories, 2,032 subcategories, 23,060 terms
Bootstrap in RL

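Resampling technique used in reinforcement learning to estimate value-function uncertainty: the same set of experiences is resampled with replacement, and the value is re-estimated on each resample. This statistical sense of "bootstrap" is distinct from temporal-difference bootstrapping, in which a value estimate is updated from other value estimates.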
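
As a minimal sketch of the resampling idea, the snippet below re-estimates a state value (here simply the mean of observed returns) on many resamples drawn with replacement. The return values, the function name `bootstrap_value_estimates`, and the number of resamples are illustrative assumptions, not part of any particular RL library.

```python
import random
import statistics

def bootstrap_value_estimates(returns, n_resamples=1000, seed=0):
    """Re-estimate the mean return on resamples drawn with replacement."""
    rng = random.Random(seed)
    n = len(returns)
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(returns, k=n)  # sampling with replacement
        estimates.append(statistics.fmean(resample))
    return estimates

# Hypothetical Monte Carlo returns observed from a single state.
returns = [1.0, 0.0, 2.0, 1.5, 0.5, 3.0, 1.0, 0.0]
estimates = bootstrap_value_estimates(returns)
value = statistics.fmean(estimates)
spread = statistics.stdev(estimates)  # dispersion across resamples
print(f"value ~ {value:.2f}, bootstrap std ~ {spread:.2f}")
```

The spread of `estimates` is what the later entries on bootstrap variance and intervals summarize.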

Bootstrap Value Distribution

Probabilistic representation of the value function obtained by aggregating multiple bootstrap estimates, which allows the uncertainty in value predictions to be quantified.

Weighted Bootstrap

Technique that assigns weights to bootstrap samples according to their relevance or recency, so that more informative experiences count for more in value estimation.
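
One way to picture this is resampling with non-uniform probabilities. The sketch below assumes an exponential recency weighting (the `decay` parameter and the helper names are hypothetical choices for illustration); other weighting schemes would plug into the same `weights` argument.

```python
import random

def recency_weights(n, decay=0.9):
    """Weight for item i (oldest first); the newest item gets weight 1."""
    return [decay ** (n - 1 - i) for i in range(n)]

def weighted_bootstrap_mean(values, weights, rng):
    """Mean of one resample drawn with replacement, proportionally to weights."""
    resample = rng.choices(values, weights=weights, k=len(values))
    return sum(resample) / len(resample)

rng = random.Random(0)
# Hypothetical returns, oldest first: the environment "improved" halfway through.
returns = [0.0, 0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 1.0]
weights = recency_weights(len(returns), decay=0.5)
estimates = [weighted_bootstrap_mean(returns, weights, rng) for _ in range(500)]
weighted_value = sum(estimates) / len(estimates)
print(f"recency-weighted value ~ {weighted_value:.2f}")
```

Because recent returns dominate the resamples, the estimate sits well above the unweighted mean of 0.5.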

Q-learning with Bootstrap

Extension of classic Q-learning that trains multiple Q-value heads on different bootstrap samples (as in Bootstrapped DQN) to capture uncertainty and improve exploration.
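
A tabular sketch of the idea, under simplifying assumptions: each head is its own Q-table, and a random Bernoulli mask decides which heads see a given transition, which stands in for giving each head its own bootstrap sample. The sizes, the mask probability `MASK_P`, and the function names are illustrative, not taken from a specific implementation.

```python
import random

N_HEADS, N_STATES, N_ACTIONS = 5, 3, 2
ALPHA, GAMMA, MASK_P = 0.1, 0.9, 0.5

rng = random.Random(0)
# One Q-table per head: q[head][state][action]
q = [[[0.0] * N_ACTIONS for _ in range(N_STATES)] for _ in range(N_HEADS)]

def update(s, a, r, s_next, done):
    """Apply a Q-learning update to a random subset of heads."""
    mask = [rng.random() < MASK_P for _ in range(N_HEADS)]
    for k in range(N_HEADS):
        if not mask[k]:
            continue  # this head does not train on this transition
        target = r if done else r + GAMMA * max(q[k][s_next])
        q[k][s][a] += ALPHA * (target - q[k][s][a])
    return mask

def head_spread(s, a):
    """Disagreement between heads, a rough uncertainty signal."""
    vals = [q[k][s][a] for k in range(N_HEADS)]
    return max(vals) - min(vals)

# Hypothetical terminal transition replayed 20 times.
for _ in range(20):
    update(s=0, a=1, r=1.0, s_next=2, done=True)

print([round(q[k][0][1], 3) for k in range(N_HEADS)], head_spread(0, 1))
```

Heads that received the transition more often have moved further toward the target, so their disagreement shrinks only as data accumulates.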

C51 (Categorical 51)

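Distributional RL algorithm that represents the return distribution as a categorical distribution over 51 fixed, evenly spaced atoms. It bootstraps only in the temporal-difference sense (targets are built from its own predictions) and does not rely on statistical bootstrap resampling; the spread of the learned distribution mainly reflects randomness in the returns rather than uncertainty about the model.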
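
The core step of a categorical algorithm of this kind is projecting the shifted distribution r + γz back onto the fixed atoms. The following is a minimal sketch of that projection for a single non-terminal transition; the support bounds `V_MIN` and `V_MAX` are assumed values chosen for illustration, and `project` is a hypothetical helper name.

```python
N_ATOMS, V_MIN, V_MAX = 51, -10.0, 10.0   # support bounds are an assumption
DELTA = (V_MAX - V_MIN) / (N_ATOMS - 1)
ATOMS = [V_MIN + i * DELTA for i in range(N_ATOMS)]

def project(next_probs, reward, gamma):
    """Project the distribution of r + gamma * z onto the fixed atoms."""
    target = [0.0] * N_ATOMS
    for p, z in zip(next_probs, ATOMS):
        tz = min(max(reward + gamma * z, V_MIN), V_MAX)  # clip to the support
        b = (tz - V_MIN) / DELTA                         # fractional atom index
        lo = min(int(b), N_ATOMS - 1)
        hi = min(lo + 1, N_ATOMS - 1)
        frac = b - lo
        # Split the probability mass between the two neighbouring atoms.
        target[lo] += p * (1.0 - frac)
        target[hi] += p * frac
    return target

uniform = [1.0 / N_ATOMS] * N_ATOMS
projected = project(uniform, reward=1.0, gamma=0.99)

# A point mass at z = 0 shifted by half an atom width is split evenly.
point = [0.0] * N_ATOMS
point[25] = 1.0
split = project(point, reward=0.2, gamma=1.0)
print(round(sum(projected), 6), round(split[25], 3), round(split[26], 3))
```

The projected vector would then serve as the target in a cross-entropy loss against the predicted atom probabilities.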

IQN (Implicit Quantile Networks)

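Network architecture that learns the quantile function of the return distribution implicitly: it takes a sampled quantile fraction as an additional input and outputs the corresponding return quantile. Standard IQN does not resample data; bootstrap mechanisms can be layered on top to quantify the uncertainty of the quantile predictions.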

QR-DQN (Quantile Regression DQN)

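DQN variant that uses quantile regression to learn a fixed set of quantiles of the return distribution for each action. The base algorithm trains on ordinary replay samples; applying it to bootstrap samples is an extension that adds uncertainty quantification on top of the learned distribution.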
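
To make the quantile-regression part concrete, the sketch below fits a handful of quantile estimates to scalar samples by stochastic subgradient descent on the plain pinball loss. It deliberately omits the neural network, the Bellman target, and the Huber smoothing used in practice; the sample data, learning rate, and function names are assumptions for illustration.

```python
import random

def pinball_loss(theta, sample, tau):
    """Quantile-regression loss for estimate theta at quantile level tau."""
    u = sample - theta
    return tau * u if u >= 0 else (tau - 1.0) * u

def fit_quantiles(samples, n_quantiles=5, lr=0.005, epochs=100, seed=0):
    rng = random.Random(seed)
    # Quantile midpoints: 0.1, 0.3, ..., 0.9 for n_quantiles = 5.
    taus = [(2 * i + 1) / (2 * n_quantiles) for i in range(n_quantiles)]
    thetas = [0.0] * n_quantiles
    for _ in range(epochs):
        for x in rng.sample(samples, len(samples)):  # one shuffled pass
            for i, tau in enumerate(taus):
                # Subgradient step: up by tau if x is above theta,
                # down by (1 - tau) otherwise.
                thetas[i] += lr * (tau if x > thetas[i] else tau - 1.0)
    return taus, thetas

# Hypothetical returns spread evenly over [0, 1].
samples = [i / 100 for i in range(101)]
taus, thetas = fit_quantiles(samples)
print([round(t, 2) for t in thetas])
```

Each estimate settles near the point where a fraction tau of the samples lies below it, which is what lets a set of such outputs describe the whole return distribution.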

Bootstrap Head Networks

Architecture comprising multiple output heads, typically on top of a shared network body, each trained on a different bootstrap sample so that disagreement between heads captures uncertainty in value predictions.

Uncertainty-based Exploration

Exploration strategy that uses bootstrap estimates to quantify uncertainty and steer the agent toward the states and actions it knows least about.
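
One simple way to turn head disagreement into an exploration rule is an optimism bonus: score each action by the ensemble mean plus a multiple of the ensemble standard deviation. This is only one of several possible schemes (sampling a single head per episode is another); the `beta` coefficient and the Q-values below are illustrative assumptions.

```python
import statistics

def select_action(head_q_values, beta=1.0):
    """head_q_values[k][a] is head k's Q-estimate for action a in the current state."""
    n_actions = len(head_q_values[0])
    scores = []
    for a in range(n_actions):
        vals = [head[a] for head in head_q_values]
        # Mean estimate plus a bonus proportional to disagreement between heads.
        scores.append(statistics.fmean(vals) + beta * statistics.pstdev(vals))
    best = max(range(n_actions), key=scores.__getitem__)
    return best, scores

# Hypothetical estimates from four heads for two actions:
# the heads agree on action 0 and disagree on action 1 (same mean).
heads = [[1.0, 0.0], [1.0, 2.0], [1.0, 0.5], [1.0, 1.5]]
explore_action, explore_scores = select_action(heads, beta=1.0)
greedy_action, greedy_scores = select_action(heads, beta=0.0)
print(explore_action, greedy_action)
```

With the bonus switched on, the agent prefers the action the heads disagree about; with `beta=0.0` the rule reduces to greedy selection on the ensemble mean.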

Bootstrap Ensembles

Method that trains multiple models on different bootstrap samples to form a predictive ensemble; disagreement among the members reflects the variability and uncertainty of the learning process.

Dropout as Bootstrap Approximation

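Technique that keeps dropout active at inference time and runs several stochastic forward passes, treating each randomly thinned network as an inexpensive stand-in for a bootstrap ensemble member, so that uncertainty can be estimated quickly without training multiple models.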
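
A toy sketch of the mechanism, assuming a linear value model with inverted dropout applied to its inputs. Real uses apply dropout inside a neural network; the weights, features, `keep_prob`, and function names here are hypothetical.

```python
import random
import statistics

def dropout_forward(weights, x, keep_prob, rng):
    """One stochastic pass of a linear value model with inverted dropout on the inputs."""
    total = 0.0
    for w, xi in zip(weights, x):
        if rng.random() < keep_prob:       # keep this input
            total += w * xi / keep_prob    # rescale so the expectation is unchanged
    return total

def mc_dropout_predict(weights, x, keep_prob=0.8, n_passes=200, seed=0):
    """Mean and standard deviation over repeated stochastic passes."""
    rng = random.Random(seed)
    samples = [dropout_forward(weights, x, keep_prob, rng) for _ in range(n_passes)]
    return statistics.fmean(samples), statistics.stdev(samples)

# Hypothetical model and state features; the deterministic output is 0.9.
weights = [0.5, -0.2, 1.0, 0.3]
features = [1.0, 2.0, 0.5, 1.0]
mean_value, value_std = mc_dropout_predict(weights, features)
print(f"value ~ {mean_value:.2f} +/- {value_std:.2f}")
```

The standard deviation across passes plays the role that disagreement between ensemble members plays in a true bootstrap ensemble.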

Credible Intervals

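Intervals derived from bootstrap distributions that bound a value estimate with a specified probability. Strictly speaking, credible intervals are a Bayesian notion, while intervals read off bootstrap percentiles are confidence intervals; in this setting the two terms are often used loosely for the same construction.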
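
The simplest such construction is the percentile interval: sort the bootstrap estimates and read off the lower and upper cut points. The sketch below assumes a 95% level and uses a basic index rule for the cut points; the data and the helper name `percentile_interval` are illustrative.

```python
import random
import statistics

def percentile_interval(estimates, level=0.95):
    """Read the interval off the sorted bootstrap estimates."""
    ordered = sorted(estimates)
    n = len(ordered)
    alpha = (1.0 - level) / 2.0
    lo_idx = int(alpha * (n - 1))
    hi_idx = int(round((1.0 - alpha) * (n - 1)))
    return ordered[lo_idx], ordered[hi_idx]

rng = random.Random(0)
# Hypothetical returns observed from one state.
returns = [1.0, 0.0, 2.0, 1.5, 0.5, 3.0, 1.0, 0.0, 2.5, 1.0]
estimates = [
    statistics.fmean(rng.choices(returns, k=len(returns)))
    for _ in range(2000)
]
low, high = percentile_interval(estimates, level=0.95)
print(f"95% interval for the value: [{low:.2f}, {high:.2f}]")
```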

Bootstrap Variance

Variance of the bootstrap estimates across resamples or ensemble members, commonly used as a direct indicator of epistemic uncertainty in value predictions.

Bootstrap Bias

Systematic deviation that bootstrap methods can introduce into an estimate; correction techniques such as the double bootstrap are used to reduce it.
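
Related to this, the bootstrap can also be used to estimate and subtract the bias of an estimator. The sketch below shows the basic single-level correction (not the double bootstrap named above) on an estimator that is biased upward by construction: the maximum over per-action mean returns. The data and function names are assumptions for illustration.

```python
import random
import statistics

def max_of_means(returns_by_action):
    """Estimate of the best action's value; biased upward under noise."""
    return max(statistics.fmean(r) for r in returns_by_action)

def bootstrap_bias_correction(returns_by_action, n_resamples=2000, seed=0):
    rng = random.Random(seed)
    original = max_of_means(returns_by_action)
    boot = []
    for _ in range(n_resamples):
        # Resample each action's returns separately, with replacement.
        resampled = [rng.choices(r, k=len(r)) for r in returns_by_action]
        boot.append(max_of_means(resampled))
    bias = statistics.fmean(boot) - original   # estimated bias
    return original, bias, original - bias     # corrected = 2 * original - mean(boot)

# Hypothetical returns for two actions with the same sample mean (1.0).
returns_by_action = [[0.0, 2.0] * 5, [0.5, 1.5] * 5]
original, bias, corrected = bootstrap_bias_correction(returns_by_action)
print(f"original {original:.2f}, estimated bias {bias:.2f}, corrected {corrected:.2f}")
```

The corrected estimate is lower than the raw maximum, which matches the direction of the overestimation that taking a maximum over noisy means produces.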

Sequential Bootstrap

Variant adapted to temporal RL data that preserves sequential dependence during resampling, for example by resampling contiguous blocks or whole trajectories rather than individual transitions, to avoid underestimating uncertainty.
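
One common way to preserve temporal structure is a moving-block bootstrap, sketched below under the assumption that a fixed block length is adequate for the dependence in the data. The function name and the block length are illustrative choices.

```python
import random

def block_bootstrap(series, block_len, rng):
    """Rebuild a series of the same length from contiguous blocks drawn with replacement."""
    n = len(series)
    starts = range(n - block_len + 1)   # every valid block start
    out = []
    while len(out) < n:
        s = rng.choice(starts)
        out.extend(series[s:s + block_len])
    return out[:n]

rng = random.Random(0)
# Hypothetical trajectory of 20 time-indexed rewards (values equal their time step).
trajectory = list(range(20))
resampled = block_bootstrap(trajectory, block_len=5, rng=rng)
print(resampled)
```

Within each block the original ordering is intact, so short-range dependence between consecutive steps survives the resampling.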
