🏠 홈
벤치마크
📊 모든 벤치마크 🦖 공룡 v1 🦖 공룡 v2 ✅ 할 일 목록 앱 🎨 창의적인 자유 페이지 🎯 FSACB - 궁극의 쇼케이스 🌍 번역 벤치마크
모델
🏆 톱 10 모델 🆓 무료 모델 📋 모든 모델 ⚙️ 킬로 코드 모드
리소스
💬 프롬프트 라이브러리 📖 AI 용어 사전 🔗 유용한 링크

AI 용어집

인공지능 완전 사전

162
카테고리
2,032
하위 카테고리
23,060
용어
📖
용어

Bootstrap in RL

Resampling technique used in reinforcement learning to estimate value function uncertainty by creating multiple estimations from the same data sample.

📖
용어

Bootstrap Value Distribution

Probabilistic representation of the value function obtained by aggregating multiple bootstrap estimations, allowing quantification of uncertainty on value predictions.

📖
용어

Weighted Bootstrap

Technique assigning weights to bootstrap samples based on their relevance or recency to give more importance to more informative experiences in value estimation.

📖
용어

Q-learning with Bootstrap

Extension of classic Q-learning using multiple Q-value heads trained on different bootstrap samples to capture uncertainty and improve exploration.

📖
용어

C51 (Categorical 51)

Distributional algorithm discretizing the return distribution into 51 probability atoms, using bootstrap techniques to estimate uncertainty on this representation.

📖
용어

IQN (Implicit Quantile Networks)

Network architecture directly learning the quantile distribution of returns, integrating bootstrap mechanisms to quantify uncertainty of quantile predictions.

📖
용어

QR-DQN (Quantile Regression DQN)

DQN variant using quantile regression on bootstrap samples to learn the complete distribution of action values with uncertainty quantification.

📖
용어

Bootstrap Head Networks

Architecture comprising multiple independent output heads trained on different bootstrap samples to capture uncertainty in value predictions.

📖
용어

Uncertainty-based Exploration

Exploration strategy using bootstrap estimates to quantify uncertainty and guide the agent toward the least known states of the environment.

📖
용어

Bootstrap Ensembles

Method training multiple models on different bootstrap samples to form a predictive ensemble capturing the variability and uncertainty of the learning process.

📖
용어

Dropout as Bootstrap Approximation

Technique using dropout during inference as an efficient approximation of bootstrap to quickly estimate uncertainty without training multiple models.

📖
용어

Credible Intervals

Statistical intervals derived from bootstrap distributions quantifying uncertainty on value estimates with a specified confidence probability.

📖
용어

Bootstrap Variance

Metric quantifying the dispersion of bootstrap estimates among themselves, serving as a direct indicator of epistemic uncertainty in value predictions.

📖
용어

Bootstrap Bias

Systematic deviation potentially introduced by bootstrap methods, requiring correction techniques such as double bootstrap for unbiased estimates.

📖
용어

Sequential Bootstrap

Variant adapted to temporal RL data preserving sequential dependency structure during resampling to avoid underestimation of uncertainty.

🔍

결과를 찾을 수 없습니다