🏠 홈
벤치마크
📊 모든 벤치마크 🦖 공룡 v1 🦖 공룡 v2 ✅ 할 일 목록 앱 🎨 창의적인 자유 페이지 🎯 FSACB - 궁극의 쇼케이스 🌍 번역 벤치마크
모델
🏆 톱 10 모델 🆓 무료 모델 📋 모든 모델 ⚙️ 킬로 코드 모드
리소스
💬 프롬프트 라이브러리 📖 AI 용어 사전 🔗 유용한 링크

AI 용어집

인공지능 완전 사전

162
카테고리
2,032
하위 카테고리
23,060
용어
📖
용어

Value Distribution

Complete representation of uncertainty about future returns in reinforcement learning, modeling the entire probability distribution of each possible return rather than just its expectation.

📖
용어

Distributional Reinforcement Learning

RL paradigm that explicitly models the full distribution of expected returns to capture uncertainty and variability of future outcomes.

📖
용어

Distributional Q-Function

Extension of the Q-value function that returns a probability distribution over expected returns instead of a single scalar value.

📖
용어

Atomization Parametrization

Technique for discretizing continuous distributions into finite sets of points (atoms) with associated probabilities to facilitate computational learning.

📖
용어

Categorical Distributional RL (C51)

Pioneering algorithm that models the return distribution as a discrete categorical distribution over a fixed support of values.

📖
용어

Distributional Bellman Operator

Generalization of the classical Bellman operator that applies to full distributions rather than just expected values.

📖
용어

Wasserstein Distance

Metric used to measure similarity between value distributions in the return space, allowing capture of both the location and shape of distributions.

📖
용어

Distributional Projection

Process of projecting continuous distributions onto a predefined discrete support, essential for practical implementation of distributional algorithms.

📖
용어

Distributional Risk

Measure of the uncertainty and variability in return predictions, quantified through the higher statistical moments of the value distribution.

📖
용어

Higher-Order Moments

Statistics (variance, skewness, kurtosis) describing the shape of the return distribution beyond the mean, capturing asymmetry and probability concentration.

📖
용어

Distributional Temporal Variation

Temporal evolution of the full shape of the return distribution rather than just its expected value, revealing changing risk patterns.

📖
용어

Discrete Value Support

Finite and ordered set of values on which continuous distributions are approximated in practical distributional algorithms.

📖
용어

Distributional Propagation

Process of updating value distributions via the Bellman operator, preserving uncertainty information at each time step.

📖
용어

Distributional Stability

Property of convergence of value distributions to a stable form during learning, ensuring the consistency of uncertainty estimates.

🔍

결과를 찾을 수 없습니다