🏠 홈
벤치마크
📊 모든 벤치마크 🦖 공룡 v1 🦖 공룡 v2 ✅ 할 일 목록 앱 🎨 창의적인 자유 페이지 🎯 FSACB - 궁극의 쇼케이스 🌍 번역 벤치마크
모델
🏆 톱 10 모델 🆓 무료 모델 📋 모든 모델 ⚙️ 킬로 코드 모드
리소스
💬 프롬프트 라이브러리 📖 AI 용어 사전 🔗 유용한 링크

AI 용어집

인공지능 완전 사전

162
카테고리
2,032
하위 카테고리
23,060
용어
📖
용어

Continuous Quantile Distribution

Representation of the return distribution as a set of evolving quantiles in continuous action spaces, allowing fine modeling of uncertainty and risks.

📖
용어

Cramer-Wold Distributional Projection

Mathematical technique enabling comparison of distributions by projecting onto one-dimensional directions, essential for distributional metrics in continuous RL.

📖
용어

Atomic Distribution Network

Neural architecture representing a distribution as a weighted set of fixed atoms, suitable for continuous action problems with stochastic returns.

📖
용어

Distributional Risk in Continuum

Measure quantifying uncertainty in return distributions of continuous action spaces, crucial for robust policy evaluation.

📖
용어

Distributional Stochastic Policy

Action strategy directly incorporating return distribution in continuous action selection, optimizing over the entire distribution rather than just the expectation.

📖
용어

Quantile Distribution Expectation

Operator calculating expectation from quantile representation, preserving distributional properties in continuous spaces.

📖
용어

Distributional Rejection Sampling

Sampling method preserving distributional properties when generating continuous actions from complex return distributions.

📖
용어

Stochastic Distributional Optimization

Optimization paradigm working directly on return distributions rather than their point estimates in continuous spaces.

📖
용어

Distributional Kernel Approximation

Technique using kernel functions to approximate return distributions in high-dimensional continuous action spaces.

📖
용어

Wasserstein Distance in Continuous RL

Metric measuring the dissimilarity between return distributions, particularly adapted to continuous action problems with complex geometry.

📖
용어

Distributional Importance Sampling

Weighted sampling technique preserving distributional characteristics when estimating policy gradients in continuous settings.

📖
용어

Distributional Monte-Carlo Update

Algorithm updating return distributions using Monte-Carlo samples in continuous action spaces, preserving the distributional shape.

📖
용어

Distributional Variance Reduction

Set of techniques aiming to reduce variance in distributional estimates without losing information about distribution shapes.

📖
용어

Distributional Greedy Policy

Strategy selecting optimal actions based on criteria on the full distribution (e.g., quantile, CVaR) rather than just expectation in continuous settings.

📖
용어

Distributional Bellman Equation

Formulation of the Bellman equation operating on complete distributions rather than scalar values, fundamental to continuous distributional RL.

📖
용어

Continuous Distributional Critic

Neural network estimating the complete return distribution for continuous state-action pairs, replacing the traditional scalar value critic.

📖
용어

Distributional Bias in Continuous Action

Phenomenon where distributional approximations introduce systematic biases in return estimation in continuous action spaces.

📖
용어

Continuous Distributional Normalization

Normalization technique preserving distributional properties when processing continuous actions at different scales.

📖
용어

Adaptive Distributional Exploration

Exploration strategy using complete return distribution information to adapt exploratory behavior in continuous action.

🔍

결과를 찾을 수 없습니다