🏠 Beranda
Benchmark
📊 Semua Benchmark 🦖 Dinosaurus v1 🦖 Dinosaurus v2 ✅ Aplikasi To-Do List 🎨 Halaman Bebas Kreatif 🎯 FSACB - Showcase Utama 🌍 Benchmark Terjemahan
Model
🏆 Top 10 Model 🆓 Model Gratis 📋 Semua Model ⚙️ Kilo Code
Sumber Daya
💬 Perpustakaan Prompt 📖 Glosarium AI 🔗 Tautan Berguna

Glosarium AI

Kamus lengkap Kecerdasan Buatan

162
kategori
2.032
subkategori
23.060
istilah
📖
istilah

State-action distribution

Probabilistic representation of the Q(s,a) value function that models the complete distribution of possible returns rather than just their mathematical expectation.

📖
istilah

Distributional transition model

Model-based reinforcement learning model that captures uncertainty in state transitions by modeling probability distributions over next states.

📖
istilah

Probabilistic dynamics model

Predictive model in model-based RL that generates probability distributions over next states or rewards rather than deterministic predictions.

📖
istilah

Epistemic uncertainty in RL

Uncertainty due to lack of knowledge about the environment model, modeled by distributions in distributional model-based RL approaches.

📖
istilah

Aleatoric uncertainty in RL

Inherent uncertainty in the environment that cannot be reduced even with more data, captured by distributions in distributional RL models.

📖
istilah

Distributional policy gradient

Extension of policy gradient methods that directly optimizes over the distribution of returns rather than their expectation, enabling risk-sensitive policies.

📖
istilah

Risk-sensitive RL

Reinforcement learning approach that uses distributional information to optimize risk metrics like CVaR or standard deviation instead of just expectation.

📖
istilah

Model ensembles in distributional RL

Technique using multiple independently learned models to capture epistemic uncertainty in distributional model-based RL approaches.

📖
istilah

Particle-based distribution models

Distributional modeling approach that represents distributions by a set of weighted particles, useful for complex transitions in model-based RL.

📖
istilah

Wasserstein distance in distributional RL

Metric used to measure dissimilarity between distributions in the distributional Bellman operator, offering better convergence properties than KL distance.

📖
istilah

Moment matching in distributional RL

Optimization technique that adjusts parameters to match statistical moments (mean, variance, etc.) of predicted and target distributions.

📖
istilah

Variational inference in RL

Method for approximating complex distributions by optimizing a family of simpler distributions, applied in model-based RL to handle uncertainty.

📖
istilah

Bayesian model-based RL

Approach that maintains a distribution over possible environment models, using Bayesian methods to quantify and exploit epistemic uncertainty.

📖
istilah

Distributional Bellman operator

Extension of the classic Bellman operator that operates on return distributions rather than scalar values, preserving distributional structure.

📖
istilah

Horizon-dependent distributions

Concept in distributional RL where the return distribution changes with the time horizon, capturing the evolution of uncertainty over different time scales.

📖
istilah

Categorical atomic projection

Mathematical operation used in C51 that projects the target distribution onto predefined atom support to maintain distributional consistency.

📖
istilah

Distributional uncertainty propagation

Process in model-based RL where the uncertainty of model predictions is propagated through planning steps to evaluate policy robustness.

🔍

Tidak ada hasil ditemukan