🏠 Beranda
Benchmark
📊 Semua Benchmark 🦖 Dinosaurus v1 🦖 Dinosaurus v2 ✅ Aplikasi To-Do List 🎨 Halaman Bebas Kreatif 🎯 FSACB - Showcase Utama 🌍 Benchmark Terjemahan
Model
🏆 Top 10 Model 🆓 Model Gratis 📋 Semua Model ⚙️ Kilo Code
Sumber Daya
💬 Perpustakaan Prompt 📖 Glosarium AI 🔗 Tautan Berguna

Glosarium AI

Kamus lengkap Kecerdasan Buatan

162
kategori
2.032
subkategori
23.060
istilah
📖
istilah

Value Distribution

Complete representation of uncertainty about future returns in reinforcement learning, modeling the entire probability distribution of each possible return rather than just its expectation.

📖
istilah

Distributional Reinforcement Learning

RL paradigm that explicitly models the full distribution of expected returns to capture uncertainty and variability of future outcomes.

📖
istilah

Distributional Q-Function

Extension of the Q-value function that returns a probability distribution over expected returns instead of a single scalar value.

📖
istilah

Atomization Parametrization

Technique for discretizing continuous distributions into finite sets of points (atoms) with associated probabilities to facilitate computational learning.

📖
istilah

Categorical Distributional RL (C51)

Pioneering algorithm that models the return distribution as a discrete categorical distribution over a fixed support of values.

📖
istilah

Distributional Bellman Operator

Generalization of the classical Bellman operator that applies to full distributions rather than just expected values.

📖
istilah

Wasserstein Distance

Metric used to measure similarity between value distributions in the return space, allowing capture of both the location and shape of distributions.

📖
istilah

Distributional Projection

Process of projecting continuous distributions onto a predefined discrete support, essential for practical implementation of distributional algorithms.

📖
istilah

Distributional Risk

Measure of the uncertainty and variability in return predictions, quantified through the higher statistical moments of the value distribution.

📖
istilah

Higher-Order Moments

Statistics (variance, skewness, kurtosis) describing the shape of the return distribution beyond the mean, capturing asymmetry and probability concentration.

📖
istilah

Distributional Temporal Variation

Temporal evolution of the full shape of the return distribution rather than just its expected value, revealing changing risk patterns.

📖
istilah

Discrete Value Support

Finite and ordered set of values on which continuous distributions are approximated in practical distributional algorithms.

📖
istilah

Distributional Propagation

Process of updating value distributions via the Bellman operator, preserving uncertainty information at each time step.

📖
istilah

Distributional Stability

Property of convergence of value distributions to a stable form during learning, ensuring the consistency of uncertainty estimates.

🔍

Tidak ada hasil ditemukan