🏠 Home
Prestatietests
📊 Alle benchmarks 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List applicaties 🎨 Creatieve vrije pagina's 🎯 FSACB - Ultieme showcase 🌍 Vertaalbenchmark
Modellen
🏆 Top 10 modellen 🆓 Gratis modellen 📋 Alle modellen ⚙️ Kilo Code
Bronnen
💬 Promptbibliotheek 📖 AI-woordenlijst 🔗 Nuttige links

AI-woordenlijst

Het complete woordenboek van kunstmatige intelligentie

162
categorieën
2.032
subcategorieën
23.060
termen
📖
termen

Value Distribution

Complete representation of uncertainty about future returns in reinforcement learning, modeling the entire probability distribution of each possible return rather than just its expectation.

📖
termen

Distributional Reinforcement Learning

RL paradigm that explicitly models the full distribution of expected returns to capture uncertainty and variability of future outcomes.

📖
termen

Distributional Q-Function

Extension of the Q-value function that returns a probability distribution over expected returns instead of a single scalar value.

📖
termen

Atomization Parametrization

Technique for discretizing continuous distributions into finite sets of points (atoms) with associated probabilities to facilitate computational learning.

📖
termen

Categorical Distributional RL (C51)

Pioneering algorithm that models the return distribution as a discrete categorical distribution over a fixed support of values.

📖
termen

Distributional Bellman Operator

Generalization of the classical Bellman operator that applies to full distributions rather than just expected values.

📖
termen

Wasserstein Distance

Metric used to measure similarity between value distributions in the return space, allowing capture of both the location and shape of distributions.

📖
termen

Distributional Projection

Process of projecting continuous distributions onto a predefined discrete support, essential for practical implementation of distributional algorithms.

📖
termen

Distributional Risk

Measure of the uncertainty and variability in return predictions, quantified through the higher statistical moments of the value distribution.

📖
termen

Higher-Order Moments

Statistics (variance, skewness, kurtosis) describing the shape of the return distribution beyond the mean, capturing asymmetry and probability concentration.

📖
termen

Distributional Temporal Variation

Temporal evolution of the full shape of the return distribution rather than just its expected value, revealing changing risk patterns.

📖
termen

Discrete Value Support

Finite and ordered set of values on which continuous distributions are approximated in practical distributional algorithms.

📖
termen

Distributional Propagation

Process of updating value distributions via the Bellman operator, preserving uncertainty information at each time step.

📖
termen

Distributional Stability

Property of convergence of value distributions to a stable form during learning, ensuring the consistency of uncertainty estimates.

🔍

Geen resultaten gevonden