AI Glossary

The complete glossary of artificial intelligence

162 categories, 2,032 subcategories, 23,060 terms

Return distribution

Complete probabilistic representation of the sum of discounted future rewards, capturing all possible scenarios rather than a single expected value.
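
To make the definition concrete, here is a minimal sketch that builds an empirical return distribution by sampling episodes. The toy environment (random 0/1 rewards), the function names, and the discount factor of 0.9 are all assumptions chosen for illustration.

```python
import random

def discounted_return(rewards, gamma=0.9):
    """Sum of gamma**t * r_t over one episode, accumulated backwards."""
    total = 0.0
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total

def sample_returns(num_episodes=1000, horizon=5, gamma=0.9, seed=0):
    """Draw episodes from a hypothetical toy environment with 0/1 rewards.

    The list of returns is an empirical stand-in for the return distribution.
    """
    rng = random.Random(seed)
    returns = []
    for _ in range(num_episodes):
        rewards = [rng.choice([0.0, 1.0]) for _ in range(horizon)]
        returns.append(discounted_return(rewards, gamma))
    return returns

if __name__ == "__main__":
    returns = sample_returns()
    print("mean return:", sum(returns) / len(returns))
    print("worst / best:", min(returns), max(returns))
```

The mean of these samples is the usual expected return; keeping the whole list preserves the spread and the tails that a single expected value hides.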

Quantile distribution

Approach that directly models the quantiles of the return distribution to capture the variability and distribution tails of rewards.
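
One common way to learn quantiles directly is quantile regression with the pinball loss. The sketch below assumes evenly spaced quantile levels at interval midpoints; the function names are hypothetical and the code is not tied to any particular library.

```python
def quantile_midpoints(num_quantiles):
    """Quantile levels tau_i = (2i + 1) / (2N), one per modeled quantile."""
    return [(2 * i + 1) / (2 * num_quantiles) for i in range(num_quantiles)]

def pinball_loss(estimate, sample, tau):
    """Asymmetric loss whose expected value is minimized at the tau-quantile.

    Samples above the estimate are weighted by tau, samples below it by
    (1 - tau).
    """
    error = sample - estimate
    indicator = 1.0 if error < 0 else 0.0
    return (tau - indicator) * error

def total_loss(estimate, samples, tau):
    """Pinball loss of one quantile estimate summed over observed returns."""
    return sum(pinball_loss(estimate, s, tau) for s in samples)
```

For tau = 0.5 the loss is half the absolute error, so minimizing it recovers the median; other values of tau pull the estimate toward the corresponding tail.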

Conditional value at risk

Risk measure equal to the expected return within the lower tail of the distribution, that is, conditional on the return falling at or below a specified quantile.
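
As an illustration, the lower-tail expectation can be estimated from sampled returns by averaging the worst fraction of them. This is a simple empirical estimator under the assumption that the samples are representative; the name `cvar_lower_tail` and the default level are hypothetical.

```python
import math

def cvar_lower_tail(returns, alpha=0.1):
    """Average of the worst alpha-fraction of sampled returns."""
    if not returns:
        raise ValueError("returns must not be empty")
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    ordered = sorted(returns)
    # Small tolerance so that floating-point noise in alpha * n
    # does not pull one extra sample into the tail.
    tail_size = max(1, math.ceil(alpha * len(ordered) - 1e-9))
    tail = ordered[:tail_size]
    return sum(tail) / len(tail)
```

With `alpha=1.0` the estimate reduces to the ordinary mean return, which shows how the measure moves from risk-neutral to risk-averse as `alpha` shrinks.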

Implicit distribution

Distributional representation learned indirectly without explicit parameters, often through generative neural networks or samplers.

Return variance

Dispersion measure quantifying the mean squared deviation of returns from their expectation, a key indicator of risk in decision making.

Policy entropy

Measure of the randomness of the agent's action distribution in a given state, often used as a bonus to encourage exploration of the state-action space and to quantify behavioral uncertainty.
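
For a discrete action space, the entropy in question is the Shannon entropy of the action probabilities in one state. A minimal sketch, assuming the policy is given as a plain list of probabilities (the function name is hypothetical):

```python
import math

def policy_entropy(action_probs):
    """Shannon entropy (in nats) of a discrete action distribution."""
    if abs(sum(action_probs) - 1.0) > 1e-9:
        raise ValueError("action probabilities must sum to 1")
    # Terms with zero probability contribute nothing (limit of p * log p).
    return -sum(p * math.log(p) for p in action_probs if p > 0.0)
```

A deterministic policy has entropy 0, while a uniform policy over n actions reaches the maximum, log(n).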

Confidence bound

Statistical interval that contains the true value with a predefined probability, essential for safe exploration.
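
One standard way to obtain such a bound is Hoeffding's inequality. The sketch below assumes the samples are independent and bounded in a known range `[low, high]`, which is a simplification for reinforcement learning data; the function name is hypothetical.

```python
import math

def hoeffding_interval(samples, low, high, delta=0.05):
    """Two-sided interval that holds with probability at least 1 - delta.

    Assumes i.i.d. samples with values in [low, high].
    """
    n = len(samples)
    if n == 0:
        raise ValueError("need at least one sample")
    mean = sum(samples) / n
    half_width = (high - low) * math.sqrt(math.log(2.0 / delta) / (2.0 * n))
    return mean - half_width, mean + half_width
```

The half-width shrinks as 1/sqrt(n), which is why optimistic exploration rules add a bonus that decays as an action is tried more often.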

Cramer distribution

Family of flexible distributions that can model skewness and heavy tails in returns, beyond Gaussian assumptions.

Kernel estimation

Non-parametric method for estimating the probability density of returns using kernel functions to smooth empirical observations.
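
A minimal sketch of the idea with a Gaussian kernel follows. The bandwidth is left as an explicit argument because choosing it is a separate problem; the function name is hypothetical.

```python
import math

def gaussian_kde(samples, bandwidth):
    """Return a density function built from Gaussian kernels on the samples."""
    if not samples or bandwidth <= 0:
        raise ValueError("need samples and a positive bandwidth")
    norm = 1.0 / (len(samples) * bandwidth * math.sqrt(2.0 * math.pi))

    def density(x):
        # Each observed return contributes one smooth bump centered on it.
        return norm * sum(
            math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples
        )

    return density
```

A small bandwidth follows the observations closely and gives a spiky estimate; a large one smooths heavily and can hide multiple modes.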

Uncertainty propagation

Process of carrying uncertainty estimates through the successive steps of reinforcement learning, from observations to final decisions.

Variational approximation

Optimization method that approximates a complex distribution with a member of a simpler family by minimizing a divergence between the approximation and the target.

Mixture distribution

Weighted combination of several base distributions, making it possible to capture multimodal behavior in returns.
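
As a sketch, a mixture of Gaussians can be sampled in two stages: pick a component according to the weights, then draw from it. The choice of Gaussian components and the function names are assumptions for illustration.

```python
import random

def sample_mixture(weights, components, rng):
    """Draw one value from a mixture of Gaussians.

    components is a list of (mean, std) pairs; weights are their
    mixing proportions.
    """
    mean, std = rng.choices(components, weights=weights, k=1)[0]
    return rng.gauss(mean, std)

def mixture_mean(weights, components):
    """Mean of the mixture: the weighted average of the component means."""
    return sum(w * mean for w, (mean, _std) in zip(weights, components))

if __name__ == "__main__":
    rng = random.Random(0)
    weights = [0.25, 0.75]
    components = [(0.0, 1.0), (4.0, 1.0)]  # two well-separated modes
    draws = [sample_mixture(weights, components, rng) for _ in range(5)]
    print(draws, mixture_mean(weights, components))
```

With well-separated components, the mixture mean can fall in a region where few returns are ever observed, which is one reason a single expected value can mislead.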

Cumulative distribution function

Function F(x) giving the probability that the return is less than or equal to x, completely characterizing the distribution of returns.
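
When only sampled returns are available, F(x) can be approximated by the empirical CDF, the fraction of samples at or below x. A minimal sketch with a hypothetical function name:

```python
import bisect

def empirical_cdf(samples):
    """Return F_hat(x): the fraction of samples less than or equal to x."""
    if not samples:
        raise ValueError("need at least one sample")
    ordered = sorted(samples)
    n = len(ordered)

    def cdf(x):
        # bisect_right counts how many sorted samples are <= x.
        return bisect.bisect_right(ordered, x) / n

    return cdf
```

The result is a step function that rises from 0 to 1, jumping by 1/n at each observed return.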

Bias-variance tradeoff

Fundamental tradeoff between a complex model (low bias, high variance) and a simple one (high bias, low variance) in distributional estimation.
