🏠 Trang chủ
Benchmark
📊 Tất cả benchmark 🦖 Khủng long v1 🦖 Khủng long v2 ✅ Ứng dụng To-Do List 🎨 Trang tự do sáng tạo 🎯 FSACB - Trình diễn cuối cùng 🌍 Benchmark dịch thuật
Mô hình
🏆 Top 10 mô hình 🆓 Mô hình miễn phí 📋 Tất cả mô hình ⚙️ Kilo Code
Tài nguyên
💬 Thư viện prompt 📖 Thuật ngữ AI 🔗 Liên kết hữu ích

Thuật ngữ AI

Từ điển đầy đủ về Trí tuệ nhân tạo

162
danh mục
2.032
danh mục con
23.060
thuật ngữ
📖
thuật ngữ

Return distribution

Complete probabilistic representation of the sum of discounted future rewards, capturing all possible scenarios rather than a single expected value.

📖
thuật ngữ

Quantile distribution

Approach that directly models the quantiles of the return distribution to capture the variability and distribution tails of rewards.

📖
thuật ngữ

Conditional value at risk

Robust risk measure calculating the expected value of returns in the lower tail of the distribution, beyond a specified quantile.

📖
thuật ngữ

Implicit distribution

Distributional representation learned indirectly without explicit parameters, often through generative neural networks or samplers.

📖
thuật ngữ

Return variance

Dispersion measure quantifying the mean square deviation of returns from their expectation, a key indicator of risk in decisions.

📖
thuật ngữ

Policy entropy

Uncertainty measure on the agent's actions, used to explore the state-action space and quantify behavioral uncertainty.

📖
thuật ngữ

Confidence bound

Statistical intervals guaranteeing with a predefined probability that the true value lies within the estimated range, essential for safe exploration.

📖
thuật ngữ

Cramer distribution

Family of flexible distributions allowing modeling of skewness and heavy tails in returns, beyond Gaussian assumptions.

📖
thuật ngữ

Kernel estimation

Non-parametric method for estimating the probability density of returns using kernel functions to smooth empirical observations.

📖
thuật ngữ

Uncertainty propagation

Process of propagating uncertainty through successive steps of reinforcement learning, from observations to final decisions.

📖
thuật ngữ

Variational approximation

Optimization method approximating complex distributions by simpler families, minimizing divergence between distributions.

📖
thuật ngữ

Mixture distribution

Weighted combination of several base distributions, allowing to capture multimodal behaviors in expected returns.

📖
thuật ngữ

Cumulative distribution function

Function F(x) giving the probability that the return is less than or equal to x, completely characterizing the distribution of returns.

📖
thuật ngữ

Bias-variance tradeoff

Fundamental dilemma between model complexity (high variance, low bias) and its simplicity (low variance, high bias) in distributional estimation.

🔍

Không tìm thấy kết quả