AI Glossary

The complete glossary of Artificial Intelligence

162 categories
2,032 subcategories
23,060 terms
Subcategories

Stochastic Markov Decision Processes

An MDP whose transitions and rewards follow probability distributions, modeling environmental uncertainty.

17 terms
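
As a rough illustration of the definition above, the sketch below samples one transition from a tiny tabular MDP. The states, the action name, the probabilities, and the rewards are all made up for the example, and `step` and `expected_reward` are hypothetical helper names.

```python
import random

# Hypothetical two-state MDP. P[state][action] lists the possible outcomes
# as (probability, next_state, reward). All numbers are illustrative only.
P = {
    "s0": {"go": [(0.8, "s1", 1.0), (0.2, "s0", 0.0)]},
    "s1": {"go": [(1.0, "s0", 0.0)]},
}

def step(state, action, rng):
    """Sample (next_state, reward) from the transition distribution."""
    outcomes = P[state][action]
    weights = [p for p, _, _ in outcomes]
    _, next_state, reward = rng.choices(outcomes, weights=weights, k=1)[0]
    return next_state, reward

def expected_reward(state, action):
    """Expected immediate reward: sum of probability * reward over outcomes."""
    return sum(p * r for p, _, r in P[state][action])

rng = random.Random(0)  # seeded so repeated runs give the same sample
print(step("s0", "go", rng))
print(expected_reward("s0", "go"))
```

The same state and action can lead to different next states and rewards on different calls, which is what distinguishes a stochastic MDP from a deterministic one.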

Monte Carlo Methods in RL

Algorithms using repeated random sampling to estimate state-action values in stochastic environments.

14 terms
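
A minimal sketch of the idea, assuming a toy environment in which every step pays a reward of 1 with probability 0.5 and 0 otherwise. The environment, the discount factor, and the function names are assumptions for illustration. The value is estimated by averaging the discounted returns of many sampled episodes.

```python
import random

GAMMA = 0.9    # assumed discount factor
HORIZON = 20   # assumed fixed episode length

def sample_return(rng):
    """Discounted return of one episode in the toy environment."""
    g = 0.0
    for t in range(HORIZON):
        reward = 1.0 if rng.random() < 0.5 else 0.0
        g += (GAMMA ** t) * reward
    return g

def mc_value_estimate(n_episodes, seed=0):
    """Monte Carlo estimate: the mean of the sampled episode returns."""
    rng = random.Random(seed)
    returns = [sample_return(rng) for _ in range(n_episodes)]
    return sum(returns) / n_episodes

# Closed-form expected return for this toy setup, used only for comparison.
true_value = 0.5 * (1 - GAMMA ** HORIZON) / (1 - GAMMA)

print(mc_value_estimate(10_000), true_value)
```

The estimate approaches the closed-form value as the number of episodes grows, with the error shrinking roughly as one over the square root of the sample count.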

Stochastic Policies

Policies that return a probability distribution over actions rather than a single deterministic action.

11 terms
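
One common way to build such a policy is a softmax over per-action preferences. The sketch below assumes that parameterization; the preference values and the function names are hypothetical.

```python
import math
import random

def softmax(prefs):
    """Turn real-valued action preferences into a probability distribution."""
    m = max(prefs)  # subtract the max for numerical stability
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]

def sample_action(prefs, rng):
    """Draw an action index according to the softmax probabilities."""
    probs = softmax(prefs)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]

rng = random.Random(0)
prefs = [1.0, 2.0, 0.5]  # illustrative preferences for three actions
print(softmax(prefs))
print([sample_action(prefs, rng) for _ in range(10)])
```

A deterministic policy would always return the index of the largest preference; the stochastic version still selects the other actions with nonzero probability.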

Bayesian Reinforcement Learning

An approach that represents uncertainty over model parameters with probability distributions.

9 terms

Multi-armed Stochastic Bandits

Exploration-exploitation problem where each arm has an unknown stochastic reward distribution.

7 terms
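
To make the exploration-exploitation trade-off concrete, here is a small sketch of epsilon-greedy, one simple strategy among many for this problem. It assumes Bernoulli arms; the arm means, step count, and `run_epsilon_greedy` name are illustrative.

```python
import random

def run_epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Play Bernoulli arms with epsilon-greedy; return pull counts and estimates."""
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms
    estimates = [0.0] * n_arms
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: pick a uniformly random arm
        else:
            # exploit: pick the arm with the highest current estimate
            arm = max(range(n_arms), key=lambda a: estimates[a])
        # The true means are hidden from the agent; it only sees sampled rewards.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the sample mean for the pulled arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
    return counts, estimates

counts, estimates = run_epsilon_greedy([0.2, 0.5, 0.8])
print(counts)
print(estimates)
```

With these settings most pulls should end up on the arm with the highest mean, while the small exploration rate keeps the estimates of the other arms from going stale.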

Bootstrap Methods in RL

Techniques using resampling to quantify uncertainty in value estimates.

15 terms

Gaussian Processes for RL

Using Gaussian processes to model uncertainty in the value or transition function.

10 terms

Ensemble Methods in Stochastic RL

Combination of multiple estimators to capture epistemic uncertainty in learning.

19 terms

Distributional Reinforcement Learning

Learning the full distribution of returns rather than only their expected value.

5 terms

Quantile Regression DRL

A distributional RL approach that uses quantile regression to model uncertainty in returns.

8 terms
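
The building block of quantile regression is the quantile (pinball) loss. The sketch below fits a single scalar quantile of a fixed sample by subgradient descent. It is not a full distributional RL agent, and the data, learning rate, and function names are assumptions for illustration.

```python
def pinball_loss(target, prediction, tau):
    """Quantile (pinball) loss for a quantile level tau in (0, 1)."""
    diff = target - prediction
    return tau * diff if diff >= 0 else (tau - 1.0) * diff

def fit_quantile(samples, tau, lr=0.05, epochs=200):
    """Estimate the tau-quantile of samples by subgradient descent on the pinball loss."""
    q = 0.0
    for _ in range(epochs):
        for x in samples:
            # Subgradient w.r.t. q: -tau if the sample lies above q, else (1 - tau).
            grad = -tau if x > q else (1.0 - tau)
            q -= lr * grad
    return q

samples = [float(i) for i in range(100)]  # illustrative "returns" 0, 1, ..., 99
print(fit_quantile(samples, 0.5))  # should settle near the median
print(fit_quantile(samples, 0.9))  # should settle near the 90th percentile
```

Fitting several quantile levels at once gives a coarse picture of the whole return distribution rather than only its mean.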

Partially Observable Stochastic MDPs

An extension of stochastic MDPs in which the agent only partially observes the state, adding uncertainty about the current state.

8 terms

Stochastic Optimization in RL

Optimization methods accounting for noise and uncertainty in gradients and updates.

10 terms