
AI Glossary

Complete glossary of artificial intelligence

162 categories
2,032 subcategories
23,060 terms
📂 Subcategory

Stochastic Markov Decision Processes

MDPs whose transitions and rewards follow probability distributions, modeling environmental uncertainty.

17 terms
📂 Subcategory

Monte Carlo Methods in RL

Algorithms using repeated random sampling to estimate state-action values in stochastic environments.

14 terms
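As a rough illustration of the sampling idea, here is a minimal first-visit Monte Carlo sketch. The five-state random walk, the function names, and the episode count are all assumptions made for this example, and it estimates state values under a fixed random policy rather than state-action values, to keep the code short.

```python
import random
from collections import defaultdict

def run_episode(rng, start=2, n_states=5):
    """Simulate one random-walk episode (hypothetical toy task).

    States 0 and n_states - 1 are terminal; reaching the right end pays 1.
    Returns a list of (state, reward) pairs.
    """
    state, trajectory = start, []
    while 0 < state < n_states - 1:
        next_state = state + rng.choice((-1, 1))
        reward = 1.0 if next_state == n_states - 1 else 0.0
        trajectory.append((state, reward))
        state = next_state
    return trajectory

def first_visit_mc(n_episodes=20000, gamma=1.0, seed=0):
    """Estimate state values by averaging sampled first-visit returns."""
    rng = random.Random(seed)
    returns_sum = defaultdict(float)
    returns_count = defaultdict(int)
    for _ in range(n_episodes):
        g = 0.0
        first_return = {}
        # Walk backwards so g accumulates the return from each step onward;
        # overwriting leaves the return of the earliest visit to each state.
        for state, reward in reversed(run_episode(rng)):
            g = reward + gamma * g
            first_return[state] = g
        for state, g_first in first_return.items():
            returns_sum[state] += g_first
            returns_count[state] += 1
    return {s: returns_sum[s] / returns_count[s] for s in returns_sum}

values = first_visit_mc()
```

For this toy walk the undiscounted values are known in closed form (state s has value s/4), so the estimates can be checked against 0.25, 0.5, and 0.75.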
📂 Subcategory

Stochastic Policies

Policies that return a probability distribution over actions rather than a single deterministic action.

11 terms
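One common way to build such a policy is a softmax over action preferences. The sketch below assumes hypothetical preference scores and a temperature parameter; it is an illustration of the idea, not a reference implementation.

```python
import math
import random

def softmax(preferences, temperature=1.0):
    """Turn action preferences into a probability distribution."""
    m = max(preferences)  # subtract the max for numerical stability
    exps = [math.exp((p - m) / temperature) for p in preferences]
    total = sum(exps)
    return [e / total for e in exps]

def sample_action(probs, rng):
    """Draw one action index according to the policy's probabilities."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]

rng = random.Random(0)
probs = softmax([1.0, 2.0, 3.0])  # hypothetical preferences for 3 actions
action = sample_action(probs, rng)
```

Lowering the temperature concentrates probability on the highest-preference action, so the policy approaches a deterministic one in the limit.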
📂 Subcategory

Bayesian Reinforcement Learning

An approach that represents uncertainty over model parameters with probability distributions.

9 terms
📂 Subcategory

Multi-armed Stochastic Bandits

Exploration-exploitation problem where each arm has an unknown stochastic reward distribution.

7 terms
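The exploration-exploitation trade-off can be sketched with an epsilon-greedy agent on Bernoulli arms. The arm probabilities, the step count, and the value of epsilon below are assumptions chosen for the example; epsilon-greedy is only one of several strategies for this problem.

```python
import random

def epsilon_greedy_bandit(arm_probs, n_steps=5000, epsilon=0.1, seed=0):
    """Run epsilon-greedy on Bernoulli arms; return estimates and pull counts."""
    rng = random.Random(seed)
    n_arms = len(arm_probs)
    counts = [0] * n_arms
    estimates = [0.0] * n_arms
    for _ in range(n_steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: pick a random arm
        else:
            # exploit: pick the arm with the highest current estimate
            arm = max(range(n_arms), key=lambda a: estimates[a])
        reward = 1.0 if rng.random() < arm_probs[arm] else 0.0
        counts[arm] += 1
        # incremental update of the sample-mean reward for this arm
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
    return estimates, counts

# Hypothetical arms; the agent never sees these probabilities directly.
estimates, counts = epsilon_greedy_bandit([0.2, 0.5, 0.8])
```

After enough steps most pulls should go to the arm with the highest success probability, while the epsilon fraction of random pulls keeps the other estimates from going stale.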
📂 Subcategory

Bootstrap Methods in RL

Techniques using resampling to quantify uncertainty in value estimates.

15 terms
📂 Subcategory

Gaussian Processes for RL

Using Gaussian processes to model uncertainty in the value or transition function.

10 terms
📂 Subcategory

Ensemble Methods in Stochastic RL

Combination of multiple estimators to capture epistemic uncertainty in learning.

19 terms
📂 Subcategory

Distributional Reinforcement Learning

Learning the full distribution of returns rather than only their expected value.

5 terms
📂 Subcategory

Quantile Regression DRL

A distributional RL approach that uses quantile regression to model the distribution of returns.

8 terms
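Quantile regression rests on the pinball (quantile) loss, whose minimizer is the requested quantile of the data. The sketch below shows only that loss on a hypothetical list of sampled returns; it is not a full distributional RL agent, and the sample values are an assumption for illustration.

```python
def pinball_loss(target, prediction, tau):
    """Quantile (pinball) loss for a single target at quantile level tau."""
    error = target - prediction
    # Under-predictions are weighted by tau, over-predictions by (1 - tau).
    return tau * error if error >= 0 else (tau - 1.0) * error

def mean_pinball_loss(samples, prediction, tau):
    """Average pinball loss of one candidate prediction over all samples."""
    return sum(pinball_loss(x, prediction, tau) for x in samples) / len(samples)

# Hypothetical sampled returns: 1.0, 2.0, ..., 100.0
samples = [float(i) for i in range(1, 101)]
tau = 0.9

# Search the samples themselves for the candidate with the lowest loss.
best = min(samples, key=lambda q: mean_pinball_loss(samples, q, tau))
```

With these samples the loss is flat between 90 and 91, so either value is a valid 0.9-quantile estimate; in practice the same loss would be minimized by gradient descent over a network's quantile outputs.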
📂 Subcategory

Partially Observable Stochastic MDPs

Extension of stochastic MDPs with partial observation, increasing uncertainty about the state.

8 terms
📂 Subcategory

Stochastic Optimization in RL

Optimization methods accounting for noise and uncertainty in gradients and updates.

10 terms