🏠 Hem
Benchmarkar
📊 Alla benchmarkar 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List-applikationer 🎨 Kreativa fria sidor 🎯 FSACB - Ultimata uppvisningen 🌍 Översättningsbenchmark
Modeller
🏆 Topp 10 modeller 🆓 Gratis modeller 📋 Alla modeller ⚙️ Kilo Code
Resurser
💬 Promptbibliotek 📖 AI-ordlista 🔗 Användbara länkar

AI-ordlista

Den kompletta ordlistan över AI

162
kategorier
2 032
underkategorier
23 060
termer
📂
underkategorier

Stochastic Markov Decision Processes

MDP where transitions and rewards follow probabilistic distributions, modeling environmental uncertainty.

17 termer
📂
underkategorier

Monte Carlo Methods in RL

Algorithms using repeated random sampling to estimate state-action values in stochastic environments.

14 termer
📂
underkategorier

Stochastic Policies

Strategies returning probability distributions over actions rather than deterministic actions.

11 termer
📂
underkategorier

Bayesian Reinforcement Learning

Approach handling uncertainty over model parameters using probability distributions.

9 termer
📂
underkategorier

Multi-armed Stochastic Bandits

Exploration-exploitation problem where each arm has an unknown stochastic reward distribution.

7 termer
📂
underkategorier

Bootstrap Methods in RL

Techniques using resampling to quantify uncertainty in value estimates.

15 termer
📂
underkategorier

Gaussian Processes for RL

Using Gaussian processes to model uncertainty in the value or transition function.

10 termer
📂
underkategorier

Ensemble Methods in Stochastic RL

Combination of multiple estimators to capture epistemic uncertainty in learning.

19 termer
📂
underkategorier

Distributional Reinforcement Learning

Learning the full distribution of returns rather than only their expected value.

5 termer
📂
underkategorier

Quantile Regression DRL

Specific approach of distributional RL using quantile regression to model uncertainty.

8 termer
📂
underkategorier

Partially Observable Stochastic MDPs

Extension of stochastic MDPs with partial observation, increasing uncertainty about the state.

8 termer
📂
underkategorier

Stochastic Optimization in RL

Optimization methods accounting for noise and uncertainty in gradients and updates.

10 termer
🔍

Inga resultat hittades