🏠 Home
Prestatietests
📊 Alle benchmarks 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List applicaties 🎨 Creatieve vrije pagina's 🎯 FSACB - Ultieme showcase 🌍 Vertaalbenchmark
Modellen
🏆 Top 10 modellen 🆓 Gratis modellen 📋 Alle modellen ⚙️ Kilo Code
Bronnen
💬 Promptbibliotheek 📖 AI-woordenlijst 🔗 Nuttige links

AI-woordenlijst

Het complete woordenboek van kunstmatige intelligentie

162
categorieën
2.032
subcategorieën
23.060
termen
📂
subcategorieën

Stochastic Markov Decision Processes

MDP where transitions and rewards follow probabilistic distributions, modeling environmental uncertainty.

17 termen
📂
subcategorieën

Monte Carlo Methods in RL

Algorithms using repeated random sampling to estimate state-action values in stochastic environments.

14 termen
📂
subcategorieën

Stochastic Policies

Strategies returning probability distributions over actions rather than deterministic actions.

11 termen
📂
subcategorieën

Bayesian Reinforcement Learning

Approach handling uncertainty over model parameters using probability distributions.

9 termen
📂
subcategorieën

Multi-armed Stochastic Bandits

Exploration-exploitation problem where each arm has an unknown stochastic reward distribution.

7 termen
📂
subcategorieën

Bootstrap Methods in RL

Techniques using resampling to quantify uncertainty in value estimates.

15 termen
📂
subcategorieën

Gaussian Processes for RL

Using Gaussian processes to model uncertainty in the value or transition function.

10 termen
📂
subcategorieën

Ensemble Methods in Stochastic RL

Combination of multiple estimators to capture epistemic uncertainty in learning.

19 termen
📂
subcategorieën

Distributional Reinforcement Learning

Learning the full distribution of returns rather than only their expected value.

5 termen
📂
subcategorieën

Quantile Regression DRL

Specific approach of distributional RL using quantile regression to model uncertainty.

8 termen
📂
subcategorieën

Partially Observable Stochastic MDPs

Extension of stochastic MDPs with partial observation, increasing uncertainty about the state.

8 termen
📂
subcategorieën

Stochastic Optimization in RL

Optimization methods accounting for noise and uncertainty in gradients and updates.

10 termen
🔍

Geen resultaten gevonden