🏠 Home
Benchmark
📊 Tutti i benchmark 🦖 Dinosauro v1 🦖 Dinosauro v2 ✅ App To-Do List 🎨 Pagine libere creative 🎯 FSACB - Ultimate Showcase 🌍 Benchmark traduzione
Modelli
🏆 Top 10 modelli 🆓 Modelli gratuiti 📋 Tutti i modelli ⚙️ Kilo Code
Risorse
💬 Libreria di prompt 📖 Glossario IA 🔗 Link utili

Glossario IA

Il dizionario completo dell'Intelligenza Artificiale

162
categorie
2.032
sottocategorie
23.060
termini
📂
sottocategorie

Stochastic Markov Decision Processes

MDP where transitions and rewards follow probabilistic distributions, modeling environmental uncertainty.

17 termini
📂
sottocategorie

Monte Carlo Methods in RL

Algorithms using repeated random sampling to estimate state-action values in stochastic environments.

14 termini
📂
sottocategorie

Stochastic Policies

Strategies returning probability distributions over actions rather than deterministic actions.

11 termini
📂
sottocategorie

Bayesian Reinforcement Learning

Approach handling uncertainty over model parameters using probability distributions.

9 termini
📂
sottocategorie

Multi-armed Stochastic Bandits

Exploration-exploitation problem where each arm has an unknown stochastic reward distribution.

7 termini
📂
sottocategorie

Bootstrap Methods in RL

Techniques using resampling to quantify uncertainty in value estimates.

15 termini
📂
sottocategorie

Gaussian Processes for RL

Using Gaussian processes to model uncertainty in the value or transition function.

10 termini
📂
sottocategorie

Ensemble Methods in Stochastic RL

Combination of multiple estimators to capture epistemic uncertainty in learning.

19 termini
📂
sottocategorie

Distributional Reinforcement Learning

Learning the full distribution of returns rather than only their expected value.

5 termini
📂
sottocategorie

Quantile Regression DRL

Specific approach of distributional RL using quantile regression to model uncertainty.

8 termini
📂
sottocategorie

Partially Observable Stochastic MDPs

Extension of stochastic MDPs with partial observation, increasing uncertainty about the state.

8 termini
📂
sottocategorie

Stochastic Optimization in RL

Optimization methods accounting for noise and uncertainty in gradients and updates.

10 termini
🔍

Nessun risultato trovato