AI Glossary

The complete dictionary of Artificial Intelligence

162 categories, 2,032 subcategories, 23,060 terms
📂 Subcategories

Multi-Armed Bandits

Fundamental problem where an agent chooses among several options with random rewards to maximize cumulative gain.

16 terms
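
As a rough illustration of the multi-armed bandit setting described above, the sketch below runs an epsilon-greedy agent on Bernoulli arms. Epsilon-greedy is only one of many possible strategies, and the function name `epsilon_greedy`, its parameters, and the arm probabilities are assumptions made for this example, not something taken from the glossary.

```python
import random

def epsilon_greedy(arm_probs, steps=5000, epsilon=0.1, seed=0):
    """Run epsilon-greedy on Bernoulli arms (hypothetical helper).

    Returns (pull counts, value estimates, total reward).
    """
    rng = random.Random(seed)
    n_arms = len(arm_probs)
    counts = [0] * n_arms        # how often each arm was pulled
    estimates = [0.0] * n_arms   # running mean reward per arm
    total = 0
    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimate so far.
            arm = max(range(n_arms), key=lambda a: estimates[a])
        # Simulated environment: reward is 1 with the arm's success probability.
        reward = 1 if rng.random() < arm_probs[arm] else 0
        counts[arm] += 1
        # Incremental update of the sample mean.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total += reward
    return counts, estimates, total

counts, estimates, total = epsilon_greedy([0.1, 0.5, 0.9])
print(counts, [round(e, 2) for e in estimates], total)
```

With a small `epsilon`, most pulls should concentrate on the arm with the highest success probability, while the occasional random pull keeps the other estimates from going stale.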

Contextual Bandits

Extension of bandits where rewards depend on an observable context, enabling personalized adaptive decisions.

15 terms

Combinatorial Bandits

Variant where the agent must select combinations of actions simultaneously with complex constraints and rewards.

16 terms

Linear Bandits

Approach where rewards are modeled as linear functions of action features or context.

11 terms

Non-Stationary Bandits

Scenario where reward distributions change over time, requiring adaptive algorithms.

12 terms
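
One common way to adapt to changing reward distributions is to weight recent rewards more heavily than old ones. The sketch below compares a plain sample average with a constant step-size update on a reward stream that switches from 0 to 1 halfway through. The helper name `track_rewards` and the chosen step size are assumptions for illustration only.

```python
def track_rewards(rewards, step_size=None):
    """Return the final value estimate for a reward stream (hypothetical helper).

    step_size=None uses the sample average (all rewards weighted equally);
    a constant step_size gives an exponential recency-weighted average.
    """
    estimate = 0.0
    for n, reward in enumerate(rewards, start=1):
        alpha = step_size if step_size is not None else 1.0 / n
        estimate += alpha * (reward - estimate)
    return estimate

# The reward level shifts from 0 to 1 after 100 steps.
stream = [0] * 100 + [1] * 100
print(track_rewards(stream))        # sample average lags behind the shift
print(track_rewards(stream, 0.1))   # constant step size follows the shift
```

The sample average still reflects the old regime long after the shift, while the constant step-size estimate forgets old rewards geometrically and ends up close to the new reward level.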

Bandits with Delay

Problem where rewards are only observed after a delay, complicating the attribution of actions to outcomes.

17 terms

Adversarial Bandits

Model where rewards are generated by an adversary rather than a stochastic process.

16 terms

Bayesian Bandits

Approach using Bayesian inference to model uncertainty about reward distributions.

12 terms
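
A well-known instance of the Bayesian approach is Thompson sampling with a Beta posterior over each arm's success probability. The following is a minimal sketch under that assumption. The function name `thompson_sampling`, the uniform Beta(1, 1) prior, and the simulated arm probabilities are illustrative choices, not taken from the glossary.

```python
import random

def thompson_sampling(arm_probs, steps=5000, seed=0):
    """Beta-Bernoulli Thompson sampling (hypothetical helper).

    Returns (pull counts, posterior alphas, posterior betas).
    """
    rng = random.Random(seed)
    n_arms = len(arm_probs)
    alphas = [1] * n_arms  # prior pseudo-count of successes, Beta(1, 1)
    betas = [1] * n_arms   # prior pseudo-count of failures
    pulls = [0] * n_arms
    for _ in range(steps):
        # Draw one plausible success rate per arm from its posterior.
        samples = [rng.betavariate(alphas[a], betas[a]) for a in range(n_arms)]
        # Act greedily with respect to the sampled rates.
        arm = max(range(n_arms), key=lambda a: samples[a])
        # Simulated environment: Bernoulli reward.
        reward = 1 if rng.random() < arm_probs[arm] else 0
        # Conjugate posterior update.
        alphas[arm] += reward
        betas[arm] += 1 - reward
        pulls[arm] += 1
    return pulls, alphas, betas

pulls, alphas, betas = thompson_sampling([0.1, 0.5, 0.9])
print(pulls)
```

Because arms are chosen by sampling from the posterior, uncertain arms still get tried occasionally, and exploration fades naturally as the posteriors sharpen.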

Hierarchical Bandits

Multi-level structure where decisions are organized hierarchically to efficiently explore large action spaces.

17 terms

Bandits with Constraints

Constrained optimization where the agent must maximize rewards while respecting certain limitations.

20 terms

Bandits for Recommendation

Application of bandits to recommendation systems, balancing the exploration of new content against the exploitation of content already known to perform well.

8 terms

Online Bandits

Continuous learning where the agent adapts in real time to new information without a prior training phase.

9 terms