AI Glossary

The complete dictionary of Artificial Intelligence

162 categories

2,032 subcategories

23,060 terms

Multi-Armed Bandits

Fundamental problem where an agent chooses among several options with random rewards to maximize cumulative gain.

16 terms
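
As an illustration of the exploration/exploitation trade-off described in this entry, here is a minimal sketch of the epsilon-greedy strategy on Bernoulli arms. The function name `epsilon_greedy`, its parameters, and the example reward probabilities are assumptions made for this example, not part of the glossary.

```python
import random

def epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Run epsilon-greedy on Bernoulli arms; return (pull counts, total reward)."""
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # how often each arm was pulled
    estimates = [0.0] * n_arms   # running mean reward per arm
    total = 0.0
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)  # explore: pick a random arm
        else:
            # exploit: pick the arm with the best estimate so far
            arm = max(range(n_arms), key=lambda a: estimates[a])
        # Simulated Bernoulli reward (hypothetical environment)
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the running mean
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total += reward
    return counts, total

counts, total = epsilon_greedy([0.2, 0.5, 0.8])
```

With a small `epsilon`, most pulls should concentrate on the arm with the highest true mean, while the remaining pulls keep the estimates of the other arms up to date.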

Contextual Bandits

Extension of bandits where rewards depend on an observable context, enabling personalized adaptive decisions.

15 terms

Combinatorial Bandits

Variant where the agent must select combinations of actions simultaneously with complex constraints and rewards.

16 terms

Linear Bandits

Approach where rewards are modeled as linear functions of action features or context.

11 terms

Non-Stationary Bandits

Scenario where reward distributions change over time, requiring adaptive algorithms.

12 terms

Bandits with Delay

Problem where rewards are only observed after a delay, complicating the attribution of actions to outcomes.

17 terms

Adversarial Bandits

Model where rewards are generated by an adversary rather than a stochastic process.

16 terms
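
One common way to handle adversarially chosen rewards is the EXP3 algorithm, which keeps exponential weights over arms and mixes in uniform exploration. The sketch below is a simplified version under stated assumptions: rewards lie in [0, 1], the full reward sequence `reward_rows` is a non-empty list fixed in advance, and the name `exp3` and its parameters are hypothetical.

```python
import math
import random

def exp3(reward_rows, gamma=0.1, seed=0):
    """Simplified EXP3; reward_rows[t][a] is the reward of arm a at round t."""
    rng = random.Random(seed)
    k = len(reward_rows[0])
    weights = [1.0] * k
    pulls = [0] * k
    for rewards in reward_rows:
        total_w = sum(weights)
        # Mix the weight-based distribution with uniform exploration
        probs = [(1 - gamma) * w / total_w + gamma / k for w in weights]
        arm = rng.choices(range(k), weights=probs)[0]
        # Importance-weighted estimate: only the pulled arm's reward is seen
        estimate = rewards[arm] / probs[arm]
        weights[arm] *= math.exp(gamma * estimate / k)
        # Rescale to avoid floating-point overflow; probabilities are unchanged
        top = max(weights)
        weights = [w / top for w in weights]
        pulls[arm] += 1
    return pulls, probs

pulls, probs = exp3([[0.0, 0.0, 1.0]] * 1000)
```

The rescaling step is a numerical convenience added for this sketch; it does not change the sampling distribution because only the ratios between weights matter.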

Bayesian Bandits

Approach using Bayesian inference to model uncertainty about reward distributions.

12 terms
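
A widely used instance of the Bayesian approach is Thompson sampling. The sketch below assumes Bernoulli rewards with a uniform Beta(1, 1) prior on each arm; the function name `thompson_sampling` and the example reward probabilities are assumptions made for illustration.

```python
import random

def thompson_sampling(true_means, steps=5000, seed=0):
    """Beta-Bernoulli Thompson sampling; returns pulls and posterior counts."""
    rng = random.Random(seed)
    n_arms = len(true_means)
    successes = [1] * n_arms  # Beta alpha parameters (uniform prior)
    failures = [1] * n_arms   # Beta beta parameters (uniform prior)
    pulls = [0] * n_arms
    for _ in range(steps):
        # Draw one plausible mean per arm from its current posterior
        samples = [rng.betavariate(successes[a], failures[a])
                   for a in range(n_arms)]
        arm = samples.index(max(samples))
        # Simulated Bernoulli reward (hypothetical environment)
        if rng.random() < true_means[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
        pulls[arm] += 1
    return pulls, successes, failures

pulls, successes, failures = thompson_sampling([0.2, 0.5, 0.8])
```

Arms whose posterior is still wide are sampled occasionally, so exploration comes from the uncertainty itself rather than from a separate exploration parameter.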

Hierarchical Bandits

Multi-level structure where decisions are organized hierarchically to efficiently explore large action spaces.

17 terms

Bandits with Constraints

Constrained optimization where the agent must maximize rewards while respecting certain limitations.

20 terms

Bandits for Recommendation

Specific application to recommendation systems for balancing exploration and exploitation of content.

8 terms

Online Bandits

Continuous learning where the agent adapts in real time to new information, without a prior training phase.

9 terms
