AI Glossary

The complete dictionary of artificial intelligence

162 categories
2,032 subcategories
23,060 terms
📂 Subcategories

Classic Multi-armed Bandits

Foundational problem in which an agent repeatedly chooses among several options (arms) with unknown reward distributions to maximize cumulative reward.

10 terms

Epsilon-Greedy Algorithms

Strategy that exploits the best known action with probability 1-ε and explores randomly with probability ε.

10 terms
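
The epsilon-greedy rule described above can be sketched in a few lines. This is a minimal illustration, not a reference implementation: the function names, the Bernoulli reward simulation, and the default values (`epsilon=0.1`, `steps=5000`) are assumptions chosen for the example.

```python
import random


def epsilon_greedy_select(estimates, epsilon, rng):
    """Explore with probability epsilon, otherwise exploit the best estimate."""
    if rng.random() < epsilon:
        return rng.randrange(len(estimates))  # explore: uniform random arm
    # exploit: arm with the highest estimated reward (first one on ties)
    return max(range(len(estimates)), key=lambda a: estimates[a])


def run_epsilon_greedy(true_means, epsilon=0.1, steps=5000, seed=0):
    """Simulate Bernoulli arms (hypothetical setup) and return pull counts and estimates."""
    rng = random.Random(seed)
    counts = [0] * len(true_means)
    estimates = [0.0] * len(true_means)
    for _ in range(steps):
        arm = epsilon_greedy_select(estimates, epsilon, rng)
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # incremental sample mean of the rewards seen for this arm
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
    return counts, estimates


if __name__ == "__main__":
    counts, estimates = run_epsilon_greedy([0.2, 0.5, 0.8])
    print(counts, [round(e, 3) for e in estimates])
```

With `epsilon=0` the rule is purely greedy, and with `epsilon=1` it picks arms uniformly at random.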

UCB Algorithms

Methods based on upper confidence bounds that balance exploration and exploitation through statistical intervals.

13 terms
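
As one concrete instance of an upper-confidence-bound method, the sketch below uses the UCB1 index (sample mean plus `sqrt(2 ln t / n)`). The glossary entry does not name a specific variant, so the choice of UCB1, the function names, and the Bernoulli simulation are assumptions made for illustration.

```python
import math
import random


def ucb1_select(counts, estimates, t):
    """Return the arm with the largest UCB1 index at round t (1-indexed)."""
    # Pull every arm once before any confidence bound is computed.
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    scores = [
        estimates[a] + math.sqrt(2.0 * math.log(t) / counts[a])
        for a in range(len(counts))
    ]
    return max(range(len(counts)), key=lambda a: scores[a])


def run_ucb1(true_means, steps=2000, seed=0):
    """Simulate Bernoulli arms (hypothetical setup) under UCB1."""
    rng = random.Random(seed)
    counts = [0] * len(true_means)
    estimates = [0.0] * len(true_means)
    for t in range(1, steps + 1):
        arm = ucb1_select(counts, estimates, t)
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
    return counts, estimates
```

The bonus term shrinks as an arm is pulled more often, so rarely tried arms keep getting revisited until their interval is tight enough to rule them out.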

Thompson Sampling

Bayesian approach that samples parameters from their posterior distribution to make decisions.

0 terms
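
A common textbook instance of this idea is Thompson Sampling for Bernoulli rewards with a Beta posterior per arm. The sketch below assumes that setting, a uniform Beta(1, 1) prior, and hypothetical function names; other reward models need a different posterior.

```python
import random


def thompson_select(successes, failures, rng):
    """Sample each arm's success rate from its Beta posterior and pick the max."""
    samples = [
        rng.betavariate(s + 1, f + 1)  # Beta(1, 1) prior plus observed counts
        for s, f in zip(successes, failures)
    ]
    return max(range(len(samples)), key=lambda a: samples[a])


def run_thompson(true_means, steps=2000, seed=0):
    """Simulate Bernoulli arms (hypothetical setup) under Thompson Sampling."""
    rng = random.Random(seed)
    successes = [0] * len(true_means)
    failures = [0] * len(true_means)
    for _ in range(steps):
        arm = thompson_select(successes, failures, rng)
        if rng.random() < true_means[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
    return successes, failures
```

Exploration comes from the randomness of the posterior samples: arms with little data have wide posteriors and are occasionally sampled high enough to be tried.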

Contextual Bandits

Extension where decisions depend on contextual features observed at each round.

10 terms

Linear Bandits

Models where the expected reward is a linear function of contextual features.

12 terms

Non-Stationary Bandits

Framework where reward distributions change over time, requiring continuous adaptation.

13 terms
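
One simple way to adapt to changing rewards is to replace the sample mean with a constant step-size (exponential recency-weighted) average, which weights recent rewards more heavily. The sketch below compares the two on a reward sequence that changes abruptly. The function name and the step size `alpha=0.1` are assumptions for illustration, and this is only one of several techniques used for non-stationary bandits.

```python
def track(rewards, alpha=0.1):
    """Return (sample average, recency-weighted average) after seeing all rewards."""
    sample_avg = 0.0
    recency_avg = 0.0
    for n, r in enumerate(rewards, start=1):
        # sample mean: every past reward has equal weight 1/n
        sample_avg += (r - sample_avg) / n
        # constant step size: a reward seen k steps ago has weight alpha * (1 - alpha)**k
        recency_avg += alpha * (r - recency_avg)
    return sample_avg, recency_avg


if __name__ == "__main__":
    # An arm that pays 0 for 100 rounds, then switches to paying 1.
    rewards = [0.0] * 100 + [1.0] * 100
    sample_avg, recency_avg = track(rewards)
    print(round(sample_avg, 3), round(recency_avg, 3))
```

After the switch, the sample average is still held back by the old rewards, while the recency-weighted estimate has moved almost all the way to the new level.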

Combinatorial Bandits

Problems where the agent selects sets of actions simultaneously with structural constraints.

10 terms

Adversarial Bandits

Scenario where an adversary chooses rewards to minimize the agent's gain.

10 terms
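
A standard algorithm for this setting is EXP3, which keeps exponential weights over arms and uses importance-weighted reward estimates. The glossary entry does not name an algorithm, so the choice of EXP3 is an assumption here, as are the function names, the `reward_fn(t, arm)` callback, and `gamma=0.1`. Rewards are assumed to lie in [0, 1].

```python
import math
import random


def exp3_probabilities(weights, gamma):
    """Mix the normalized weights with uniform exploration of rate gamma."""
    total = sum(weights)
    k = len(weights)
    return [(1.0 - gamma) * w / total + gamma / k for w in weights]


def exp3_update(weights, probs, arm, reward, gamma):
    """Update the pulled arm's weight in place using an importance-weighted reward."""
    estimated = reward / probs[arm]  # unbiased estimate of the arm's reward
    weights[arm] *= math.exp(gamma * estimated / len(weights))


def run_exp3(reward_fn, n_arms, gamma=0.1, steps=1000, seed=0):
    """Play EXP3 against reward_fn(t, arm) and return the final arm probabilities."""
    rng = random.Random(seed)
    weights = [1.0] * n_arms
    for t in range(steps):
        probs = exp3_probabilities(weights, gamma)
        arm = rng.choices(range(n_arms), weights=probs)[0]
        exp3_update(weights, probs, arm, reward_fn(t, arm), gamma)
        # Rescale to avoid overflow; the probabilities depend only on weight ratios.
        total = sum(weights)
        weights = [w / total for w in weights]
    return exp3_probabilities(weights, gamma)
```

Because the arm is drawn at random from `probs`, the agent's choice cannot be predicted exactly by whoever sets the rewards, which is what makes guarantees possible without any statistical assumption on the reward sequence.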

Cascading Bandits

Model where the user examines a ranked list of items in order and stops at the first one they click.

14 terms
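
The click behavior behind the cascading model can be simulated as follows. This is a sketch under stated assumptions: each item has an independent attraction probability, the user scans from the top and stops at the first click, and the names `simulate_cascade`, `observed_feedback`, and `attraction` are hypothetical.

```python
import random


def simulate_cascade(ranked_items, attraction, rng):
    """Return the position of the first clicked item, or None if nothing is clicked."""
    for position, item in enumerate(ranked_items):
        if rng.random() < attraction[item]:
            return position  # the user clicks and stops examining the list
    return None


def observed_feedback(ranked_items, click_position):
    """Return (item, clicked) pairs for the items the user actually examined."""
    if click_position is None:
        examined = ranked_items  # the whole list was examined without a click
    else:
        examined = ranked_items[: click_position + 1]
    return [
        (item, 1 if pos == click_position else 0)
        for pos, item in enumerate(examined)
    ]
```

Items ranked below the click produce no feedback at all, so a learner can only update its estimates for the examined prefix of the list.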

Bandits with Limited Feedback

Situations where only partial information about the rewards is observed after each action.

14 terms

Bandits for Online Advertising

Application of bandit methods to optimizing advertising campaigns in real time.

8 terms

Bandits for A/B Testing

Adaptive alternative to traditional A/B testing that shifts traffic toward better-performing variants while the experiment is still running.

5 terms

Bandits for Recommendations

Systems that learn user preferences to personalize recommendations.

7 terms

Hierarchical Bandits

Multi-level structures where decisions are organized hierarchically for complex problems.

10 terms