🏠 Startseite
Vergleiche
📊 Alle Benchmarks 🦖 Dinosaurier v1 🦖 Dinosaurier v2 ✅ To-Do-Listen-Apps 🎨 Kreative freie Seiten 🎯 FSACB - Ultimatives Showcase 🌍 Übersetzungs-Benchmark
Modelle
🏆 Top 10 Modelle 🆓 Kostenlose Modelle 📋 Alle Modelle ⚙️ Kilo Code
Ressourcen
💬 Prompt-Bibliothek 📖 KI-Glossar 🔗 Nützliche Links

KI-Glossar

Das vollständige Wörterbuch der Künstlichen Intelligenz

162
Kategorien
2.032
Unterkategorien
23.060
Begriffe
📖
Begriffe

Inverse Reinforcement Learning

Learning method where the agent infers the reward function from expert demonstrations rather than receiving explicit rewards.

📖
Begriffe

Maximum Entropy IRL

Variant of IRL that assumes the expert follows the maximum entropy probability distribution among all optimal policies.

📖
Begriffe

Behavioral Cloning

Supervised learning approach that directly learns to imitate expert actions without explicitly inferring the reward function.

📖
Begriffe

Expert Trajectory

Sequence of states and actions observed in an expert, representing an optimal or near-optimal solution to the problem.

📖
Begriffe

Policy Equivalence

Principle that multiple reward functions can lead to the same optimal policy, creating ambiguity in IRL.

📖
Begriffe

Bayesian Inverse Reinforcement Learning

IRL approach using Bayesian inference to estimate a distribution over possible reward functions.

📖
Begriffe

Preference Cost

Transformation of the reward function into a cost function, where the agent learns to minimize total cost while following demonstrations.

📖
Begriffe

Adversarial Inverse Reinforcement Learning

IRL method using an adversarial game where a generator learns the policy and a discriminator distinguishes expert trajectories.

📖
Begriffe

Active Inverse Reinforcement Learning

Variant of IRL where the agent can query the expert to obtain additional demonstrations and reduce uncertainty.

📖
Begriffe

Objective Function Inference

Mathematical process of deducing the underlying objective function from observations of the expert's behavior.

📖
Begriffe

Imitation Bias

Tendency of the agent to over-imitate the expert's actions without understanding the underlying intention, leading to poor generalizations.

📖
Begriffe

Reinforcement Learning with Expert Feedback

Combination of RL and IRL where a model first trains on expert data, then is refined with human feedback.

📖
Begriffe

Feature Function

Function that maps state-action pairs to a feature space, used to represent the reward function in a linear manner.

📖
Begriffe

Multi-task Inverse Reinforcement Learning

Extension of IRL where multiple tasks are learned simultaneously by sharing knowledge between reward functions.

🔍

Keine Ergebnisse gefunden