AI Glossary

The complete dictionary of Artificial Intelligence

162 categories, 2,032 subcategories, 23,060 terms
📖 Terms

Inverse Reinforcement Learning

Learning method where the agent infers the reward function from expert demonstrations rather than receiving explicit rewards.

Maximum Entropy IRL

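Variant of IRL that resolves reward ambiguity by choosing the maximum entropy distribution over trajectories that matches the expert's feature expectations, so that higher-reward trajectories are exponentially more likely.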
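
As a rough illustration, in the maximum entropy model a trajectory's probability is proportional to the exponential of its total reward (assuming deterministic dynamics). The sketch below computes that distribution over a small, hand-picked set of trajectory returns. The function name `maxent_trajectory_probs` and the example returns are assumptions made for illustration only.

```python
import math

def maxent_trajectory_probs(returns):
    """Return P(trajectory) proportional to exp(total reward) for each trajectory."""
    # Subtract the maximum before exponentiating for numerical stability.
    highest = max(returns)
    weights = [math.exp(r - highest) for r in returns]
    normalizer = sum(weights)
    return [w / normalizer for w in weights]

# Hypothetical total rewards of three candidate trajectories.
probs = maxent_trajectory_probs([1.0, 1.0, 0.0])
```

Two trajectories with equal reward receive equal probability, which is how the model avoids committing to one optimal behavior over another.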

Behavioral Cloning

Supervised learning approach that directly learns to imitate expert actions without explicitly inferring the reward function.
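
A minimal sketch of the idea, assuming a small discrete state space: each state is treated as an input and the expert's action as its label, and the "classifier" is a majority vote per state. The demonstrations and the function name `behavioral_cloning` are hypothetical. Practical systems would typically fit a neural network or another supervised model instead.

```python
from collections import Counter, defaultdict

def behavioral_cloning(demonstrations):
    """Learn a state -> action table by majority vote over expert (state, action) pairs."""
    counts = defaultdict(Counter)
    for trajectory in demonstrations:
        for state, action in trajectory:
            counts[state][action] += 1
    # For each state, pick the action the expert chose most often.
    return {state: c.most_common(1)[0][0] for state, c in counts.items()}

# Hypothetical expert demonstrations: lists of (state, action) pairs.
demos = [
    [("s0", "right"), ("s1", "right"), ("s2", "up")],
    [("s0", "right"), ("s1", "up"), ("s2", "up")],
    [("s0", "right"), ("s1", "right"), ("s2", "up")],
]
policy = behavioral_cloning(demos)
```

No reward function is recovered at any point, and the learned table has no entry for states the expert never visited.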

Expert Trajectory

Sequence of states and actions observed from an expert, representing an optimal or near-optimal solution to the problem.

Policy Equivalence

Principle that multiple reward functions can lead to the same optimal policy, creating ambiguity in IRL.
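
The ambiguity can be shown on a toy problem. The sketch below runs value iteration on a hypothetical three-state chain (the states, actions, and rewards are all made up for illustration) and compares the greedy policy under a reward with the policy under a positively scaled and shifted copy of it. In this discounted, non-terminating setting both yield the same policy.

```python
def greedy_policy(reward, gamma=0.9, sweeps=200):
    """Value iteration on a deterministic chain; reward[s] is paid on arriving in state s."""
    n = len(reward)
    moves = {"left": -1, "right": 1}

    def step(state, action):
        # Moving past either end of the chain leaves the agent in place.
        return min(max(state + moves[action], 0), n - 1)

    def q_value(state, action, values):
        nxt = step(state, action)
        return reward[nxt] + gamma * values[nxt]

    values = [0.0] * n
    for _ in range(sweeps):
        values = [max(q_value(s, a, values) for a in moves) for s in range(n)]
    return [max(moves, key=lambda a: q_value(s, a, values)) for s in range(n)]

base_reward = [0.0, 0.0, 1.0]
rescaled_reward = [5.0 * r + 3.0 for r in base_reward]  # a different reward function

policy_a = greedy_policy(base_reward)
policy_b = greedy_policy(rescaled_reward)
```

An IRL method observing only the resulting behavior cannot tell these two reward functions apart.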

Bayesian Inverse Reinforcement Learning

IRL approach using Bayesian inference to estimate a distribution over possible reward functions.
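
A simplified sketch of the inference step, assuming a one-step decision problem (so an action's value equals its reward), a finite set of candidate reward functions, and an expert who picks actions with probability proportional to `exp(beta * reward)`. All names and numbers here are hypothetical. Full Bayesian IRL works with Q-values in a sequential problem and usually samples from the posterior rather than enumerating candidates.

```python
import math

def boltzmann_likelihood(reward, action, beta=1.0):
    """Probability that a noisily rational expert picks `action` under `reward`."""
    normalizer = sum(math.exp(beta * r) for r in reward.values())
    return math.exp(beta * reward[action]) / normalizer

def reward_posterior(candidates, prior, demonstrations, beta=1.0):
    """Posterior over candidate rewards: prior times likelihood, then normalized."""
    unnormalized = []
    for reward, p in zip(candidates, prior):
        likelihood = 1.0
        for action in demonstrations:
            likelihood *= boltzmann_likelihood(reward, action, beta)
        unnormalized.append(p * likelihood)
    total = sum(unnormalized)
    return [u / total for u in unnormalized]

# Two hypothetical reward functions over actions "a" and "b", with a uniform prior.
candidates = [{"a": 1.0, "b": 0.0}, {"a": 0.0, "b": 1.0}]
posterior = reward_posterior(candidates, [0.5, 0.5], ["a", "a", "b"])
```

The result is a distribution rather than a single reward estimate, so the remaining uncertainty after three demonstrations stays visible.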

Preference Cost

Transformation of the reward function into a cost function, where the agent learns to minimize total cost while following demonstrations.

Adversarial Inverse Reinforcement Learning

IRL method using an adversarial game where a generator learns the policy and a discriminator learns to distinguish expert trajectories from those produced by the generator.

Active Inverse Reinforcement Learning

Variant of IRL where the agent can query the expert to obtain additional demonstrations and reduce uncertainty.

Objective Function Inference

Mathematical process of deducing the underlying objective function from observations of the expert's behavior.

Imitation Bias

Tendency of the agent to over-imitate the expert's actions without understanding the underlying intention, leading to poor generalization.

Reinforcement Learning with Expert Feedback

Combination of RL and IRL where a model first trains on expert data, then is refined with human feedback.

Feature Function

Function that maps state-action pairs to a feature space, so that the reward function can be represented as a linear combination of features.
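
For instance, a linear reward is often written as the dot product of a weight vector with the feature vector of a state-action pair. The sketch below assumes a hypothetical gridworld with a goal at `(2, 2)`. The particular features, weights, and function names are illustrative choices, not part of any standard.

```python
GOAL = (2, 2)  # hypothetical goal cell

def features(state, action):
    """Map a (state, action) pair to a small feature vector."""
    x, y = state
    distance_to_goal = abs(GOAL[0] - x) + abs(GOAL[1] - y)
    stayed_in_place = 1.0 if action == "stay" else 0.0
    return [float(distance_to_goal), stayed_in_place]

def linear_reward(weights, state, action):
    """Reward as a weighted sum of features: R(s, a) = w . phi(s, a)."""
    return sum(w * f for w, f in zip(weights, features(state, action)))

weights = [-1.0, -0.5]  # penalize distance to the goal and standing still
far_reward = linear_reward(weights, (0, 0), "right")
goal_reward = linear_reward(weights, (2, 2), "stay")
```

With this representation, inferring the reward reduces to estimating the weight vector.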

Multi-task Inverse Reinforcement Learning

Extension of IRL where multiple tasks are learned simultaneously by sharing knowledge between reward functions.
