🏠 Home
Benchmark
📊 Tutti i benchmark 🦖 Dinosauro v1 🦖 Dinosauro v2 ✅ App To-Do List 🎨 Pagine libere creative 🎯 FSACB - Ultimate Showcase 🌍 Benchmark traduzione
Modelli
🏆 Top 10 modelli 🆓 Modelli gratuiti 📋 Tutti i modelli ⚙️ Kilo Code
Risorse
💬 Libreria di prompt 📖 Glossario IA 🔗 Link utili

Glossario IA

Il dizionario completo dell'Intelligenza Artificiale

162
categorie
2.032
sottocategorie
23.060
termini
📖
termini

Inverse Reinforcement Learning

Method that consists of deducing the reward function of an expert from its optimal trajectories, then allowing the agent to learn an optimal policy.

📖
termini

State-only Imitation Learning

Learning paradigm where the agent only has access to the states visited by the expert without knowledge of the actions taken, requiring specific approaches to infer behaviors.

📖
termini

Trajectory Matching

Approach that minimizes the divergence between the trajectory distributions generated by the agent and those of the expert, often used in learning without access to actions.

📖
termini

GAIL

Framework combining imitation learning and generative adversarial networks, where a discriminator distinguishes the trajectories of the expert from those of the agent.

📖
termini

Dataset Aggregation

Iterative algorithm that collects new expert data based on the errors of the current agent, progressively aggregating a more robust dataset.

📖
termini

Forward-Forward Algorithm

Unsupervised learning method that predicts future states from current states without requiring action data, used in imitation by observation.

📖
termini

Observation-based Learning

Learning process where the agent acquires skills by observing only environmental states and results, without direct access to the expert's actions.

📖
termini

State Distribution Matching

Technique aiming to align the distribution of states visited by the agent with that of the expert, used when actions are not observable.

📖
termini

No-action Imitation

A form of imitation learning where the agent must learn to reproduce expert behavior without any information about the actions taken.

📖
termini

Passive Learning

Learning mode where the agent passively observes demonstrations without active interaction with the environment, typical of imitation by observation.

📖
termini

Expert Demonstration

Set of trajectories or states provided by an expert serving as reference for imitation learning, crucial in approaches without access to actions.

📖
termini

State-Action Distribution

Joint distribution of states and actions that the agent seeks to approximate, often inferred from the state distribution alone in imitation by observation.

📖
termini

Trajectory-based Learning

Learning approach that focuses on reproducing complete trajectories rather than individual state-action decisions, adapted to observation without actions.

📖
termini

Dynamics Model

Model learning the transition between consecutive states in expert demonstrations, essential for inferring actions when they are not observed.

📖
termini

Occupancy Measure

Statistical measure quantifying the visitation frequency of each state-action, adapted to contexts where only state visitations are observable.

🔍

Nessun risultato trovato