AI Glossary

The complete dictionary of Artificial Intelligence

162 categories · 2,032 subcategories · 23,060 terms
Subcategories

Behavioral Cloning

Supervised learning in which the agent directly imitates expert actions from demonstrations.

13 terms

Inverse Reinforcement Learning

Infers the reward function under which the observed expert behavior is optimal.

6 terms

Generative Adversarial Imitation Learning

Uses adversarial networks in which a discriminator learns to distinguish the agent's actions from the expert's.

12 terms

Dataset Aggregation (DAgger)

Iterative method that collects new expert data on the agent's own trajectories to improve the policy.

19 terms

Reward Learning from Human Feedback

Learns rewards from comparative or qualitative evaluations provided by humans.

14 terms

Offline Reinforcement Learning

Reinforcement learning that uses only fixed datasets, with no interaction with the environment.

9 terms

Model-Based Imitation Learning

Builds a dynamics model of the environment to accelerate imitation learning.

10 terms

Meta-Imitation Learning

Learns to quickly imitate new tasks with only a few demonstrations.

17 terms

Hierarchical Imitation Learning

Decomposes complex behaviors into a hierarchy of simpler subtasks to imitate.

10 terms

Multi-Modal Imitation Learning

Handles multiple valid solutions for the same task by learning a distribution over actions.

9 terms

Self-Imitation Learning

The agent imitates its own successful past actions to improve its current policy.

17 terms

Goal-Conditioned Imitation Learning

Learns a policy conditioned on specific goals so that it can accomplish a variety of tasks.

15 terms

Adversarial Inverse Reinforcement Learning

Combines IRL with adversarial learning for more robust reward estimation.

12 terms

Imitation Learning with Partial Observations

Imitation learning in environments where the agent observes only part of the state.

14 terms

Curriculum Imitation Learning

Presents demonstrations in order of increasing difficulty to facilitate learning.

14 terms