🏠 Hem
Benchmarkar
📊 Alla benchmarkar 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List-applikationer 🎨 Kreativa fria sidor 🎯 FSACB - Ultimata uppvisningen 🌍 Översättningsbenchmark
Modeller
🏆 Topp 10 modeller 🆓 Gratis modeller 📋 Alla modeller ⚙️ Kilo Code
Resurser
💬 Promptbibliotek 📖 AI-ordlista 🔗 Användbara länkar

AI-ordlista

Den kompletta ordlistan över AI

162
kategorier
2 032
underkategorier
23 060
termer
📂
underkategorier

Behavioral Cloning

Direct policy learning by minimizing the error between agent actions and expert demonstrations

17 termer
📂
underkategorier

Inverse Reinforcement Learning

Inferring the reward function from expert demonstrations to then learn the optimal policy.

14 termer
📂
underkategorier

Generative Adversarial Imitation Learning

Using adversarial networks to distinguish agent behaviors from expert demonstrations

18 termer
📂
underkategorier

DAgger Data Aggregation

Iterative data collection by querying the expert on states visited by the current policy

17 termer
📂
underkategorier

Offline Imitation Learning

Learning from a fixed set of demonstrations without additional interaction with the environment.

13 termer
📂
underkategorier

Apprentissage par Imitation en Ligne

Apprentissage continu avec interaction en temps réel et mises à jour basées sur les nouvelles démonstrations.

15 termer
📂
underkategorier

Observation-based Imitation

Learning by observing only states and trajectories without having access to expert actions.

15 termer
📂
underkategorier

Apprentissage par Imitation Hiérarchique

Décomposition des tâches complexes en sous-tâches avec apprentissage par imitation à différents niveaux d'abstraction.

17 termer
📂
underkategorier

One-Shot Imitation Learning

Ability to imitate a new task after observing a single demonstration.

11 termer
📂
underkategorier

Meta-Learning by Imitation

Learning to quickly learn new tasks by imitation through experience on multiple tasks.

20 termer
📂
underkategorier

Multimodal Imitation Learning

Handling demonstrations with multiple valid solutions and learning multimodal policies.

19 termer
📂
underkategorier

Imitation with Partial Observations

Imitation learning when demonstrations only partially cover the state space.

10 termer
🔍

Inga resultat hittades