🏠 Hem
Benchmarkar
📊 Alla benchmarkar 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List-applikationer 🎨 Kreativa fria sidor 🎯 FSACB - Ultimata uppvisningen 🌍 Översättningsbenchmark
Modeller
🏆 Topp 10 modeller 🆓 Gratis modeller 📋 Alla modeller ⚙️ Kilo Code
Resurser
💬 Promptbibliotek 📖 AI-ordlista 🔗 Användbara länkar

AI-ordlista

Den kompletta ordlistan över AI

162
kategorier
2 032
underkategorier
23 060
termer
📖
termer

Curiosity-Driven RL

Reinforcement learning approach where the agent generates intrinsic rewards based on its curiosity to encourage the exploration of complex environments with sparse extrinsic rewards.

📖
termer

Intrinsic Motivation

Computational psychological mechanism that drives an agent to act to satisfy internal needs such as curiosity, rather than for task-specific external rewards.

📖
termer

Prediction Error

Measure of the difference between a world model's predictions and the actual observations, used as a curiosity signal to encourage the exploration of unexpected states.

📖
termer

Intrinsic Curiosity Module (ICM)

Neural architecture composed of forward and inverse dynamics models that generates intrinsic rewards based on prediction uncertainty to guide exploration.

📖
termer

Random Network Distillation (RND)

Exploration method where a fixed random neural network is used as a target for a predictor network, with the prediction error serving as an intrinsic reward for novel states.

📖
termer

Count-Based Exploration

Exploration strategy that assigns curiosity bonuses inversely proportional to the visitation frequency of states, thus encouraging the discovery of less explored regions.

📖
termer

Pseudo-counts

Approximate estimation of state visitation frequencies in continuous or high-dimensional spaces, used to implement count-based curiosity bonuses.

📖
termer

Empowerment

Information-theoretic measure quantifying the control an agent exerts over its environment, which is maximized to encourage exploratory behaviors that increase the agent's influence.

📖
termer

Information Gain

Amount of new information acquired by the agent about the environment, used as an intrinsic signal to direct exploration toward the most informative regions.

📖
termer

Episodic Curiosity

Curiosity approach based on short-term memory where the agent is motivated to visit states different from those recently observed in the current episode.

📖
termer

Variational Information Maximization Exploration (VIME)

Exploration method that maximizes mutual information between model parameters and future observations, using Bayesian approaches to quantify uncertainty.

📖
termer

State Visitation Count

Counter of the number of times a particular state has been visited, used to calculate exploration bonuses that favor the discovery of rare or unexplored states.

📖
termer

Curiosity-Driven Exploration

Exploration paradigm where the agent is guided by intrinsic rewards based on novelty or surprise, rather than by predefined random exploration strategies.

📖
termer

Lifelong Curiosity

Ability of an agent to maintain exploratory motivation over long periods, continuously adapting its behaviors to discover new knowledge in changing environments.

📖
termer

Novelty Detection

Process of identifying observations or states significantly different from past experiences, serving as a basis for generating curiosity signals.

📖
termer

Go-Explore

Exploration algorithm that explicitly memorizes visited states with their corresponding trajectories, then systematically explores from these anchor points to discover new regions.

🔍

Inga resultat hittades