🏠 Beranda
Benchmark
📊 Semua Benchmark 🦖 Dinosaurus v1 🦖 Dinosaurus v2 ✅ Aplikasi To-Do List 🎨 Halaman Bebas Kreatif 🎯 FSACB - Showcase Utama 🌍 Benchmark Terjemahan
Model
🏆 Top 10 Model 🆓 Model Gratis 📋 Semua Model ⚙️ Kilo Code
Sumber Daya
💬 Perpustakaan Prompt 📖 Glosarium AI 🔗 Tautan Berguna

Glosarium AI

Kamus lengkap Kecerdasan Buatan

162
kategori
2.032
subkategori
23.060
istilah
📖
istilah

Partial Observations

Scenario where demonstrations cover only a limited portion of the state space, creating unexplored areas that the agent must generalize.

📖
istilah

Robust Policy

A learning policy designed to maintain acceptable performance when faced with partial observations and states not seen during training.

📖
istilah

Policy Inference

Process of estimating the expert's underlying policy from a limited set of partial demonstration trajectories.

📖
istilah

Policy Generalization

The ability of a learned policy to perform correctly in states not observed during the demonstrations, crucial for partial observations.

📖
istilah

State Reconstruction

Technique for estimating missing or unobserved states from the partial information available in the demonstrations.

📖
istilah

Covered State Space

The subset of the total state space actually explored in the demonstrations, defining the limits of direct imitation learning.

📖
istilah

Learning from Demonstration

Synonym for imitation learning, specifically applied to scenarios where demonstrations are incomplete or noisy.

📖
istilah

Out-of-Distribution Evaluation

Methodology for evaluating the policy's performance on states not present in the training data to measure its robustness.

📖
istilah

Policy Function

Mathematical mapping π(a|s) that specifies the probability of choosing action a in state s, learned from partial demonstrations.

📖
istilah

State Distribution

Probabilistic distribution describing the frequency of occurrence of different states in the environment, often biased in partial demonstrations.

🔍

Tidak ada hasil ditemukan