AI Glossary

A comprehensive dictionary of Artificial Intelligence

162 categories · 2,032 subcategories · 23,060 terms

Meta-Reinforcement Learning

A reinforcement learning approach in which the agent learns how to learn: by training across a distribution of tasks, it acquires meta-knowledge that lets it adapt to a new task from only a few interactions.

Meta-Learner

An algorithm or model that optimizes the learning process itself, so that the resulting learner adapts rapidly to new tasks not seen during training.

Task-Specific Policy

A reinforcement learning policy adapted to one particular task, produced quickly by the meta-learner from a small amount of experience on that task.

Proximal Meta-Policy Search (ProMP)

A gradient-based meta-RL algorithm that brings PPO-style proximal (clipped) policy updates to meta-learning, optimizing a meta-policy that can be adapted into task-specific policies with a few policy gradient steps.

Meta-World

A standardized open-source benchmark of simulated robotic manipulation environments for evaluating meta-RL and multi-task RL algorithms over a varied task distribution.

RL² (Reinforcement Learning Squared)

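A meta-RL framework in which the reinforcement learning algorithm itself is learned by another RL process: a recurrent policy is trained with a standard ("slow") RL algorithm across many tasks, and its hidden state, which accumulates the history of observations, actions, and rewards, carries out the "fast" learning on each new task.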
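
To make "integrating history into the agent's state" concrete, here is a minimal sketch of how the per-step input to a recurrent RL² policy might be assembled. The function name `rl2_input` and the flat-list encoding are illustrative assumptions, not part of any particular library; the recurrent network that would consume this input is omitted.

```python
from typing import List, Sequence


def rl2_input(obs: Sequence[float], prev_action: int, prev_reward: float,
              prev_done: bool, num_actions: int) -> List[float]:
    """Build one time step of input for a recurrent policy (hypothetical helper).

    The current observation is concatenated with the previous action (one-hot),
    the previous reward, and the previous episode-termination flag, so the
    recurrent hidden state can accumulate the interaction history.
    """
    one_hot = [0.0] * num_actions
    one_hot[prev_action] = 1.0
    return list(obs) + one_hot + [float(prev_reward), 1.0 if prev_done else 0.0]


# Example: 2-dimensional observation, 3 discrete actions.
x = rl2_input(obs=[0.5, -1.0], prev_action=2, prev_reward=1.0,
              prev_done=False, num_actions=3)
```

In this setup the hidden state would be kept across episodes of the same task and reset only when a new task is sampled, which is what lets the network behave like a learning algorithm.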

Meta-Experience Replay

An experience buffer technique in which stored transitions are organized by task, to support rapid adaptation and knowledge transfer between tasks.
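
One simple way to organize a buffer by task is to keep a separate bounded queue per task identifier. The sketch below assumes that layout; the class `TaskReplayBuffer` and its methods are hypothetical names chosen for illustration.

```python
import random
from collections import deque


class TaskReplayBuffer:
    """Hypothetical per-task replay buffer: one bounded FIFO queue per task id."""

    def __init__(self, capacity_per_task, seed=0):
        self.capacity_per_task = capacity_per_task
        self._buffers = {}
        self._rng = random.Random(seed)

    def add(self, task_id, transition):
        # Create the task's queue on first use; the oldest items are evicted when full.
        buf = self._buffers.setdefault(task_id, deque(maxlen=self.capacity_per_task))
        buf.append(transition)

    def sample(self, task_id, batch_size):
        # Sample without replacement from a single task's transitions.
        data = list(self._buffers.get(task_id, ()))
        return self._rng.sample(data, min(batch_size, len(data)))

    def size(self, task_id):
        return len(self._buffers.get(task_id, ()))


buffer = TaskReplayBuffer(capacity_per_task=2)
for step in range(3):
    buffer.add("reach", ("s%d" % step, 0, 1.0, "s%d" % (step + 1)))
buffer.add("push", ("s0", 1, 0.0, "s1"))
batch = buffer.sample("reach", batch_size=5)  # only the 2 newest "reach" transitions remain
```

Keeping tasks separate means a batch can be drawn from exactly the task being adapted to, or deliberately mixed across tasks when transfer is the goal.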

Meta-Policy Gradient

An optimization method that computes gradients with respect to the meta-parameters in order to improve expected performance after adaptation over the task distribution.
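
As an illustration, one common form of this gradient (a MAML-style formulation with a single inner adaptation step, which is an assumption here and not the only possibility) can be written as follows. The symbols are introduced for this sketch: $\theta$ are the meta-parameters, $\alpha$ is the inner step size, $p(\mathcal{T})$ is the task distribution, and $J_{\mathcal{T}}$ is the exact expected return on task $\mathcal{T}$.

```latex
\begin{align}
\theta'_{\mathcal{T}} &= \theta + \alpha \, \nabla_\theta J_{\mathcal{T}}(\theta) \\
J_{\text{meta}}(\theta) &= \mathbb{E}_{\mathcal{T} \sim p(\mathcal{T})}
    \left[ J_{\mathcal{T}}\!\left(\theta'_{\mathcal{T}}\right) \right] \\
\nabla_\theta J_{\text{meta}}(\theta) &= \mathbb{E}_{\mathcal{T} \sim p(\mathcal{T})}
    \left[ \left( I + \alpha \, \nabla^2_\theta J_{\mathcal{T}}(\theta) \right)
    \nabla_{\theta'} J_{\mathcal{T}}(\theta') \Big|_{\theta' = \theta'_{\mathcal{T}}} \right]
\end{align}
```

The last line follows from the chain rule applied to the first. In practice each term must be estimated from sampled trajectories, and how to do that with low bias and variance is where specific algorithms differ.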

Hindsight Experience Replay (HER) in Meta-RL

An extension of HER to meta-RL in which stored experiences are relabeled with goals other than the one originally pursued, improving sample efficiency and generalization across tasks.
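
The relabeling step at the heart of HER can be sketched in a few lines. The example below assumes the simple "final" strategy (use the last achieved state as the substitute goal) and a sparse reward; the dictionary layout and the names `relabel_with_final_goal` and `sparse_reward` are illustrative assumptions.

```python
def sparse_reward(achieved, goal):
    """Assumed reward: 0 when the goal is reached, -1 otherwise."""
    return 0.0 if achieved == goal else -1.0


def relabel_with_final_goal(trajectory, reward_fn):
    """Return a copy of the trajectory whose goal is the state actually reached last."""
    new_goal = trajectory[-1]["next_state"]
    relabeled = []
    for step in trajectory:
        relabeled.append({
            **step,
            "goal": new_goal,
            "reward": reward_fn(step["next_state"], new_goal),
        })
    return relabeled


# A failed attempt to reach goal 5 on a number line: the agent only got to state 2.
trajectory = [
    {"state": 0, "action": 1, "next_state": 1, "goal": 5, "reward": -1.0},
    {"state": 1, "action": 1, "next_state": 2, "goal": 5, "reward": -1.0},
]
relabeled = relabel_with_final_goal(trajectory, sparse_reward)
```

After relabeling, the same transitions describe a successful attempt to reach state 2, so the final step now carries a non-failure reward that the learner can use.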

Curriculum Learning in Meta-RL

The progressive ordering of training tasks from simple to complex in order to improve the meta-learner's ability to adapt.
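
A curriculum of this kind can be as simple as sorting tasks by a difficulty score and widening the pool of sampled tasks as training progresses. The sketch below assumes such a score is available; `curriculum_pool`, the task names, and the scores are all hypothetical.

```python
import math


def curriculum_pool(tasks, difficulty, progress):
    """Return the tasks available at a given training progress in [0, 1].

    Tasks are sorted from easiest to hardest, and the fraction of the sorted
    list that is unlocked grows linearly with progress (at least one task).
    """
    progress = min(1.0, max(0.0, progress))
    ordered = sorted(tasks, key=difficulty)
    count = max(1, math.ceil(progress * len(ordered)))
    return ordered[:count]


# Hypothetical manipulation tasks with hand-assigned difficulty scores.
scores = {"reach": 1, "push": 2, "pick-place": 3, "stack": 4}
tasks = ["stack", "reach", "push", "pick-place"]

early = curriculum_pool(tasks, scores.get, progress=0.0)
middle = curriculum_pool(tasks, scores.get, progress=0.5)
late = curriculum_pool(tasks, scores.get, progress=1.0)
```

A linear unlock schedule is only one choice; schedules driven by the learner's measured success rate on each task are a common alternative.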

Meta-Imitation Learning

A combination of meta-learning and imitation learning in which the agent learns to imitate new behaviors quickly from only a few demonstrations.

Meta-Off-Policy Evaluation

Estimation of how well a meta-learned policy performs on new tasks using only previously collected off-policy data, without further interaction with the environment.
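
A standard building block for off-policy evaluation is the trajectory-wise importance sampling estimator, sketched below for a single new task. This is a generic estimator used as an illustration, not a method prescribed by this glossary; the function name, the trajectory format, and the toy policies are assumptions.

```python
def importance_sampling_value(trajectories, target_prob, behavior_prob, gamma=1.0):
    """Estimate the target policy's expected return from logged trajectories.

    Each trajectory is a list of (state, action, reward) tuples collected by the
    behavior policy. Every return is reweighted by the product of per-step
    probability ratios target_prob / behavior_prob.
    """
    total = 0.0
    for trajectory in trajectories:
        weight, discounted_return = 1.0, 0.0
        for t, (state, action, reward) in enumerate(trajectory):
            weight *= target_prob(state, action) / behavior_prob(state, action)
            discounted_return += (gamma ** t) * reward
        total += weight * discounted_return
    return total / len(trajectories)


# Toy one-step task: action 1 pays reward 1, action 0 pays 0.
logged = [[("s", 0, 0.0)], [("s", 1, 1.0)]]          # data from a uniform behavior policy
behavior = lambda state, action: 0.5
target = lambda state, action: 0.8 if action == 1 else 0.2   # policy under evaluation

estimate = importance_sampling_value(logged, target, behavior)
```

On this toy data the estimate agrees with the target policy's true expected reward of 0.8. With longer trajectories the product of ratios can have very high variance, which is why weighted or per-decision variants are often preferred.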
