🏠 홈
벤치마크
📊 모든 벤치마크 🦖 공룡 v1 🦖 공룡 v2 ✅ 할 일 목록 앱 🎨 창의적인 자유 페이지 🎯 FSACB - 궁극의 쇼케이스 🌍 번역 벤치마크
모델
🏆 톱 10 모델 🆓 무료 모델 📋 모든 모델 ⚙️ 킬로 코드 모드
리소스
💬 프롬프트 라이브러리 📖 AI 용어 사전 🔗 유용한 링크

AI 용어집

인공지능 완전 사전

162
카테고리
2,032
하위 카테고리
23,060
용어
📖
용어

Hierarchical Actor-Critic (HAC)

Reinforcement learning architecture combining multi-level hierarchical actors and critics to solve complex tasks through temporal decomposition.

📖
용어

High-level Policy

Decision policy at the top of the hierarchy that selects subgoals or options to guide lower-level policies.

📖
용어

Low-level Policy

Base policy in the hierarchy that executes primitive actions to achieve the subgoals defined by the higher-level policy.

📖
용어

Subgoal

Intermediate goal defined by a higher-level agent that lower-level agents must achieve to progress toward the final goal.

📖
용어

Intra-option Policy

Policy that determines the actions to execute at each time step when a specific option is active within the hierarchical framework.

📖
용어

Feudal Networks (FuN)

Hierarchical architecture inspired by feudalism where a manager defines goal directions and workers execute actions to achieve these goals.

📖
용어

Controller

Lower-level agent that executes primitive actions to accomplish the subgoals specified by the meta-controller.

📖
용어

Hierarchical Deep Deterministic Policy Gradient (H-DDPG)

Extension of the DDPG algorithm incorporating a hierarchical actor-critic structure for learning in continuous action spaces.

📖
용어

Multi-level Actor-Critic

Architecture where each hierarchical level has its own actor-critic pair optimized for different temporal horizons.

📖
용어

Hierarchical Q-Learning

Q-learning variant where Q-values are computed at different hierarchical levels to evaluate options and primitive actions.

📖
용어

Subtask Decomposition

Process of automatically dividing a complex task into simpler, manageable subtasks for hierarchical learning.

📖
용어

End-to-end Hierarchical Learning

Approach where the entire policy hierarchy is trained simultaneously without manual pre-decomposition of tasks.

🔍

결과를 찾을 수 없습니다