🏠 홈
벤치마크
📊 모든 벤치마크 🦖 공룡 v1 🦖 공룡 v2 ✅ 할 일 목록 앱 🎨 창의적인 자유 페이지 🎯 FSACB - 궁극의 쇼케이스 🌍 번역 벤치마크
모델
🏆 톱 10 모델 🆓 무료 모델 📋 모든 모델 ⚙️ 킬로 코드 모드
리소스
💬 프롬프트 라이브러리 📖 AI 용어 사전 🔗 유용한 링크

AI 용어집

인공지능 완전 사전

162
카테고리
2,032
하위 카테고리
23,060
용어
📖
용어

Multi-Agent Reinforcement Learning

Learning paradigm where multiple agents simultaneously learn to make decisions in a shared environment, interacting with each other to optimize collective or individual objectives.

📖
용어

Multi-Agent Deep Deterministic Policy Gradient (MADDPG)

CTDE algorithm extending DDPG to multi-agent environments, using centralized critics and decentralized actors to learn in continuous action spaces.

📖
용어

Multi-Agent Partially Observable Markov Decision Process (MPOMDP)

Mathematical formalization of MARL environments where each agent has partial observations and must infer the global state to make optimal decisions.

📖
용어

Mean Field Games

Theory studying the interactions of a large number of rational agents by approximating the crowd effect through a mean field, applicable to large-scale multi-agent systems.

📖
용어

Continuous Control

Application domain of MARL where agents must control physical systems with continuous actions, such as mobile robotics or object manipulation.

📖
용어

Stochastic Games

Extension of MDPs to multi-agent environments where transitions and rewards depend on the joint actions of all agents, modeling cooperative and competitive scenarios.

📖
용어

Nash Equilibrium in MARL

Stability concept where no agent can improve its reward by unilaterally changing its strategy, used as a convergence criterion in competitive MARL algorithms.

📖
용어

Coordination Protocols

Communication or synchronization mechanisms allowing agents to align their actions to achieve collective objectives in continuous MARL environments.

🔍

결과를 찾을 수 없습니다