Centralized-Decentralized MARL
Decentralised Partially Observable MDP (Dec-POMDP)
Mathematical formalization of multi-agent decision problems with partial observability where each agent makes decisions based on its local observations. Agents must cooperate to maximize a shared global reward.
← 뒤로