Centralized-Decentralized MARL
Multi-Agent Deep Deterministic Policy Gradient (MADDPG)
Extension of DDPG to multi-agent environments using centralized-decentralized learning with centralized critics and decentralized actors. Each agent learns a policy by considering other agents' policies as part of the environment.
← Quay lại