MARL Continu
Multi-Agent Partially Observable Markov Decision Process (MPOMDP)
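A mathematical formalization of multi-agent reinforcement learning (MARL) environments in which each agent receives only a partial observation of the environment and must infer the underlying global state from its observations in order to make optimal decisions.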
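One common way to "infer the global state" is to maintain a belief, that is, a probability distribution over global states, and update it with Bayes' rule after each step. The sketch below is illustrative only and rests on assumptions not stated in the definition above: a small discrete state space, known transition and observation models, and agents whose observations can be pooled into one joint observation. The names `belief_update`, `transition`, and `agent_likelihoods` are hypothetical.

```python
def belief_update(belief, transition, obs_likelihood):
    """One Bayes-filter step over a discrete global state.

    belief[s]          : prior probability of global state s
    transition[s][s2]  : P(s2 | s, joint action), for one fixed joint action
    obs_likelihood[s2] : P(joint observation | s2, joint action)
    """
    n = len(belief)
    # Predict: push the prior belief through the transition model.
    predicted = [
        sum(belief[s] * transition[s][s2] for s in range(n))
        for s2 in range(n)
    ]
    # Correct: weight each state by how well it explains the observation.
    unnormalized = [obs_likelihood[s2] * predicted[s2] for s2 in range(n)]
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observation has zero probability under the model")
    return [p / total for p in unnormalized]


# Hypothetical two-state, two-agent example (all numbers are made up).
prior = [0.5, 0.5]
transition = [[0.9, 0.1], [0.2, 0.8]]

# agent_likelihoods[i][s] = P(agent i's observation | state s).
# Assumption: the agents' observations are conditionally independent
# given the state, so the joint likelihood is their product.
agent_likelihoods = [[0.8, 0.3], [0.7, 0.4]]
joint_likelihood = [
    agent_likelihoods[0][s] * agent_likelihoods[1][s] for s in range(2)
]

posterior = belief_update(prior, transition, joint_likelihood)
print(posterior)
```

Because both agents' observations favor the first state, the posterior shifts toward it. If agents cannot share observations, each would need to maintain its own belief from its local observation alone, which this sketch does not cover.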