MARL Partially Observable
Value Function Factorization
Technique decomposing the global value function into a sum of individual or local value functions, enabling decentralized learning while preserving global consistency.
← Tillbaka