Scalability in MARL
Population Based Training for MARL
Evolutionary optimization method where a population of multi-agent policies evolves in parallel, enabling efficient exploration of cooperative strategy space.
← Indietro