Scalability in MARL
QMIX Algorithm
Multi-agent Q-learning algorithm ensuring monotonicity between individual values and joint value, enabling stable learning in large-scale systems.
← ZurückMulti-agent Q-learning algorithm ensuring monotonicity between individual values and joint value, enabling stable learning in large-scale systems.
← Zurück