MARL Partially Observable
POMDP (Partially Observable Markov Decision Process)
Theoretical framework modeling environments where the agent perceives only a partial observation of the true state, requiring probabilistic inference about the hidden state to make optimal decisions.
← Back