Multi-Objective Q-Learning
Q-value Vector
Multi-dimensional data structure where each element represents the Q-value for a specific objective, replacing the single scalar value of classical Q-Learning.
← Tillbaka