Contextual Bandits
Reward Function
Mathematical function that quantifies the immediate return obtained after taking an action in a given context, guiding the algorithm's learning.
← Quay lạiMathematical function that quantifies the immediate return obtained after taking an action in a given context, guiding the algorithm's learning.
← Quay lại