Contextual Bandits
Expected Reward
Average reward anticipated for a given action in a specific context, in practice estimated from historical observations.
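As a minimal sketch of this definition, the expected reward can be estimated as the sample mean of the rewards observed for each (context, action) pair. The function name `estimate_expected_rewards`, the tuple layout of `history`, and the example contexts and actions are all hypothetical and chosen only for illustration; a real contextual bandit typically uses a model that generalizes across contexts rather than a lookup table.

```python
from collections import defaultdict


def estimate_expected_rewards(history):
    """Estimate the expected reward per (context, action) pair.

    `history` is an iterable of (context, action, reward) tuples.
    This layout is an assumption made for the example.
    """
    totals = defaultdict(float)  # sum of observed rewards per pair
    counts = defaultdict(int)    # number of observations per pair
    for context, action, reward in history:
        key = (context, action)
        totals[key] += reward
        counts[key] += 1
    # Sample mean of the observed rewards for every pair seen so far.
    return {key: totals[key] / counts[key] for key in counts}


# Hypothetical log of past interactions: (context, action, reward).
history = [
    ("mobile", "ad_a", 1.0),
    ("mobile", "ad_a", 0.0),
    ("mobile", "ad_b", 1.0),
    ("desktop", "ad_a", 0.0),
]

estimates = estimate_expected_rewards(history)
for (context, action), value in sorted(estimates.items()):
    print(f"context={context} action={action} estimate={value:.2f}")
```

Pairs that never appear in the history get no estimate at all, which is one reason bandit algorithms combine such estimates with some form of exploration.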