Contextual Bandits
Expected Reward
Average reward that a given action is expected to yield in a specific context, estimated from historical observations.
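As a rough illustration, the simplest estimate is the sample mean of the rewards observed for each (context, action) pair. The sketch below assumes discrete contexts and actions and a log of `(context, action, reward)` tuples; the function name `estimate_expected_rewards` and the sample data are hypothetical, chosen only for this example.

```python
from collections import defaultdict


def estimate_expected_rewards(history):
    """Return the mean observed reward for each (context, action) pair.

    `history` is an iterable of (context, action, reward) tuples.
    Assumes contexts and actions are hashable, discrete values.
    """
    totals = defaultdict(float)  # sum of rewards per (context, action)
    counts = defaultdict(int)    # number of observations per (context, action)
    for context, action, reward in history:
        totals[(context, action)] += reward
        counts[(context, action)] += 1
    # Sample mean: total reward divided by number of observations.
    return {key: totals[key] / counts[key] for key in totals}


# Hypothetical logged observations, invented for illustration.
history = [
    ("mobile", "banner_a", 1.0),
    ("mobile", "banner_a", 0.0),
    ("mobile", "banner_b", 1.0),
    ("desktop", "banner_a", 0.0),
]

estimates = estimate_expected_rewards(history)
print(estimates[("mobile", "banner_a")])
```

Pairs that never appear in the log get no estimate here. With continuous or high-dimensional contexts, a regression model over context features would typically replace the per-pair average.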