Contextual Bandits
Expected Reward
Anticipated average value of the reward for a given action in a specific context, calculated from historical observations.
← BackAnticipated average value of the reward for a given action in a specific context, calculated from historical observations.
← Back