Contextual Bandits
Expected Reward
The average reward an action is expected to yield in a specific context, typically estimated from historical observations of that action in that context.
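As a minimal sketch of the "calculated from historical observations" part, the simplest estimator is the sample mean of the rewards logged for each (context, action) pair. The function name `estimate_expected_rewards`, the log format, and the example contexts and actions below are hypothetical, chosen only for illustration. Practical contextual bandit systems often fit a model over context features instead of keeping one average per discrete context.

```python
from collections import defaultdict


def estimate_expected_rewards(history):
    """Return the sample-mean reward for each (context, action) pair.

    `history` is an iterable of (context, action, reward) tuples.
    This tabular estimator assumes contexts are discrete and hashable.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for context, action, reward in history:
        totals[(context, action)] += reward
        counts[(context, action)] += 1
    return {key: totals[key] / counts[key] for key in totals}


# Hypothetical interaction log: (context, action, observed reward)
history = [
    ("mobile", "banner_a", 1.0),
    ("mobile", "banner_a", 0.0),
    ("mobile", "banner_b", 1.0),
    ("desktop", "banner_a", 0.0),
]

estimates = estimate_expected_rewards(history)
print(estimates[("mobile", "banner_a")])  # 0.5
```

Pairs that never appear in the log get no estimate at all, which is one reason bandit algorithms need to explore actions they have little data on.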