Contextual Bandits
Expected Reward
Anticipated average value of the reward for a given action in a specific context, calculated from historical observations.
← KembaliAnticipated average value of the reward for a given action in a specific context, calculated from historical observations.
← Kembali