Contextual Bandits
Upper Confidence Bound (UCB)
Strategy that selects arms based on an upper confidence bound on their expected reward, favoring the exploration of uncertain actions.
← IndietroStrategy that selects arms based on an upper confidence bound on their expected reward, favoring the exploration of uncertain actions.
← Indietro