Contextual Bandits
Upper Confidence Bound (UCB)
Strategy that selects arms based on an upper confidence bound on their expected reward, favoring the exploration of uncertain actions.
← KembaliStrategy that selects arms based on an upper confidence bound on their expected reward, favoring the exploration of uncertain actions.
← Kembali