Contextual Bandits
Thompson Sampling
Bayesian algorithm that samples reward parameters from their posterior distribution to make probabilistic decisions.
← 뒤로Bayesian algorithm that samples reward parameters from their posterior distribution to make probabilistic decisions.
← 뒤로