Contextual Bandits
Thompson Sampling
Bayesian algorithm that samples reward parameters from their posterior distribution to make probabilistic decisions.
← GeriBayesian algorithm that samples reward parameters from their posterior distribution to make probabilistic decisions.
← Geri