Contextual Bandits
Thompson Sampling
Bayesian algorithm that samples reward parameters from their posterior distribution to make probabilistic decisions.
← KembaliBayesian algorithm that samples reward parameters from their posterior distribution to make probabilistic decisions.
← Kembali