Contextual Bandits
Thompson Sampling
A Bayesian algorithm that samples reward parameters from their posterior distribution and then chooses the action that looks best under the sampled values, so that exploration is driven by posterior uncertainty.
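To make the sampling step concrete, here is a minimal sketch under stated assumptions: rewards are binary, contexts come from a small discrete set, and each (context, arm) pair gets its own Beta(1, 1) prior. The class name `BetaThompsonSampler`, the helper `simulate`, and the success rates used in the usage note are hypothetical and chosen only for illustration. Richer contextual settings would typically replace the per-context Beta posteriors with a model over context features.

```python
import random
from collections import defaultdict


class BetaThompsonSampler:
    """Thompson Sampling for Bernoulli rewards with discrete contexts (illustrative)."""

    def __init__(self, n_arms, seed=None):
        self.n_arms = n_arms
        self.rng = random.Random(seed)
        # Assumption: a uniform Beta(1, 1) prior for every (context, arm) pair.
        self.alpha = defaultdict(lambda: [1.0] * n_arms)
        self.beta = defaultdict(lambda: [1.0] * n_arms)

    def select_arm(self, context):
        # Draw one plausible success rate per arm from its posterior,
        # then act greedily with respect to the sampled values.
        a, b = self.alpha[context], self.beta[context]
        samples = [self.rng.betavariate(a[i], b[i]) for i in range(self.n_arms)]
        return max(range(self.n_arms), key=samples.__getitem__)

    def update(self, context, arm, reward):
        # Conjugate update: a success increments alpha, a failure increments beta.
        if reward:
            self.alpha[context][arm] += 1.0
        else:
            self.beta[context][arm] += 1.0


def simulate(true_rates, n_rounds=5000, seed=0):
    """Run the sampler against hypothetical per-context success rates."""
    env = random.Random(seed + 1)  # separate generator for the simulated environment
    contexts = list(true_rates)
    n_arms = len(true_rates[contexts[0]])
    agent = BetaThompsonSampler(n_arms, seed=seed)
    pulls = {c: [0] * n_arms for c in contexts}
    for _ in range(n_rounds):
        c = env.choice(contexts)
        arm = agent.select_arm(c)
        reward = env.random() < true_rates[c][arm]
        agent.update(c, arm, reward)
        pulls[c][arm] += 1
    return pulls
```

Calling `simulate({"mobile": [0.1, 0.8], "desktop": [0.8, 0.1]})` returns per-context pull counts. As the posteriors concentrate, the sampled values for the weaker arm rarely win the comparison, so most pulls in each context should shift toward the arm with the higher success rate.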