Contextual Bandits
Arm Selection
The process of choosing an action (arm) from the available options, typically the one with the highest estimated reward, based on the current reward estimates and the observed context.
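A minimal sketch of what such a selection step might look like, assuming a linear reward model per arm and an epsilon-greedy rule. Neither assumption comes from the definition itself: `select_arm`, `weights`, and `epsilon` are hypothetical names chosen for illustration, and other selection rules would fit the definition equally well.

```python
import random


def select_arm(context, weights, epsilon=0.1, rng=random):
    """Return the index of the arm to play for the given context.

    Assumption: weights[a] is a hypothetical weight vector for arm a, and
    the estimated reward of arm a is the dot product of weights[a] with
    the context vector.
    """
    # Current reward estimate for each arm under the assumed linear model.
    estimates = [
        sum(w * x for w, x in zip(arm_weights, context))
        for arm_weights in weights
    ]
    # Explore: with probability epsilon, pick an arm uniformly at random.
    if rng.random() < epsilon:
        return rng.randrange(len(weights))
    # Exploit: otherwise pick the arm with the highest estimated reward.
    return max(range(len(weights)), key=lambda a: estimates[a])


# Illustrative values only: three arms, two context features.
weights = [[0.2, 0.1], [0.5, -0.3], [0.1, 0.9]]
context = [1.0, 0.5]
greedy_choice = select_arm(context, weights, epsilon=0.0)
```

With `epsilon=0.0` the choice is purely greedy, so the same arm is returned for the same context. A positive `epsilon` occasionally plays other arms, which is one common way to keep collecting data on arms whose estimates may still be inaccurate.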