Active Reinforcement Learning
Reward Function
The signal that quantifies how useful each sample-selection action was, typically measured as the resulting improvement in model performance.
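As a minimal sketch of this definition, the reward for selecting a sample can be computed as the change in validation accuracy after the sample is added to the labeled set. Everything here is an illustrative assumption rather than a prescribed method: the toy 1-D data, the 1-nearest-neighbor model, the use of accuracy as the performance metric, and the names `predict_1nn`, `accuracy`, and `selection_reward`.

```python
from typing import List, Tuple

Sample = Tuple[float, int]  # (feature, label); toy 1-D data, purely illustrative


def predict_1nn(labeled: List[Sample], x: float) -> int:
    """Predict the label of x from its nearest labeled neighbor."""
    nearest = min(labeled, key=lambda s: abs(s[0] - x))
    return nearest[1]


def accuracy(labeled: List[Sample], validation: List[Sample]) -> float:
    """Fraction of validation points the 1-NN model classifies correctly."""
    correct = sum(predict_1nn(labeled, x) == y for x, y in validation)
    return correct / len(validation)


def selection_reward(
    labeled: List[Sample], candidate: Sample, validation: List[Sample]
) -> float:
    """Reward = validation accuracy after adding the candidate minus before."""
    before = accuracy(labeled, validation)
    after = accuracy(labeled + [candidate], validation)
    return after - before


# Hypothetical data: the true class boundary lies between 6 and 9.
labeled = [(0.0, 0), (10.0, 1)]
validation = [(1.0, 0), (4.0, 0), (6.0, 0), (9.0, 1)]

informative = selection_reward(labeled, (7.0, 0), validation)    # near the boundary
uninformative = selection_reward(labeled, (0.5, 0), validation)  # far from the boundary
print(informative, uninformative)
```

In this toy setup the sample near the decision boundary earns a positive reward, while the redundant sample earns zero. In practice the reward may be noisy or even negative, so it is often smoothed or normalized before being used to update the selection policy.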