Active Reinforcement Learning
Reward Function
Signal quantifying the utility of each sample-selection action, typically measured as the resulting improvement in model performance.
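One common way to make this definition concrete is to score a selection by the change in validation accuracy it produces. The sketch below is a minimal illustration under stated assumptions, not a reference implementation: the 1-D toy data, the 1-nearest-neighbour model, and the names `accuracy` and `selection_reward` are all hypothetical and chosen only for the example.

```python
from typing import List, Tuple

Sample = Tuple[float, int]  # (feature, label); hypothetical 1-D toy data


def accuracy(train: List[Sample], val: List[Sample]) -> float:
    """Validation accuracy of a 1-nearest-neighbour classifier fit on `train`."""
    if not train:
        return 0.0
    correct = 0
    for x, y in val:
        # Predict the label of the closest training point.
        nearest = min(train, key=lambda s: abs(s[0] - x))
        correct += int(nearest[1] == y)
    return correct / len(val)


def selection_reward(labeled: List[Sample], chosen: Sample, val: List[Sample]) -> float:
    """Reward for the action 'label `chosen`': change in validation accuracy."""
    before = accuracy(labeled, val)
    after = accuracy(labeled + [chosen], val)
    return after - before


# Assumed toy setup: one labeled point, a small held-out validation set.
labeled = [(0.0, 0)]
val = [(0.1, 0), (0.2, 0), (0.9, 1), (1.0, 1)]

informative = selection_reward(labeled, (1.0, 1), val)   # adds the unseen class
redundant = selection_reward(labeled, (0.05, 0), val)    # near-duplicate of known data
print(informative, redundant)
```

In this toy setup the sample that introduces the unseen class earns a positive reward, while the near-duplicate earns none, which is the behaviour the definition describes.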