Reinforcement Learning for Optimization
Experience Replay Memory
Data structure storing transitions (state, action, reward, next state) for resampling during training, improving data usage efficiency.
← Indietro