Real-Time Reinforcement Learning
Adaptive Exploration-Exploitation
Dynamic strategy that automatically adjusts the trade-off between discovering new actions and exploiting acquired knowledge. Adaptive algorithms modulate this parameter based on performance and environmental variability.
← Wstecz