Epsilon-Greedy Algorithms
Value initialization
Process of assigning initial values to reward estimates for each action at the beginning of learning. The initialization strategy significantly influences the agent's initial exploratory behavior.
← 뒤로