Dyna-Q Learning
Computational complexity
The computational cost of Dyna-Q depends linearly on the size of the experience replay buffer and on the number of planning updates per iteration.
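A minimal sketch of one tabular Dyna-Q iteration can make the cost easier to see. It assumes a deterministic model stored as a dictionary and uniform sampling of previously seen state-action pairs. The names `dyna_q_step`, `model`, and `n_planning` are illustrative and do not come from any particular library.

```python
import random
from collections import defaultdict

def dyna_q_step(Q, model, s, a, r, s_next, actions, n_planning,
                alpha=0.1, gamma=0.95, rng=random):
    """One Dyna-Q iteration: one direct update plus n_planning simulated ones.

    Returns the number of Q-value updates performed (1 + n_planning).
    """
    def update(s, a, r, s_next):
        # Standard Q-learning backup; costs O(len(actions)) for the max.
        best_next = max(Q[(s_next, b)] for b in actions)
        Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])

    update(s, a, r, s_next)          # learn from the real transition
    model[(s, a)] = (r, s_next)      # assumed deterministic model
    updates = 1

    seen = list(model)               # O(len(model)) copy of stored pairs
    for _ in range(n_planning):      # planning: replay simulated transitions
        ps, pa = rng.choice(seen)
        pr, pnext = model[(ps, pa)]
        update(ps, pa, pr, pnext)
        updates += 1
    return updates

Q = defaultdict(float)
model = {}
n_updates = dyna_q_step(Q, model, s=0, a="right", r=1.0, s_next=1,
                        actions=["left", "right"], n_planning=5)
```

In this sketch each iteration performs `1 + n_planning` backups, and copying the stored pairs with `list(model)` takes time proportional to the number of stored transitions, so the per-iteration cost grows linearly in both quantities.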