Dyna-Q Learning
Computational complexity
Computational cost of Dyna-Q, depending linearly on the size of the experience replay buffer and the number of planning updates per iteration.
← KembaliComputational cost of Dyna-Q, depending linearly on the size of the experience replay buffer and the number of planning updates per iteration.
← Kembali