Dyna-Q Learning
Computational complexity
The computational cost of Dyna-Q grows linearly with the size of the experience replay buffer and with the number of planning updates performed per iteration.
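To make the two linear terms concrete, here is a minimal tabular sketch of a single Dyna-Q iteration. It is an illustration under stated assumptions, not a reference implementation: the function name `dyna_q_step`, its parameters, the deterministic dictionary model standing in for the replay buffer, and the default values of `alpha` and `gamma` are all hypothetical choices made for this example. In this naive version the stored transitions are copied into a list once per iteration (work proportional to the buffer size), and each planning update costs one Q-value backup (work proportional to `n_planning`).

```python
import random
from collections import defaultdict


def dyna_q_step(Q, model, s, a, r, s_next, actions, n_planning,
                alpha=0.1, gamma=0.95, rng=random):
    """Run one Dyna-Q iteration and return the number of Q-value updates.

    Q       : defaultdict(float) mapping (state, action) -> value
    model   : dict mapping (state, action) -> (reward, next_state);
              assumed deterministic, used here as the replay buffer
    actions : list of all available actions
    """
    def update(s, a, r, s_next):
        # Standard one-step Q-learning backup.
        best_next = max(Q[(s_next, b)] for b in actions)
        Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])

    update(s, a, r, s_next)        # direct update from real experience
    model[(s, a)] = (r, s_next)    # record the observed transition
    updates = 1

    # Naive sampling: copying the keys costs O(len(model)) per iteration.
    seen = list(model.keys())

    # Planning: n_planning simulated backups, each costing O(len(actions)).
    for _ in range(n_planning):
        ps, pa = rng.choice(seen)
        pr, ps_next = model[(ps, pa)]
        update(ps, pa, pr, ps_next)
        updates += 1

    return updates


if __name__ == "__main__":
    Q = defaultdict(float)
    model = {}
    n = dyna_q_step(Q, model, s=0, a=1, r=1.0, s_next=1,
                    actions=[0, 1], n_planning=5)
    print(n, len(model))
```

Under these assumptions, one iteration performs `1 + n_planning` backups plus one pass over the stored transitions. Keeping the transitions in a list that is appended to, rather than rebuilt, would remove the per-iteration dependence on buffer size in this sketch, leaving memory as the only cost that grows with the buffer.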