Model-Based Deep RL
Planning with Learned Models
Sequential search process using the learned model to evaluate different future action sequences and select the optimum according to reward predictions.
← Geri