Meta-Learning in RL
Outer Loop Optimization
The update of the meta-parameters by aggregating gradients from multiple tasks, which improves the model's overall ability to adapt to new tasks.
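As a minimal sketch of this idea, the snippet below runs an outer loop on a toy problem. Everything in it is an assumption made for illustration: each task has a quadratic loss defined by a hypothetical target vector, the inner loop is a single gradient step, and the outer update uses the first-order approximation (gradients evaluated at the adapted parameters, with no second-order terms). The names `task_grad`, `inner_adapt`, and `outer_step` are invented here and do not come from any particular library.

```python
import numpy as np

def task_grad(params, target):
    """Gradient of the toy task loss 0.5 * ||params - target||^2."""
    return params - target

def inner_adapt(meta_params, target, inner_lr):
    """Inner loop: one gradient step from the meta-parameters on a single task."""
    return meta_params - inner_lr * task_grad(meta_params, target)

def outer_step(meta_params, targets, inner_lr, outer_lr):
    """Outer loop: aggregate post-adaptation gradients across tasks, then update."""
    # First-order approximation: each task gradient is taken at its adapted parameters.
    grads = [task_grad(inner_adapt(meta_params, t, inner_lr), t) for t in targets]
    meta_grad = np.mean(grads, axis=0)  # aggregate by averaging over tasks
    return meta_params - outer_lr * meta_grad

# Three hypothetical tasks, each defined only by its target vector.
targets = np.array([[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]])
meta_params = np.zeros(2)
for _ in range(100):
    meta_params = outer_step(meta_params, targets, inner_lr=0.1, outer_lr=0.5)
```

In this toy setting the meta-parameters settle at the mean of the task targets, the starting point from which one inner step reduces the average loss across tasks the most. In a reinforcement learning setting the task gradient would typically be a policy-gradient estimate computed from sampled trajectories rather than a closed-form expression.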