Meta-Learning in RL
Outer Loop Optimization
The outer loop updates the meta-parameters by aggregating gradients from multiple tasks, so that the model adapts better to new tasks.
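As a rough illustration, the outer-loop update can be sketched in a few lines of plain Python. This is a minimal sketch under stated assumptions, not a definitive implementation: each task is a toy quadratic loss `0.5 * (theta - target)**2` standing in for the negative expected return of an RL task, the meta-parameter `theta` is a single scalar, the inner loop takes one gradient step, and the aggregation is a simple mean. The names `inner_adapt`, `task_meta_gradient`, `outer_step`, `meta_train`, `alpha`, and `beta` are hypothetical and chosen only for this example.

```python
from statistics import mean


def inner_adapt(theta, target, alpha):
    """One inner-loop gradient step on the toy task loss 0.5 * (theta - target)**2."""
    grad = theta - target  # derivative of the toy loss (assumed stand-in for an RL objective)
    return theta - alpha * grad


def task_meta_gradient(theta, target, alpha):
    """Gradient of the post-adaptation loss with respect to the meta-parameter theta."""
    adapted = inner_adapt(theta, target, alpha)
    # Chain rule through the inner step: d(adapted)/d(theta) = 1 - alpha for this loss.
    return (adapted - target) * (1.0 - alpha)


def outer_step(theta, targets, alpha, beta):
    """Average the per-task meta-gradients and apply one update to theta."""
    meta_grad = mean(task_meta_gradient(theta, t, alpha) for t in targets)
    return theta - beta * meta_grad


def meta_train(theta, targets, alpha=0.1, beta=0.5, steps=200):
    """Repeat the outer-loop update for a fixed number of steps."""
    for _ in range(steps):
        theta = outer_step(theta, targets, alpha, beta)
    return theta


if __name__ == "__main__":
    # Three toy tasks, each defined only by its target value (illustrative data).
    print(meta_train(5.0, [-1.0, 0.5, 2.0]))
```

In this toy setting the averaged meta-gradient pulls `theta` toward the mean of the task targets, the starting point from which a single inner step does best on average across tasks. In a real RL setup the per-task gradient would instead come from policy-gradient estimates on sampled trajectories.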