Meta-Reinforcement Learning
Meta-Policy Gradient
Optimization algorithm that calculates gradients with respect to meta-parameters to improve expected performance on the task distribution.
← BackOptimization algorithm that calculates gradients with respect to meta-parameters to improve expected performance on the task distribution.
← Back