Hierarchical Reinforcement Learning
Goal-conditioned Policies
Policies parameterized by specific goals allowing agents to learn generalizable behaviors that can be reused for different sub-objectives.
← Wstecz