Multi-Objective Evolutionary Optimization
Multi-Objective Policy Gradient
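Reinforcement learning method that directly optimizes policy parameters by stochastic gradient ascent on a vector-valued (multi-objective) reward. Because a vector of objectives has no single maximum, the objectives are typically combined through a scalarization such as a weighted sum, or the method searches for Pareto-optimal trade-offs among them.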
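As a rough illustration of the idea, the sketch below applies a REINFORCE-style update to a softmax policy on a toy three-armed bandit whose rewards have two objectives. It assumes linear (weighted-sum) scalarization, which is only one of several ways to handle a reward vector. The names `train_scalarized_reinforce`, `MEAN_REWARDS`, and `weights`, along with the reward values, step count, and learning rate, are hypothetical choices made for this example.

```python
import numpy as np


def softmax(logits):
    """Numerically stable softmax over a 1-D array of logits."""
    z = np.exp(logits - logits.max())
    return z / z.sum()


def train_scalarized_reinforce(mean_rewards, weights, steps=3000, lr=0.1, seed=0):
    """REINFORCE on a bandit with vector rewards, using weighted-sum scalarization.

    mean_rewards: array of shape (n_actions, n_objectives), illustrative values.
    weights:      array of shape (n_objectives,), the assumed preference weights.
    Returns the learned action probabilities.
    """
    rng = np.random.default_rng(seed)
    n_actions, n_objectives = mean_rewards.shape
    theta = np.zeros(n_actions)  # policy parameters (one logit per action)
    baseline = 0.0               # running average of the scalarized reward

    for _ in range(steps):
        probs = softmax(theta)
        action = rng.choice(n_actions, p=probs)

        # Sample a noisy reward vector, one entry per objective.
        reward_vec = mean_rewards[action] + rng.normal(0.0, 0.1, size=n_objectives)

        # Collapse the reward vector to a scalar with the preference weights.
        scalar_reward = float(weights @ reward_vec)

        # Gradient of log pi(action) w.r.t. the logits: one_hot(action) - probs.
        grad_log_pi = -probs
        grad_log_pi[action] += 1.0

        # Stochastic gradient ascent step with a baseline to reduce variance.
        theta += lr * (scalar_reward - baseline) * grad_log_pi
        baseline += 0.05 * (scalar_reward - baseline)

    return softmax(theta)


# Hypothetical problem: arm 0 favors objective 1, arm 1 favors objective 2,
# arm 2 is a compromise between the two.
MEAN_REWARDS = np.array([[1.0, 0.0],
                         [0.0, 1.0],
                         [0.6, 0.6]])

if __name__ == "__main__":
    for w in ([0.9, 0.1], [0.1, 0.9]):
        probs = train_scalarized_reinforce(MEAN_REWARDS, np.array(w))
        print(w, np.round(probs, 3))
```

Changing `weights` shifts which arm the policy comes to prefer, which is how a weighted-sum approach traces out different trade-offs. A known limitation of linear scalarization is that it cannot reach solutions on non-convex parts of the Pareto front.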