Momentum-based Optimization
Adagrad
Adaptive optimization algorithm that adapts the learning rate of each parameter by accumulating the squares of historical gradients, favoring infrequent parameters.
← Terug