Momentum-based Optimization
Adadelta
Extension of Adagrad that solves the problem of the learning rate's drastic decay by limiting the window of past gradients to a fixed size via an exponential moving average.
← Tillbaka