Adaptive Learning Rate Methods
Weight Decay
Regularization method that penalizes large weights by adding an L2 term to the loss function, helping to prevent overfitting and improve generalization.
← Zurück