Scaling Laws
Sharpness-Aware Minimization
Optimization technique seeking flat minima in the loss landscape, particularly important for the stability of large models.
← 뒤로Optimization technique seeking flat minima in the loss landscape, particularly important for the stability of large models.
← 뒤로