Scaling Laws
Double Descent
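Phenomenon in which test error first decreases, then rises to a peak near the interpolation threshold (the point at which the model is just large enough to fit the training data exactly), and then decreases again as model size grows past that threshold.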
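A minimal sketch of how the effect can be reproduced, assuming a synthetic linear-regression setup that is not part of the original entry: the model uses only the first p of d_total Gaussian features, and "model size" is p. The function name double_descent_sweep, the sample sizes, the noise level, and the list of p values are all illustrative assumptions. In setups like this the test error typically peaks near p = n_train, but the exact numbers depend on the seed, so no expected output is shown.

```python
import numpy as np


def double_descent_sweep(n_train=40, n_test=500, d_total=200, noise=0.5, seed=0):
    """Sweep model size p and record (p, train MSE, test MSE).

    Hypothetical toy setup: the target depends on all d_total features,
    but the model only sees the first p of them.
    """
    rng = np.random.default_rng(seed)
    beta = rng.normal(size=d_total) / np.sqrt(d_total)  # true coefficients
    X_train = rng.normal(size=(n_train, d_total))
    X_test = rng.normal(size=(n_test, d_total))
    y_train = X_train @ beta + noise * rng.normal(size=n_train)
    y_test = X_test @ beta + noise * rng.normal(size=n_test)

    results = []
    for p in (5, 10, 20, 30, 38, 40, 42, 50, 80, 120, 200):
        # lstsq gives the ordinary least-squares fit when p < n_train and
        # the minimum-norm interpolating solution when p >= n_train.
        w, *_ = np.linalg.lstsq(X_train[:, :p], y_train, rcond=None)
        train_mse = float(np.mean((X_train[:, :p] @ w - y_train) ** 2))
        test_mse = float(np.mean((X_test[:, :p] @ w - y_test) ** 2))
        results.append((p, train_mse, test_mse))
    return results


if __name__ == "__main__":
    for p, train_mse, test_mse in double_descent_sweep():
        print(f"p={p:4d}  train MSE={train_mse:.4f}  test MSE={test_mse:.4f}")
```

The interpolation threshold here is p = n_train = 40: beyond it the training error drops to numerical zero, and the shape of the test-error curve around that point is what the entry describes.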