Low-Resource Models
Memory-Efficient Optimizer
Optimizer variant (like Adafactor or 8-bit Adam) that reduces the memory footprint of optimizer states, avoiding storing moments for all parameters, which is crucial for training large models on limited GPUs.
← Zurück