Mixed Precision Computing
FP8 (8-bit Floating Point)
Emerging 8-bit representation format with different variants (E4M3, E5M2) optimized for training and inference, offering an extreme trade-off between throughput and precision for very large models.
← Zurück