Quantization by Clustering
Quantization by Clustering
Model compression technique that groups similar weights into clusters to reduce memory while preserving performance. This approach enables compact weight representation using a limited number of representative centroids.
← Tillbaka