Quantization
4-bit Quantization
Extreme compression method reducing weights to 4 bits, allowing significant memory gains but with potential quality loss.
← BackExtreme compression method reducing weights to 4 bits, allowing significant memory gains but with potential quality loss.
← Back