Quantization
AWQ
Activation-aware Weight Quantization, a method that weights the importance of weights according to the amplitude of corresponding activations.
← BackActivation-aware Weight Quantization, a method that weights the importance of weights according to the amplitude of corresponding activations.
← Back