Mixed Precision Computing
Precision-Aware Pruning
A network pruning method that accounts for each layer's sensitivity to precision reduction, pruning more aggressively in layers that remain robust at low precision in order to maximize acceleration.
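The idea can be illustrated with a minimal NumPy sketch. It is only one possible reading of the definition, under stated assumptions: sensitivity is approximated by the relative error that simulated uniform quantization introduces into a layer's weights (a real pipeline would more likely measure the accuracy drop on validation data), sensitivities are mapped linearly to per-layer sparsity targets, and pruning is plain magnitude pruning. The function names, the 8-bit default, and the 0.2 to 0.8 sparsity range are hypothetical choices made for illustration.

```python
import numpy as np

def quantization_sensitivity(weights, num_bits=8):
    """Assumed proxy: relative weight error after symmetric uniform quantization."""
    scale = np.max(np.abs(weights)) / (2 ** (num_bits - 1) - 1)
    if scale == 0:
        return 0.0  # an all-zero layer loses nothing when quantized
    quantized = np.round(weights / scale) * scale
    return float(np.linalg.norm(weights - quantized) / np.linalg.norm(weights))

def assign_sparsities(sensitivities, min_sparsity=0.2, max_sparsity=0.8):
    """Linear mapping (an assumption): the most robust layer gets max_sparsity."""
    s = np.asarray(sensitivities, dtype=float)
    span = s.max() - s.min()
    if span == 0:
        return np.full_like(s, (min_sparsity + max_sparsity) / 2)
    robustness = (s.max() - s) / span  # 1 = most robust, 0 = most sensitive
    return min_sparsity + robustness * (max_sparsity - min_sparsity)

def magnitude_prune(weights, sparsity):
    """Zero out roughly the smallest-magnitude `sparsity` fraction of weights."""
    k = int(round(sparsity * weights.size))
    if k == 0:
        return weights.copy()
    # k-th smallest absolute value; weights tied with it are pruned as well.
    threshold = np.partition(np.abs(weights).ravel(), k - 1)[k - 1]
    return weights * (np.abs(weights) > threshold)

# Toy "network": random weight matrices standing in for real layers.
rng = np.random.default_rng(0)
layers = {
    "conv1": rng.normal(size=(64, 27)),
    "fc1": rng.standard_t(df=2, size=(128, 64)),  # heavy-tailed weights
    "fc2": rng.uniform(-1, 1, size=(10, 128)),
}

names = list(layers)
sens = [quantization_sensitivity(layers[n]) for n in names]
targets = assign_sparsities(sens)

for name, s, t in zip(names, sens, targets):
    pruned = magnitude_prune(layers[name], t)
    achieved = float(np.mean(pruned == 0))
    print(f"{name}: sensitivity={s:.4f} target={t:.2f} achieved={achieved:.2f}")
```

Layers whose weights survive quantization with little error receive the highest sparsity targets, while the most sensitive layer is pruned only lightly. Swapping the proxy for a measured accuracy drop would leave the rest of the sketch unchanged.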