Optimization and Computational Efficiency
Convolution Kernel Fusion
Optimization technique that combines successive convolution layers (e.g., Conv + BatchNorm + ReLU) into a single convolution operation, reducing latency and memory access on inference hardware.
← Quay lại