Self-Attention
Multi-Scale Attention
Attention variant simultaneously processing dependencies at different temporal or spatial scales, combining varied receptive fields for hierarchical understanding.
← Quay lại