Cross-Attention
Token Alignment
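Process by which cross-attention learns to align the relevant tokens or segments of one sequence with those of another automatically, even when the two sequences differ in length or structure. Crucial for translation tasks, where correspondences are not one-to-one: a single word in one language may map to several words in the other, or to none.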
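The idea can be illustrated with a minimal sketch, assuming scaled dot-product attention and hand-picked toy embeddings rather than learned ones. The function names `cross_attention_weights` and `aligned_sources`, the vectors, and the 0.25 threshold are all illustrative assumptions; in a trained model the queries and keys would come from learned projections. Each target token produces a row of weights over the source tokens, and that row can be read as a soft alignment.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention_weights(queries, keys):
    """Return one row of attention weights per query (target token).

    Each row sums to 1 and has one entry per key (source token).
    """
    scale = math.sqrt(len(keys[0]))
    rows = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in keys]
        rows.append(softmax(scores))
    return rows


def aligned_sources(weights, threshold=0.25):
    """For each target token, list the source indices it attends to strongly."""
    return [[j for j, w in enumerate(row) if w >= threshold] for row in weights]


# Toy example (assumed vectors): 4 source tokens, 3 target tokens.
source_tokens = ["je", "ne", "mange", "pas"]
target_tokens = ["I", "not", "eat"]

keys = [
    [4.0, 0.0, 0.0],  # je
    [0.0, 4.0, 0.0],  # ne
    [0.0, 0.0, 4.0],  # mange
    [0.0, 4.0, 0.0],  # pas (shares the "negation" direction with "ne")
]
queries = [
    [4.0, 0.0, 0.0],  # I
    [0.0, 4.0, 0.0],  # not
    [0.0, 0.0, 4.0],  # eat
]

weights = cross_attention_weights(queries, keys)
alignment = aligned_sources(weights)

for token, indices in zip(target_tokens, alignment):
    print(token, "->", [source_tokens[j] for j in indices])
# I -> ['je']
# not -> ['ne', 'pas']
# eat -> ['mange']
```

Here the target word "not" splits its attention evenly between "ne" and "pas", a one-to-many correspondence that a strict one-to-one mapping could not represent, and the sequences have different lengths (3 target tokens against 4 source tokens).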