Attention Masks
Variable Length Mask
A dynamic mask built from the actual length of each sequence in a batch. Sequences are padded to a common length so the batch stays aligned, and the mask marks the padded positions so that attention ignores them and keeps padding out of the computation.
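As a rough illustration of the idea, the sketch below builds such a mask from a list of sequence lengths and applies it to one row of attention scores. The function names `length_mask` and `masked_softmax`, the example lengths, and the scores are hypothetical and chosen only for this example. It uses plain Python lists for clarity; a tensor library would typically do the same with broadcasting. It assumes every sequence has at least one real token.

```python
import math

def length_mask(lengths, max_len=None):
    """Return a batch x max_len boolean mask: True = real token, False = padding."""
    if max_len is None:
        max_len = max(lengths)
    return [[pos < n for pos in range(max_len)] for n in lengths]

def masked_softmax(scores, mask_row):
    """Softmax over one row of scores, giving masked positions zero weight.

    Assumes at least one position in mask_row is True.
    """
    # Masked positions are set to -inf so that exp() maps them to exactly 0.
    masked = [s if keep else -math.inf for s, keep in zip(scores, mask_row)]
    peak = max(masked)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in masked]
    total = sum(exps)
    return [e / total for e in exps]

lengths = [3, 1, 2]          # hypothetical true lengths of three padded sequences
mask = length_mask(lengths)  # 3 rows, each padded to length 3

# Third sequence has length 2, so its last position is padding.
weights = masked_softmax([0.5, 1.0, -0.2], mask[2])
```

Here the padded slot receives zero attention weight while the remaining weights still sum to one, and every row keeps the same length, so the batch shape is unchanged.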