Positional Encoding
Length Extrapolation
Ability of a model to generalize to sequence lengths greater than those seen during training, strongly dependent on the positional encoding scheme used.
← Zurück