Scaled Dot-Product Attention
Value Projection
A linear transformation applied to the input embeddings to produce the Value matrix, whose rows are combined according to the attention weights to compute the weighted output.
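As a minimal sketch of this step, the example below multiplies a toy input matrix by a value weight matrix and then mixes the resulting value rows with attention weights. The names `X`, `W_V`, `A`, and `matmul`, the toy sizes, and every number are hypothetical and chosen only for illustration. In a real model `W_V` is learned, and the attention weights come from a softmax over scaled query-key dot products rather than being written by hand.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


# Hypothetical toy sizes: 3 tokens, embedding width 4, value width 2.
X = [
    [1.0, 0.0, 1.0, 0.0],
    [0.0, 2.0, 0.0, 2.0],
    [1.0, 1.0, 1.0, 1.0],
]

# Value projection weights (assumed values; learned in a real model).
W_V = [
    [1.0, 0.0],
    [0.0, 1.0],
    [1.0, 0.0],
    [0.0, 1.0],
]

# Value projection: one value vector per token, shape (3, 2).
V = matmul(X, W_V)

# Hand-written attention weights; each row sums to 1.
A = [
    [1.0, 0.0, 0.0],
    [0.5, 0.5, 0.0],
    [0.25, 0.25, 0.5],
]

# Weighted output: each row is a weighted average of the value vectors.
output = matmul(A, V)

print(V)       # [[2.0, 0.0], [0.0, 4.0], [2.0, 2.0]]
print(output)  # [[2.0, 0.0], [1.0, 2.0], [1.5, 2.0]]
```

The first output row equals the first value vector because that token puts all of its weight on itself, while the later rows blend several value vectors.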