Feed-Forward Networks
Feed-Forward Sublayer
Individual component of the Transformer block containing the FFN, including residual connections and layer normalization to stabilize training.
← Kembali