Autoregressive Models
Decoder-only Transformer
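A neural architecture built only from Transformer decoder layers with causal masking, so each position can attend to itself and earlier positions but never to later ones. It is the preferred design for modern autoregressive language models, which generate text one token at a time.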
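To make the causal masking concrete, here is a minimal sketch in plain Python. It is an illustration under simplifying assumptions, not how any particular library implements it: the function names `causal_mask` and `causal_attention_weights` are hypothetical, the input is a precomputed square matrix of raw attention scores, and there is a single head with no learned projections.

```python
import math


def causal_mask(seq_len):
    """Return mask[i][j] == True when position i may attend to position j (j <= i)."""
    return [[j <= i for j in range(seq_len)] for i in range(seq_len)]


def causal_attention_weights(scores):
    """Row-wise softmax over raw attention scores with future positions masked out.

    scores is a square list of lists where scores[i][j] is the score of
    query position i against key position j (an assumed input format).
    """
    n = len(scores)
    mask = causal_mask(n)
    weights = []
    for i in range(n):
        # Subtract the largest allowed score for numerical stability.
        row_max = max(scores[i][j] for j in range(n) if mask[i][j])
        # Masked (future) positions contribute exactly zero weight.
        exps = [
            math.exp(scores[i][j] - row_max) if mask[i][j] else 0.0
            for j in range(n)
        ]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights


if __name__ == "__main__":
    # With all scores equal, each position spreads its attention
    # uniformly over itself and the positions before it.
    for row in causal_attention_weights([[0.0] * 3 for _ in range(3)]):
        print(row)
```

With uniform scores, the first position puts all of its weight on itself, the second splits its weight evenly across the first two positions, and so on. The weights above the diagonal are always zero, which is what lets the model be trained on whole sequences while still predicting each token only from the tokens before it.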