BERT and Its Variants
BART
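Bidirectional and Auto-Regressive Transformer: an architecture that combines a BERT-style bidirectional encoder with a GPT-style autoregressive decoder. It is pre-trained as a denoising model: the input text is corrupted, and the encoder-decoder learns to reconstruct the original. This makes it well suited to generation tasks such as summarization.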
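The text-corruption idea can be illustrated with a minimal sketch, assuming whitespace tokenization and hand-picked span positions (real pre-training samples the spans at random and works on subword tokens). The names `corrupt_spans`, `make_training_pair`, and `MASK` are hypothetical and chosen only for this example. Each selected span is replaced by a single mask token, producing a corrupted source that the encoder would read and an intact target that the decoder would learn to generate.

```python
MASK = "<mask>"  # placeholder mask symbol, an assumption of this sketch


def corrupt_spans(tokens, spans, mask=MASK):
    """Replace each (start, length) span with a single mask token.

    Spans are assumed to be sorted by start and non-overlapping.
    """
    out = []
    pos = 0
    for start, length in spans:
        out.extend(tokens[pos:start])  # copy the untouched tokens before the span
        out.append(mask)               # the whole span collapses to one mask
        pos = start + length           # skip the tokens that were masked out
    out.extend(tokens[pos:])           # copy whatever follows the last span
    return out


def make_training_pair(text, spans):
    """Return (corrupted source, original target) as token lists."""
    tokens = text.split()  # naive whitespace tokenization, for illustration only
    return corrupt_spans(tokens, spans), tokens


source, target = make_training_pair(
    "the cat sat on the mat today", [(1, 2), (5, 1)]
)
print(" ".join(source))
print(" ".join(target))
```

Because a span of any length collapses to one mask, the model has to infer how many tokens are missing as well as which ones, which is what distinguishes this kind of infilling from masking tokens one by one.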