BERT and Its Variants - AI Glossary

ALBERT

Lightweight version of BERT that greatly reduces the parameter count through cross-layer parameter sharing and factorized embedding parameterization. Maintains competitive performance while being more memory-efficient.
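
To make the effect of embedding factorization concrete, the arithmetic can be sketched as below. The function name and the sizes (a 30,000-entry vocabulary, hidden size 768, embedding size 128) are illustrative assumptions, not figures taken from this glossary.

```python
def embedding_params(vocab_size, hidden_size, embed_size=None):
    """Count embedding parameters, optionally with a factorized embedding."""
    if embed_size is None:
        # Unfactorized table: one hidden-size vector per vocabulary entry.
        return vocab_size * hidden_size
    # Factorized: a small V x E table followed by an E x H projection.
    return vocab_size * embed_size + embed_size * hidden_size


full = embedding_params(30000, 768)
factorized = embedding_params(30000, 768, embed_size=128)
print(full, factorized)  # 23040000 3938304
```

With these assumed sizes the embedding block shrinks by roughly a factor of six; cross-layer sharing reduces the encoder parameters separately.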

ELECTRA

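Efficient pre-training approach that replaces masked language modeling with replaced token detection. A small generator corrupts some input tokens, and a discriminator learns to identify which tokens were replaced. Because the loss covers every input token rather than only the masked ones, training is faster and more sample-efficient.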
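
A minimal sketch of how discriminator training pairs could be built is shown below. It assumes a random draw from a toy vocabulary as a stand-in for the generator network; `make_rtd_example` and its parameters are hypothetical names used only for illustration.

```python
import random


def make_rtd_example(tokens, vocab, replace_prob=0.15, seed=0):
    """Return (corrupted_tokens, labels) for replaced token detection.

    labels[i] is 1 if position i differs from the original token, else 0.
    """
    rng = random.Random(seed)
    corrupted, labels = [], []
    for tok in tokens:
        if rng.random() < replace_prob:
            # Assumption: a uniform random draw stands in for a generator sample.
            new_tok = rng.choice(vocab)
        else:
            new_tok = tok
        corrupted.append(new_tok)
        # A sampled token equal to the original counts as "not replaced".
        labels.append(int(new_tok != tok))
    return corrupted, labels


sentence = ["the", "chef", "cooked", "the", "meal"]
corrupted, labels = make_rtd_example(sentence, ["ate", "dog", "the"], replace_prob=0.5)
print(corrupted, labels)
```

The discriminator would then be trained to predict `labels` from `corrupted`, one binary decision per position.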

ERNIE

Model from Baidu, originally trained on Chinese text, that integrates structured and hierarchical knowledge into the base Transformer architecture through its masking strategy. In addition to individual words, it masks whole entities and phrases to capture multi-level semantics.
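
The difference between masking single tokens and masking a whole entity or phrase can be sketched as follows. The helper `mask_span` and the example sentence are illustrative assumptions; span boundaries would normally come from an entity or phrase tagger, which is not shown.

```python
MASK = "[MASK]"


def mask_span(tokens, start, end):
    """Mask every token in tokens[start:end] so the span is hidden as one unit."""
    return [MASK if start <= i < end else tok for i, tok in enumerate(tokens)]


tokens = ["Harry", "Potter", "is", "a", "series", "of", "fantasy", "novels"]
# Entity-level masking hides the full name, so it cannot be guessed from its other half.
print(mask_span(tokens, 0, 2))
# Phrase-level masking hides a multi-word unit in the same way.
print(mask_span(tokens, 3, 6))
```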

BART

Encoder-decoder Transformer that combines a bidirectional encoder (as in BERT) with an autoregressive decoder (as in GPT). Pre-trained by corrupting text and learning to reconstruct the original, which makes it well suited to generation tasks.
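
One form of text corruption, replacing a span with a single mask token, can be sketched as below. The function name `text_infill` and the fixed span position are assumptions for illustration; in practice span positions and lengths are sampled.

```python
def text_infill(tokens, start, length, mask="<mask>"):
    """Replace tokens[start:start+length] with one mask token.

    Returns (encoder_input, decoder_target): the corrupted sequence the
    encoder reads and the original sequence the decoder must reconstruct.
    """
    corrupted = tokens[:start] + [mask] + tokens[start + length:]
    return corrupted, list(tokens)


source, target = text_infill(["the", "cat", "sat", "on", "the", "mat"], 1, 2)
print(source)  # ['the', '<mask>', 'on', 'the', 'mat']
print(target)  # ['the', 'cat', 'sat', 'on', 'the', 'mat']
```

Because one mask can stand for several tokens, the model also has to infer how many tokens are missing.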

Funnel Transformers

Hierarchical architecture that progressively reduces sequence length across layers by pooling hidden states while preserving important information. This lowers compute and memory cost, especially for long sequences.
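
The length reduction can be illustrated with a simple strided mean pooling over a list of hidden vectors. This is a sketch under the assumption of non-overlapping windows of size 2; `pool_sequence` is a hypothetical helper, and the attention layers between pooling steps are omitted.

```python
def pool_sequence(hidden, stride=2):
    """Average each window of `stride` consecutive vectors into one vector."""
    pooled = []
    for i in range(0, len(hidden), stride):
        window = hidden[i:i + stride]
        dim = len(window[0])
        pooled.append([sum(vec[d] for vec in window) / len(window) for d in range(dim)])
    return pooled


hidden = [[float(i)] for i in range(8)]  # 8 positions, 1-dimensional states
block2 = pool_sequence(hidden)           # 4 positions
block3 = pool_sequence(block2)           # 2 positions
print(len(hidden), len(block2), len(block3))  # 8 4 2
```

Since self-attention cost grows quadratically with sequence length, halving the length cuts the attention cost of later blocks to roughly a quarter.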

DeBERTa

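Improvement on BERT whose name stands for Decoding-enhanced BERT with disentangled attention. Represents each token with separate content and relative-position vectors, computes attention from both through a disentangled attention mechanism, and uses an enhanced mask decoder for better performance.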
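
A disentangled attention score can be sketched as the sum of a content-to-content, a content-to-position, and a position-to-content term. The code below assumes already-projected query and key vectors passed in as plain lists; the names `Qc`, `Kc`, `Qr`, `Kr` and the toy values are illustrative, and softmax and value aggregation are left out.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def rel_index(i, j, k):
    """Relative distance i - j, clipped and shifted into the range [0, 2k)."""
    d = i - j
    if d <= -k:
        return 0
    if d >= k:
        return 2 * k - 1
    return d + k


def disentangled_score(i, j, Qc, Kc, Qr, Kr, k):
    """Unnormalized attention score between positions i and j."""
    c2c = dot(Qc[i], Kc[j])                   # content-to-content
    c2p = dot(Qc[i], Kr[rel_index(i, j, k)])  # content-to-position
    p2c = dot(Kc[j], Qr[rel_index(j, i, k)])  # position-to-content
    return (c2c + c2p + p2c) / math.sqrt(3 * len(Qc[i]))


Qc = [[1.0, 0.0], [0.0, 1.0]]  # content queries for 2 tokens
Kc = [[1.0, 0.0], [0.0, 1.0]]  # content keys
Kr = [[1.0, 1.0], [0.0, 0.0]]  # relative-position keys, 2k rows with k = 1
Qr = [[0.0, 0.0], [1.0, 1.0]]  # relative-position queries
print(disentangled_score(0, 1, Qc, Kc, Qr, Kr, k=1))
```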

TinyBERT

Compact version of BERT obtained through knowledge distillation, reducing parameters by up to 7.5 times while retaining most of the teacher's performance. The student is trained to match the teacher's embeddings, hidden states, attention matrices, and predictions.
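
Layer-wise distillation can be sketched in two pieces: a mapping from student layers to teacher layers, and a loss that compares their attention matrices. The uniform mapping and the helper names below are assumptions for illustration, and only the attention term of the loss is shown.

```python
def layer_map(student_layers, teacher_layers):
    """Uniformly map each student layer to a teacher layer (1-indexed)."""
    step = teacher_layers // student_layers
    return [step * m for m in range(1, student_layers + 1)]


def attention_mse(student_attn, teacher_attn):
    """Mean squared error between two attention matrices of equal shape."""
    total, count = 0.0, 0
    for s_row, t_row in zip(student_attn, teacher_attn):
        for s, t in zip(s_row, t_row):
            total += (s - t) ** 2
            count += 1
    return total / count


print(layer_map(4, 12))  # [3, 6, 9, 12]
print(attention_mse([[0.9, 0.1], [0.2, 0.8]], [[0.7, 0.3], [0.2, 0.8]]))
```

Each student layer would be pulled toward its mapped teacher layer by summing such terms, alongside analogous terms for hidden states and output predictions.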

CamemBERT

French language model pre-trained on 138GB of French text. Based on the RoBERTa architecture and training procedure rather than the original BERT recipe, and specialized for French understanding and processing.

FlauBERT

French Transformer-based language model pre-trained on a large, heterogeneous French corpus. Released together with FLUE, an evaluation benchmark for French NLP tasks.

XLM-RoBERTa

Multilingual version of RoBERTa pre-trained on 100 languages using about 2.5TB of filtered Common Crawl data. Outperforms XLM and mBERT on cross-lingual benchmarks thanks to improved pre-training, with the largest gains on low-resource languages.

Sentence-BERT

BERT modification optimized for encoding entire sentences into fixed-size semantic vectors. Uses siamese and triplet network structures so that the resulting embeddings can be compared directly, for example with cosine similarity.
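
The comparison step can be sketched as mean pooling over token vectors followed by cosine similarity. The token vectors below are made-up toy values standing in for encoder outputs, and the helper names are assumptions for illustration.

```python
import math


def sentence_embedding(token_vectors):
    """Mean-pool token vectors into a single fixed-size sentence vector."""
    dim = len(token_vectors[0])
    n = len(token_vectors)
    return [sum(vec[d] for vec in token_vectors) / n for d in range(dim)]


def cosine(a, b):
    """Cosine similarity between two vectors of equal length."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


# Toy token vectors; a real model would produce these from the two sentences.
sent_a = sentence_embedding([[1.0, 3.0], [3.0, 1.0]])
sent_b = sentence_embedding([[2.0, 2.0], [4.0, 4.0]])
print(sent_a, sent_b, cosine(sent_a, sent_b))
```

Because each sentence is encoded once and compared with a cheap vector operation, similarity search over many sentences avoids running the encoder on every pair.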

VideoBERT

Multimodal extension of BERT that learns joint video-text representations. Pre-trains on sequences of visual tokens and linguistic tokens for video understanding.

Controlled BERT

BERT variant that allows style attributes to be controlled during text generation. Integrates controllers into the architecture to modulate the desired linguistic characteristics.
