AI Glossary
A Complete Dictionary of Artificial Intelligence
Masked token
Token in a textual sequence replaced by a special [MASK] symbol during masked language modeling (MLM) pre-training, forcing the model to learn to predict the original token.
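A minimal sketch in Python, assuming a toy whitespace tokenizer and the literal string "[MASK]" as the mask symbol:

```python
import random

def mask_tokens(tokens, mask_prob=0.15, mask_symbol="[MASK]"):
    """Replace a random subset of tokens with the mask symbol and record
    the original tokens the model must learn to predict."""
    masked, targets = list(tokens), {}
    for i, token in enumerate(tokens):
        if random.random() < mask_prob:
            masked[i] = mask_symbol
            targets[i] = token  # training target at this position
    return masked, targets

masked, targets = mask_tokens("the cat sat on the mat".split())
print(masked)   # e.g. ['the', 'cat', '[MASK]', 'on', 'the', 'mat']
print(targets)  # e.g. {2: 'sat'}
```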
BERT
Bidirectional Encoder Representations from Transformers: an encoder-only Transformer architecture pre-trained with MLM (and Next Sentence Prediction) to capture the bidirectional context of natural language.
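As a usage sketch, a published BERT checkpoint (bert-base-uncased) can fill a masked position via the Hugging Face transformers fill-mask pipeline:

```python
from transformers import pipeline

# Load a pre-trained BERT checkpoint behind a fill-mask pipeline.
fill = pipeline("fill-mask", model="bert-base-uncased")

# BERT uses context on both sides of [MASK] to rank candidate tokens.
for pred in fill("The capital of France is [MASK]."):
    print(pred["token_str"], round(pred["score"], 3))
```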
RoBERTa
Robustly optimized variant of BERT that removes the Next Sentence Prediction objective, uses dynamic masking, and trains with larger batches, more data, and tuned hyperparameters.
Bidirectional attention
Mechanism allowing each token to attend to both preceding and following tokens in the sequence, unlike unidirectional models.
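The difference can be made concrete with attention masks; a sketch in PyTorch, assuming 1 marks an allowed attention edge:

```python
import torch

seq_len = 5

# Bidirectional attention (BERT-style): every position may attend to
# every other position, so the mask is all ones.
bidirectional = torch.ones(seq_len, seq_len)

# Unidirectional, causal attention (GPT-style): position i may attend
# only to positions j <= i, a lower-triangular mask.
causal = torch.tril(torch.ones(seq_len, seq_len))

print(bidirectional)
print(causal)
```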
Token embeddings
Dense vector representations of input tokens that capture their semantic and syntactic characteristics.
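A minimal sketch using a learned lookup table in PyTorch; the vocabulary size and dimension below mirror BERT-base but are purely illustrative:

```python
import torch
import torch.nn as nn

vocab_size, embed_dim = 30522, 768  # BERT-base-like sizes, for illustration

# A learned lookup table: one dense vector per vocabulary entry.
embedding = nn.Embedding(vocab_size, embed_dim)

token_ids = torch.tensor([[101, 7592, 2088, 102]])  # hypothetical token ids
vectors = embedding(token_ids)
print(vectors.shape)  # torch.Size([1, 4, 768])
```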
Dynamic masking
Masking strategy in which the mask pattern is regenerated each time a sequence is presented to the model, rather than fixed once during preprocessing, improving robustness as in RoBERTa.
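A sketch of the contrast with static masking, using a toy helper (assumptions: whitespace tokens, a fixed 15% mask rate):

```python
import random

def mask_tokens(tokens, mask_prob=0.15):
    return ["[MASK]" if random.random() < mask_prob else t for t in tokens]

corpus = ["the cat sat on the mat".split()]

# Static masking: the mask pattern is fixed once during preprocessing.
static = [mask_tokens(seq) for seq in corpus]

# Dynamic masking: a fresh pattern is drawn every time the sequence is
# fed to the model, so prediction targets vary across epochs.
for epoch in range(3):
    dynamic = [mask_tokens(seq) for seq in corpus]
    print(epoch, dynamic[0])
```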
Whole Word Masking (WWM)
Technique masking every subword token of a word together, rather than individual subword tokens at random, so the model must predict complete words.
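A sketch assuming WordPiece-style tokens, where a "##" prefix marks a continuation piece of the preceding word:

```python
import random

def whole_word_mask(tokens, mask_prob=0.15):
    """Mask all WordPiece pieces of a selected word together."""
    # Group subword indices into whole words.
    words, current = [], []
    for i, tok in enumerate(tokens):
        if tok.startswith("##"):
            current.append(i)
        else:
            if current:
                words.append(current)
            current = [i]
    if current:
        words.append(current)

    masked = list(tokens)
    for word in words:
        if random.random() < mask_prob:
            for i in word:  # mask every piece of the word at once
                masked[i] = "[MASK]"
    return masked

print(whole_word_mask(["un", "##believ", "##able", "story"], mask_prob=0.5))
```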
Span masking
Strategy masking contiguous spans of tokens of variable length, as in SpanBERT, better reflecting natural linguistic units such as phrases and entities.
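A sketch in the spirit of SpanBERT, drawing span lengths from a truncated geometric distribution; the parameters below are assumptions:

```python
import random

def sample_span_length(p=0.2, max_len=10):
    """Draw a span length from a truncated geometric distribution."""
    length = 1
    while random.random() > p and length < max_len:
        length += 1
    return length

def span_mask(tokens, mask_budget=0.15):
    """Mask contiguous spans until ~mask_budget of the tokens are masked."""
    masked = list(tokens)
    target = max(1, int(len(tokens) * mask_budget))
    n_masked = 0
    while n_masked < target:
        length = min(sample_span_length(), len(tokens))
        start = random.randrange(len(tokens) - length + 1)
        for i in range(start, start + length):
            if masked[i] != "[MASK]":
                masked[i] = "[MASK]"
                n_masked += 1
    return masked

print(span_mask("the quick brown fox jumps over the lazy dog".split()))
```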
Masking strategy
Set of rules determining which tokens to mask, with what probability, and how to replace them during MLM training.
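BERT's original recipe is the canonical example: 15% of tokens are selected for prediction; of those, 80% are replaced by [MASK], 10% by a random token, and 10% are left unchanged. A minimal sketch (the toy vocabulary is an assumption):

```python
import random

TOY_VOCAB = ["the", "cat", "sat", "on", "mat", "dog", "ran"]  # assumption

def bert_style_masking(tokens, select_prob=0.15):
    """Apply BERT's 80/10/10 replacement rule to selected tokens."""
    corrupted, labels = list(tokens), [None] * len(tokens)
    for i, tok in enumerate(tokens):
        if random.random() >= select_prob:
            continue
        labels[i] = tok  # loss is computed only at selected positions
        roll = random.random()
        if roll < 0.8:
            corrupted[i] = "[MASK]"                  # 80%: mask symbol
        elif roll < 0.9:
            corrupted[i] = random.choice(TOY_VOCAB)  # 10%: random token
        # else: 10% keep the original token unchanged
    return corrupted, labels

print(bert_style_masking("the cat sat on the mat".split(), select_prob=0.5))
```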