Transformer Neural Networks - 인공지능 용어집

📂

하위 카테고리

Attention Mechanism

Fundamental component enabling transformers to weigh the importance of different parts of a sequence

0 용어

📂

하위 카테고리

Encoders and Decoders

Basic architectural structure of transformers with two main components for processing and generating sequences

8 용어

📂

하위 카테고리

Positional Encoding

Technique enabling the incorporation of positional information into embeddings without using recurrence

5 용어

📂

하위 카테고리

Multi-Head Attention

Extension of the attention mechanism using multiple attention heads in parallel to capture different types of relationships

3 용어

📂

하위 카테고리

BERT

Revolutionary bidirectional pre-trained model based on the transformer encoder

0 용어

📂

하위 카테고리

GPT

Series of generative models based on the transformer decoder for text generation

16 용어

📂

하위 카테고리

Vision Transformers

Applying the transformer architecture to computer vision tasks by treating images as sequences

2 용어

📂

하위 카테고리

Self-Attention

Mechanism allowing each element of a sequence to interact with all other elements of the same sequence

0 용어

📂

하위 카테고리

Cross-Attention

Attention mechanism between two different sequences, essential in translation tasks

0 용어

📂

하위 카테고리

Transformer-XL

Extension of transformers capable of modeling long-term dependencies without context fragmentation

16 용어

📂

하위 카테고리

T5

Text-to-text model unifying all NLP tasks within the same text input-output format

10 용어

📂

하위 카테고리

Sparse Attention

Efficient attention variants reducing computational complexity by limiting connections

1 용어

📂

하위 카테고리

Layer Normalization

Essential normalization technique for stabilizing the training of deep transformers

10 용어

📂

하위 카테고리

Feed-Forward Networks

Fully connected networks applied at each position in transformer layers

8 용어

📂

하위 카테고리

Attention Masks

Mechanism allowing control over which tokens can attend to one another

8 용어

AI 용어집

Attention Mechanism

Encoders and Decoders

Positional Encoding

Multi-Head Attention

BERT

GPT

Vision Transformers

Self-Attention

Cross-Attention

Transformer-XL

T5

Sparse Attention

Layer Normalization

Feed-Forward Networks

Attention Masks

결과를 찾을 수 없습니다