Segmentation with Transformers - AI Glossary

Segmenter

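Semantic segmentation model built entirely from transformers: a Vision Transformer (ViT) encoder over image patches, followed by a linear or mask-transformer decoder. Because self-attention relates every patch to every other patch, it captures long-range context across the image from the first layer onward.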

Learnable Token

Randomly initialized embedding vector learned during training, used in transformer decoders to aggregate contextual information and predict segmentation classes.

Segmentation Transformer Decoder

Module that reconstructs a high-resolution segmentation map from encoder features, using attention mechanisms to refine predictions pixel by pixel.

SegFormer

Simple, efficient segmentation architecture that pairs a hierarchical transformer encoder, which outputs multi-scale features, with a lightweight All-MLP decoder. It is designed to reach strong accuracy with fewer parameters than heavier decoder designs.

Masked Autoencoding (MAE)

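Self-supervised pre-training strategy in which a large fraction of image patches is masked (75% in the original MAE) and the model learns to reconstruct the missing content. The pre-trained encoder is then fine-tuned for segmentation, where the contextual representations it has learned are reused.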
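
The masking step can be illustrated with a minimal NumPy sketch. It only shows how a random subset of patches is kept visible; the encoder, decoder, and reconstruction loss are omitted. The function name `random_patch_mask`, the 196-patch grid, and the 768-dimensional patch vectors are assumptions chosen for illustration.

```python
import numpy as np

def random_patch_mask(num_patches, mask_ratio=0.75, seed=0):
    """Pick which patches stay visible; mask is True where a patch is hidden."""
    rng = np.random.default_rng(seed)
    num_keep = int(round(num_patches * (1.0 - mask_ratio)))
    perm = rng.permutation(num_patches)      # random order of patch indices
    keep_idx = np.sort(perm[:num_keep])      # indices of visible patches
    mask = np.ones(num_patches, dtype=bool)
    mask[keep_idx] = False
    return keep_idx, mask

# Hypothetical input: a 14 x 14 grid of patches, each a 768-dim vector.
patches = np.random.default_rng(1).normal(size=(196, 768))
keep_idx, mask = random_patch_mask(len(patches))
visible = patches[keep_idx]  # only these would be passed to the encoder
```

In this sketch 49 of the 196 patches stay visible, so the encoder would process a quarter of the tokens, while the boolean `mask` marks the positions a decoder would have to reconstruct.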

Query-Based Segmentation

Paradigm where a fixed set of learnable query vectors is used to query image features and directly generate segmentation masks.
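
As a rough illustration of this paradigm, the sketch below lets a small set of query vectors attend to flattened image features and then scores every spatial position against each refined query to obtain one mask per query. It is a simplification under stated assumptions: random vectors stand in for learned queries and encoder features, and the learned projections, multi-head attention, and stacked decoder layers of real models are left out. The name `query_masks` is hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def query_masks(queries, features):
    """queries: (Q, D), features: (N, D) -> refined queries (Q, D), mask logits (Q, N)."""
    d = queries.shape[1]
    attn = softmax(queries @ features.T / np.sqrt(d), axis=-1)  # (Q, N)
    refined = queries + attn @ features   # residual cross-attention update
    mask_logits = refined @ features.T    # one score per query and position
    return refined, mask_logits

rng = np.random.default_rng(0)
H, W, D, Q = 8, 8, 16, 5
features = rng.normal(size=(H * W, D))   # stand-in for encoder output
queries = rng.normal(size=(Q, D))        # learned parameters in a real model
refined, mask_logits = query_masks(queries, features)
masks = (mask_logits > 0).reshape(Q, H, W)  # same as sigmoid(logit) > 0.5
```

Each of the `Q` queries yields its own binary mask over the `H x W` grid; in a full model a separate head would also assign a class to each query.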

Hierarchical Windowing

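Technique in hierarchical vision transformers such as Swin Transformer: self-attention is computed within local windows, and neighboring patches are merged between stages so that later stages cover larger image regions. The result captures both local detail and global context.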
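
The two operations involved, splitting a feature map into windows and merging neighboring patches to move to a coarser scale, can be sketched with NumPy reshapes. This is an illustrative sketch only: the function names are hypothetical, the attention computed inside each window is omitted, and the merge step here simply concatenates each 2 x 2 neighborhood without the linear projection that architectures such as Swin Transformer apply afterwards.

```python
import numpy as np

def window_partition(x, window):
    """(H, W, C) -> (num_windows, window*window, C); H and W must divide by window."""
    H, W, C = x.shape
    x = x.reshape(H // window, window, W // window, window, C)
    x = x.transpose(0, 2, 1, 3, 4)           # group the two window-grid axes first
    return x.reshape(-1, window * window, C)

def patch_merge(x):
    """(H, W, C) -> (H/2, W/2, 4C) by concatenating each 2x2 neighborhood."""
    H, W, C = x.shape
    x = x.reshape(H // 2, 2, W // 2, 2, C).transpose(0, 2, 1, 3, 4)
    return x.reshape(H // 2, W // 2, 4 * C)

x = np.arange(8 * 8 * 3, dtype=float).reshape(8, 8, 3)  # toy 8x8 feature map
stage1_windows = window_partition(x, 4)       # 4 windows of 16 tokens each
merged = patch_merge(x)                       # half the resolution, 4x the channels
stage2_windows = window_partition(merged, 4)  # 1 window now spans the whole map
```

With the window size held fixed at 4, the first stage sees four separate local windows, while after one merge a single window covers the entire image, which is how the hierarchy widens the receptive field.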

Class Embedding

Learned vector representation for each semantic category, used in transformer decoders to guide pixel classification and improve prediction consistency.
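
One common way class embeddings guide pixel classification is a scalar product between normalized patch features and normalized class embeddings, followed by an argmax over classes. The sketch below shows only that scoring step on toy data. The name `classify_patches`, the identity-like class embeddings, and the synthetic patch features are assumptions made for illustration; in a trained model both sets of vectors come out of the decoder.

```python
import numpy as np

def classify_patches(patch_feats, class_embs):
    """patch_feats: (N, D), class_embs: (K, D) -> labels (N,), cosine scores (N, K)."""
    p = patch_feats / np.linalg.norm(patch_feats, axis=1, keepdims=True)
    c = class_embs / np.linalg.norm(class_embs, axis=1, keepdims=True)
    scores = p @ c.T                      # cosine similarity to every class
    return scores.argmax(axis=1), scores

rng = np.random.default_rng(0)
K, D, H, W = 3, 8, 4, 4
class_embs = np.eye(K, D)                 # stand-in for learned class embeddings
true_labels = rng.integers(0, K, size=H * W)
# Toy features: each patch is a slightly noisy copy of its class embedding.
patch_feats = class_embs[true_labels] + 0.01 * rng.normal(size=(H * W, D))
labels, scores = classify_patches(patch_feats, class_embs)
label_map = labels.reshape(H, W)          # patch-level segmentation map
```

The resulting `label_map` is at patch resolution; a full model would upsample the per-class scores to the input resolution before taking the argmax.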
