Vision Transformers (ViT)
Class Token
Special token added to the embedding sequence whose final representation after passing through the Transformer is used for image classification.
← 뒤로Special token added to the embedding sequence whose final representation after passing through the Transformer is used for image classification.
← 뒤로