Vision Transformers (ViT)
Class Token
Special token added to the embedding sequence whose final representation after passing through the Transformer is used for image classification.
← BackSpecial token added to the embedding sequence whose final representation after passing through the Transformer is used for image classification.
← Back