Vision Transformers (ViT)
Class Token
Special token added to the embedding sequence whose final representation after passing through the Transformer is used for image classification.
← GeriSpecial token added to the embedding sequence whose final representation after passing through the Transformer is used for image classification.
← Geri