Vision Transformers (ViT)
Class Token
Special token added to the embedding sequence whose final representation after passing through the Transformer is used for image classification.
← KembaliSpecial token added to the embedding sequence whose final representation after passing through the Transformer is used for image classification.
← Kembali