Multimodal QA
Image Tokenization
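Process of converting an image into a sequence of discrete tokens, typically with a vector-quantized autoencoder such as a VQ-VAE (a VAE variant with a learned codebook; a plain VAE yields continuous latents rather than discrete tokens), so that the image can be processed by Transformer-type architectures.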
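As a rough illustration of the quantization step, the sketch below maps each image patch to the index of its nearest codebook vector. It is a toy under stated assumptions, not a VQ-VAE: a real model first passes the image through a learned encoder and learns the codebook during training, whereas here the patches are raw pixel values and the codebook is written by hand. The names `extract_patches`, `nearest_code`, `tokenize_image`, `TOY_CODEBOOK`, and `TOY_IMAGE` are hypothetical and chosen only for this example.

```python
from typing import List, Sequence

Vector = Sequence[float]


def extract_patches(image: List[List[float]], patch: int) -> List[List[float]]:
    """Split a 2-D grayscale image into flattened, non-overlapping patches.

    Patches are visited in row-major order; each patch is flattened row by row.
    """
    height, width = len(image), len(image[0])
    if height % patch or width % patch:
        raise ValueError("image dimensions must be divisible by the patch size")
    patches = []
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            patches.append(
                [image[top + dy][left + dx] for dy in range(patch) for dx in range(patch)]
            )
    return patches


def nearest_code(vector: Vector, codebook: Sequence[Vector]) -> int:
    """Return the index of the codebook entry closest to `vector`
    under squared Euclidean distance (the vector-quantization lookup)."""
    def sq_dist(a: Vector, b: Vector) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(range(len(codebook)), key=lambda i: sq_dist(vector, codebook[i]))


def tokenize_image(image: List[List[float]], codebook: Sequence[Vector], patch: int = 2) -> List[int]:
    """Turn an image into a sequence of discrete token ids, one per patch."""
    return [nearest_code(p, codebook) for p in extract_patches(image, patch)]


# Assumption: a hand-written 4-entry codebook over flattened 2x2 patches.
# In a trained VQ-VAE these vectors would be learned, not specified manually.
TOY_CODEBOOK = [
    [0.0, 0.0, 0.0, 0.0],  # id 0: dark patch
    [1.0, 1.0, 1.0, 1.0],  # id 1: bright patch
    [0.0, 1.0, 0.0, 1.0],  # id 2: dark left column, bright right column
    [0.0, 0.0, 1.0, 1.0],  # id 3: dark top row, bright bottom row
]

# Assumption: a 4x4 grayscale image with intensities in [0, 1].
TOY_IMAGE = [
    [0.1, 0.0, 0.9, 1.0],
    [0.0, 0.1, 1.0, 0.9],
    [0.0, 0.9, 0.1, 0.0],
    [0.1, 1.0, 0.9, 1.0],
]

tokens = tokenize_image(TOY_IMAGE, TOY_CODEBOOK)
print(tokens)
```

The resulting list of integer ids plays the same role as word-piece ids in text: a Transformer can embed each id and attend over the sequence. The sequence length equals the number of patches, so a larger patch size gives a shorter sequence at the cost of coarser detail.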