Multimodal Models
Multimodal Tokenization
The process of converting inputs from different modalities (images, audio, video) into token sequences that a Transformer architecture can process.
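As an illustration for the image case, one common approach is to cut the image into fixed-size patches, flatten each patch, and project it linearly to the model's embedding width, so that every patch becomes one token. The sketch below assumes NumPy and a random projection matrix standing in for learned weights; the names `patchify`, `embed_patches`, and `projection`, and the sizes used, are hypothetical choices for this example and not part of any particular library.

```python
import numpy as np


def patchify(image: np.ndarray, patch_size: int) -> np.ndarray:
    """Split an (H, W, C) image into flattened, non-overlapping patches.

    Returns an array of shape (num_patches, patch_size * patch_size * C),
    with patches ordered row by row across the image.
    """
    h, w, c = image.shape
    if h % patch_size or w % patch_size:
        raise ValueError("image height and width must be divisible by patch_size")
    grid_h, grid_w = h // patch_size, w // patch_size
    # Split both spatial axes into (grid index, offset inside the patch).
    patches = image.reshape(grid_h, patch_size, grid_w, patch_size, c)
    # Bring the two grid axes to the front: (grid_h, grid_w, p, p, c).
    patches = patches.transpose(0, 2, 1, 3, 4)
    # One flattened vector per patch.
    return patches.reshape(grid_h * grid_w, patch_size * patch_size * c)


def embed_patches(patches: np.ndarray, projection: np.ndarray) -> np.ndarray:
    """Map each flattened patch to an embedding vector (one token per patch)."""
    return patches @ projection


# Hypothetical sizes: a 32x32 RGB image, 8x8 patches, 64-dimensional tokens.
rng = np.random.default_rng(0)
image = rng.random((32, 32, 3))
patches = patchify(image, patch_size=8)
# Random weights stand in for a projection that would normally be learned.
projection = rng.normal(size=(patches.shape[1], 64))
tokens = embed_patches(patches, projection)
print(patches.shape, tokens.shape)
```

The resulting `tokens` array is a sequence of 16 vectors, which is the shape of input a Transformer expects. Audio and video are typically handled in a comparable way, by slicing the signal into fixed-size segments before embedding, though the details vary by model.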