Multi-Modal Transformers
Multi-Modal Transformer
Extended Transformer architecture capable of simultaneously processing multiple data modalities (text, image, audio) using cross-attention mechanisms to integrate inter-modal information.
← Back