Multi-Modal Transformers
Cross-Modal Alignment
Training objective aimed at semantically aligning representations of different modalities in a shared space, enabling correspondence between visual and linguistic concepts.
← Kembali