Multimodal Translation
Multimodal Embeddings
Vector representations in a shared space where different modalities (text, image, audio) can be compared and manipulated mathematically. These embeddings enable cross-modal semantic operations like search and similarity.
← Wstecz