Multimodal QA
Multimodal Information Retrieval
Task of retrieving relevant documents (e.g., images) from a query in another modality (e.g., text), based on their similarity in a shared embedding space.
← Back