Multimodal QA
Cross-modality
Ability of a system to interpret and relate information from different modalities, such as text and images, to build a richer understanding of context.