Multimodal interpretability
Inter-modality Semantic Alignment
A technique for establishing semantic correspondences between elements of different modalities (e.g., linking a word to an image region, or a sound to an action). These correspondences are crucial for a model to relate information across modalities and to produce coherent explanations.
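As a rough illustration of what such a correspondence can look like in practice, the sketch below assumes that words and image regions have already been embedded into a shared vector space, and links each word to the region with the highest cosine similarity. The embedding values and the names `text_emb`, `region_emb`, and `align` are hypothetical and chosen only for this example. Real systems typically learn these embeddings and may use attention weights or other alignment scores instead.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def align(text_emb, region_emb):
    """Map each word to its most similar image region and the score."""
    alignment = {}
    for word, word_vec in text_emb.items():
        scores = {
            region: cosine(word_vec, region_vec)
            for region, region_vec in region_emb.items()
        }
        best = max(scores, key=scores.get)
        alignment[word] = (best, scores[best])
    return alignment


# Hypothetical toy embeddings in a shared 3-dimensional space.
text_emb = {
    "dog": [0.9, 0.1, 0.0],
    "ball": [0.1, 0.9, 0.1],
}
region_emb = {
    "region_0": [0.8, 0.2, 0.1],
    "region_1": [0.0, 1.0, 0.2],
    "region_2": [0.1, 0.1, 0.9],
}

alignment = align(text_emb, region_emb)
for word, (region, score) in alignment.items():
    print(f"{word} -> {region} (cosine={score:.2f})")
```

With these toy vectors, "dog" is linked to `region_0` and "ball" to `region_1`. The resulting word-to-region map is the kind of correspondence an explanation can point to when justifying a model's output.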