Multimodal interpretability
Multimodal Salience Map
Visualization that highlights the most influential regions or segments of each modality (pixels of an image, words of a text, audio segments) for a specific model decision, often by overlaying contributions on the original data.
← Geri