Creative Proxy Tasks
Proxy Multimodal Alignment
Self-supervised task learning to align different modalities (text, image, sound) by exploiting their natural co-occurrences.
← GeriSelf-supervised task learning to align different modalities (text, image, sound) by exploiting their natural co-occurrences.
← Geri