Multimodal Models
Multimodal Conditional Generation
Generation task where the output (e.g., text, image) is produced based on one or more inputs of different modalities, such as describing an image or creating an image from text.
← Terug