Multimodal Diffusion Models
Text-guided diffusion
A technique in which a text description conditions the diffusion process, steering each denoising step so that the generated image, audio, or video matches the description.
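One common way to apply this kind of text conditioning is classifier-free guidance, which blends an unconditional noise prediction with a text-conditioned one. The sketch below is only a toy illustration under stated assumptions: `toy_denoiser`, `guided_noise`, and `sample` are hypothetical names, the "text embedding" is a plain list of floats, and the update loop is a simplified deterministic step rather than a real noise schedule or trained network.

```python
import random


def toy_denoiser(x, text_embedding=None):
    """Hypothetical stand-in for a trained noise-prediction network.

    Returns the offset of x from a target: the text embedding when one is
    given, otherwise the origin (the unconditional case).
    """
    target = text_embedding if text_embedding is not None else [0.0] * len(x)
    return [xi - ti for xi, ti in zip(x, target)]


def guided_noise(x, text_embedding, guidance_scale):
    """Classifier-free guidance: eps_uncond + w * (eps_cond - eps_uncond)."""
    eps_uncond = toy_denoiser(x)
    eps_cond = toy_denoiser(x, text_embedding)
    return [u + guidance_scale * (c - u) for u, c in zip(eps_uncond, eps_cond)]


def sample(text_embedding, steps=50, step_size=0.2, guidance_scale=1.0, seed=0):
    """Start from Gaussian noise and repeatedly remove the guided noise estimate."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in text_embedding]
    for _ in range(steps):
        eps = guided_noise(x, text_embedding, guidance_scale)
        # Simplified deterministic update (assumption: no noise schedule).
        x = [xi - step_size * e for xi, e in zip(x, eps)]
    return x


if __name__ == "__main__":
    # With guidance_scale=1.0 the sample settles near the text embedding.
    print(sample([1.0, -2.0]))
```

In this toy setup a `guidance_scale` above 1.0 pushes the result further in the direction of the text embedding, which loosely mirrors how stronger guidance trades diversity for closer adherence to the prompt in real models.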