Latent Diffusion Models
Cross-Attention Conditioning
Attention mechanism that allows the latent diffusion model to integrate heterogeneous information, such as text (CLIP embeddings), to guide image generation in a flexible and precise manner.
← Indietro