Hybrid Diffusion-Transformer Models
Adaptive Layer Normalization
Normalization method conditioned by time embeddings in Diffusion-Transformer architectures to stabilize training.
← GeriNormalization method conditioned by time embeddings in Diffusion-Transformer architectures to stabilize training.
← Geri