Multi-Modal Transformers
Perceiver IO
General Transformer architecture capable of processing any combination of modalities using a cross-attention network between input data and a set of learned latents.
← Geri