Attention Mechanisms Variants
Reformer Attention
Efficient implementation using LSH (Locality Sensitive Hashing) to limit attention to similar tokens only. Drastically reduces complexity while preserving important semantic relationships.
← Indietro