Vision Transformers for Detection
Visual Self-Attention
Mechanism that lets each image patch weigh the relevance of every other patch in the image, so the model captures global dependencies without convolution.
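As a rough illustration of the definition above, the sketch below computes single-head scaled dot-product self-attention over a handful of patch embeddings using only the Python standard library. The function names (`self_attention`, `matmul`, `softmax`, `rand_matrix`), the random projection matrices, and the sizes (4 patches, 8 dimensions) are assumptions made for the example, not part of any particular detection model.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def self_attention(patches, w_q, w_k, w_v):
    """Single-head self-attention over patch embeddings (illustrative only)."""
    q = matmul(patches, w_q)  # queries: what each patch is looking for
    k = matmul(patches, w_k)  # keys: what each patch offers
    v = matmul(patches, w_v)  # values: the content that gets mixed
    scale = math.sqrt(len(q[0]))
    # scores[i][j]: how strongly patch i attends to patch j
    scores = [[sum(a * b for a, b in zip(qi, kj)) / scale for kj in k] for qi in q]
    weights = [softmax(row) for row in scores]  # each row sums to 1
    return matmul(weights, v), weights


def rand_matrix(rows, cols):
    """Hypothetical stand-in for learned weights or real patch embeddings."""
    return [[random.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


random.seed(0)
num_patches, dim = 4, 8  # assumed toy sizes
patches = rand_matrix(num_patches, dim)
w_q, w_k, w_v = (rand_matrix(dim, dim) for _ in range(3))

output, weights = self_attention(patches, w_q, w_k, w_v)
```

Each row of `weights` is a distribution over all patches, which is what allows a patch in one corner of the image to draw information directly from any other patch in a single step.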