AI Glossary
The complete dictionary of artificial intelligence
162 categories
2,032 subcategories
23,060 terms
Terms
Visual Self-Attention
Mechanism in which each image patch attends to every other patch and weights it by relevance, capturing global dependencies without convolution.
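As a rough illustration, the sketch below computes single-head scaled dot-product self-attention over a handful of patch embeddings. It assumes identity query/key/value projections to stay short (a real ViT learns these projections), and the names `self_attention` and `patches` are hypothetical.

```python
import math

def softmax(row):
    # Numerically stable softmax over one row of scores.
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(patches):
    """Single-head self-attention with identity projections (a simplification)."""
    d = len(patches[0])
    outputs, weights = [], []
    for query in patches:
        # Score this patch against every patch, itself included.
        scores = [sum(a * b for a, b in zip(query, key)) / math.sqrt(d)
                  for key in patches]
        w = softmax(scores)
        weights.append(w)
        # Output is the relevance-weighted mix of all patch embeddings.
        outputs.append([sum(w[j] * patches[j][c] for j in range(len(patches)))
                        for c in range(d)])
    return outputs, weights

# Four toy patch embeddings: two "horizontal-like", two "vertical-like".
patches = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
out, weights = self_attention(patches)
```

Each row of `weights` sums to 1, and patch 0 places more weight on the similar patch 1 than on the dissimilar patch 2, regardless of where the patches sit in the image.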
Cross-Attention Detection
Mechanism in which learned object queries attend to image features so that objects are localized and classified simultaneously.
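A minimal sketch of the idea, assuming a single head and identity projections: the queries come from one set (object queries) while keys and values come from another (image features). The function name `cross_attention` and the toy values are hypothetical.

```python
import math

def cross_attention(queries, features):
    """Each object query attends over all image features (single head, no learned projections)."""
    d = len(features[0])
    outputs = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, f)) / math.sqrt(d) for f in features]
        peak = max(scores)
        exps = [math.exp(s - peak) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # The query's output is a weighted summary of the image features.
        outputs.append([sum(w * f[c] for w, f in zip(weights, features))
                        for c in range(d)])
    return outputs

# Three toy image features and two toy object queries.
features = [[4.0, 0.0], [0.0, 4.0], [4.0, 0.0]]
queries = [[1.0, 0.0], [0.0, 1.0]]
decoded = cross_attention(queries, features)
```

The output has one vector per query, not per image feature, which is why a fixed number of queries yields a fixed number of candidate detections.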
Token-to-Token ViT
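ViT variant that tokenizes progressively: tokens are repeatedly reshaped into an image-like grid and overlapping neighbors are recombined into single tokens, which preserves local structural information while shortening the sequence.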
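The recombination step can be sketched as an overlapping window merge. This is a simplified illustration only: it omits padding and the transformer layer that T2T-ViT applies between steps, and `soft_split` with its parameters is a hypothetical name.

```python
def soft_split(tokens, height, width, kernel=3, stride=2):
    """Reshape a token sequence into a grid, then merge overlapping windows into new tokens."""
    assert len(tokens) == height * width
    # Restructure: lay the flat token sequence out as a height x width grid.
    grid = [tokens[r * width:(r + 1) * width] for r in range(height)]
    merged = []
    for top in range(0, height - kernel + 1, stride):
        for left in range(0, width - kernel + 1, stride):
            window = []
            for r in range(top, top + kernel):
                for c in range(left, left + kernel):
                    # Concatenate neighboring token vectors into one longer token.
                    window.extend(grid[r][c])
            merged.append(window)
    return merged

# 25 one-dimensional toy tokens on a 5 x 5 grid.
tokens = [[float(i)] for i in range(25)]
merged = soft_split(tokens, 5, 5)
```

With a stride smaller than the kernel, adjacent windows share tokens (token 2 appears in the first two merged tokens here), so neighborhood structure is carried into the shorter sequence.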
Transformer Decoder Head
Final module of DETR architectures in which learned object queries attend to encoder features, and each resulting query embedding is mapped to a class prediction and a bounding box.
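The last step, mapping one decoded query embedding to a class and a box, might look like the sketch below. It assumes a single linear layer for the box branch (DETR uses a small MLP there) and hand-picked weights; `predict` and all weight values are hypothetical.

```python
import math

def linear(x, weight, bias):
    # One output per weight row: dot(row, x) + bias.
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weight, bias)]

def predict(embedding, class_w, class_b, box_w, box_b):
    """Map one decoded query embedding to class probabilities and a normalized box."""
    logits = linear(embedding, class_w, class_b)
    peak = max(logits)
    exps = [math.exp(l - peak) for l in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Sigmoid keeps (cx, cy, w, h) in the 0..1 range, relative to image size.
    box = [1.0 / (1.0 + math.exp(-v)) for v in linear(embedding, box_w, box_b)]
    return probs, box

embedding = [1.0, -1.0]
# Two object classes plus a trailing "no object" class (toy weights).
class_w = [[2.0, 0.0], [0.0, 2.0], [0.0, 0.0]]
class_b = [0.0, 0.0, 0.0]
box_w = [[0.0, 0.0], [0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
box_b = [0.0, 0.0, 0.0, 0.0]
probs, box = predict(embedding, class_w, class_b, box_w, box_b)
```

The extra "no object" class lets queries that match nothing opt out, so the fixed set of queries does not have to produce a detection each.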