AI Glossary

The Complete Dictionary of Artificial Intelligence

162 categories · 2,032 subcategories · 23,060 terms
📂 Subcategories

Attention Mechanism

Allows the model to weigh the importance of different parts of the input during processing.

10 terms

Self-Attention

Mechanism where each element of the sequence attends to all other elements of the same sequence.

7 terms
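
A minimal single-head sketch in NumPy (function and weight names are illustrative assumptions, not from any library); the same sequence supplies queries, keys, and values:

```python
import numpy as np

def self_attention(x, Wq, Wk, Wv):
    """Single-head self-attention over one sequence.
    x: (seq_len, d_model); Wq/Wk/Wv: (d_model, d_k) learned projections."""
    Q, K, V = x @ Wq, x @ Wk, x @ Wv             # same sequence, three projections
    scores = Q @ K.T / np.sqrt(K.shape[-1])      # every position scores every position
    w = np.exp(scores - scores.max(-1, keepdims=True))
    w /= w.sum(-1, keepdims=True)                # row-wise softmax
    return w @ V                                 # weighted sum of value vectors
```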

Multi-Head Attention

Extension of self-attention using multiple attention heads in parallel to capture different types of relationships.

8 terms
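
A sketch of the head-splitting idea (names illustrative; assumes d_model is divisible by num_heads): the model dimension is split into independent heads, attention runs per head, and the outputs are concatenated and projected:

```python
import numpy as np

def multi_head_attention(x, Wq, Wk, Wv, Wo, num_heads):
    """x: (seq_len, d_model); Wq/Wk/Wv/Wo: (d_model, d_model)."""
    seq_len, d_model = x.shape
    d_head = d_model // num_heads

    def split(t):  # (seq_len, d_model) -> (num_heads, seq_len, d_head)
        return t.reshape(seq_len, num_heads, d_head).transpose(1, 0, 2)

    Q, K, V = split(x @ Wq), split(x @ Wk), split(x @ Wv)
    scores = Q @ K.transpose(0, 2, 1) / np.sqrt(d_head)   # per-head attention
    w = np.exp(scores - scores.max(-1, keepdims=True))
    w /= w.sum(-1, keepdims=True)
    out = (w @ V).transpose(1, 0, 2).reshape(seq_len, d_model)  # concat heads
    return out @ Wo
```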

Positional Encoding

Technique to inject position information into token embeddings, since attention itself has no inherent notion of order (unlike an RNN).

19 terms
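
The sinusoidal scheme from the original Transformer paper, sketched in NumPy (assumes an even d_model):

```python
import numpy as np

def sinusoidal_positional_encoding(seq_len, d_model):
    """PE[pos, 2i] = sin(pos / 10000^(2i/d_model)); PE[pos, 2i+1] = cos(same)."""
    pos = np.arange(seq_len)[:, None]            # (seq_len, 1)
    i = np.arange(0, d_model, 2)[None, :]        # even feature indices
    angles = pos / np.power(10000.0, i / d_model)
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles)
    pe[:, 1::2] = np.cos(angles)
    return pe                                    # added to the token embeddings
```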

Encoder-Decoder Architecture

Fundamental structure of Transformers: an encoder that builds a representation of the input, paired with a decoder that generates the output.

4 terms

Scaled Dot-Product Attention

The basic mathematical form of attention in Transformers, with scores scaled by the square root of the key dimension.

5 terms
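
The standard formula, where Q, K, and V are the query, key, and value matrices and d_k is the key dimension:

```latex
\mathrm{Attention}(Q, K, V) = \operatorname{softmax}\!\left(\frac{QK^{\top}}{\sqrt{d_k}}\right)V
```

Scaling by √d_k keeps the dot products from growing with dimension and pushing the softmax into regions with vanishing gradients.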

Feed-Forward Networks

Position-wise fully connected networks applied after each attention layer in Transformers.

16 terms
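
A sketch of the position-wise block with the original ReLU activation (names illustrative); it is applied to every token independently, and d_ff is typically about four times d_model:

```python
import numpy as np

def feed_forward(x, W1, b1, W2, b2):
    """x: (seq_len, d_model); W1: (d_model, d_ff); W2: (d_ff, d_model)."""
    return np.maximum(0.0, x @ W1 + b1) @ W2 + b2   # linear -> ReLU -> linear
```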

Layer Normalization

Normalization technique applied in Transformers to stabilize training.

6 terms
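
A minimal sketch (names illustrative): each token's feature vector is normalized to zero mean and unit variance, then rescaled and shifted by learned parameters:

```python
import numpy as np

def layer_norm(x, gamma, beta, eps=1e-5):
    """Normalize over the feature dimension of x: (seq_len, d_model)."""
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return gamma * (x - mean) / np.sqrt(var + eps) + beta
```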

Attention Masks

Mechanism to control which tokens can attend to which other tokens.

19 terms
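
A sketch of the common causal (decoder-style) mask: disallowed pairs receive -inf before the softmax, so they end up with zero attention weight (names illustrative):

```python
import numpy as np

def causal_mask(seq_len):
    """Position i may attend only to positions j <= i."""
    allowed = np.tril(np.ones((seq_len, seq_len), dtype=bool))
    return np.where(allowed, 0.0, -np.inf)   # added to the raw attention scores

# e.g. scores = Q @ K.T / np.sqrt(d_k) + causal_mask(seq_len)
```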

Vision Transformers (ViT)

Application of Transformer architecture to image processing by dividing images into patches.

14 terms
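
A sketch of the patching step (names illustrative; assumes height and width are divisible by the patch size); each flattened patch is subsequently projected to a token embedding:

```python
import numpy as np

def patchify(image, patch_size):
    """(H, W, C) image -> (num_patches, patch_size * patch_size * C)."""
    H, W, C = image.shape
    p = patch_size
    patches = image.reshape(H // p, p, W // p, p, C).swapaxes(1, 2)
    return patches.reshape(-1, p * p * C)    # one row per patch, raster order
```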

BERT Architecture

Encoder-only Transformer pre-trained with a masked language modeling objective.

11 terms

GPT Architecture

Decoder-only Transformer optimized for autoregressive text generation.

8 terms

Cross-Attention

Attention mechanism between two different sequences in encoder-decoder models.

5 terms

Sparse Attention

Attention variant that reduces complexity by computing scores for only a selected subset of token pairs.

18 terms

Hierarchical Attention

Multi-level architecture applying attention at different granularity scales.

12 terms

Attention Visualization

Techniques to interpret and visualize attention weights in Transformers.

17 terms

Transformer Optimization

Specific methods for effective training of large Transformer models.

16 terms

Multi-Modal Transformers

Transformer architectures extended to process multiple data modalities simultaneously.

18 terms

Efficient Transformers

Transformer variants optimized to reduce computational complexity.

9 terms

Attention Mechanisms Variants

Alternative approaches and improvements to the attention mechanism beyond standard dot-product attention.

9 terms