🏠 Hem
Benchmarkar
📊 Alla benchmarkar 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List-applikationer 🎨 Kreativa fria sidor 🎯 FSACB - Ultimata uppvisningen 🌍 Översättningsbenchmark
Modeller
🏆 Topp 10 modeller 🆓 Gratis modeller 📋 Alla modeller ⚙️ Kilo Code
Resurser
💬 Promptbibliotek 📖 AI-ordlista 🔗 Användbara länkar

AI-ordlista

Den kompletta ordlistan över AI

162
kategorier
2 032
underkategorier
23 060
termer
📂
underkategorier

Self-Attention

Fundamental mechanism allowing transformers to dynamically compute the relative importance of each element in a sequence compared to others.

2 termer
📂
underkategorier

Multi-Head Attention

Extension of self-attention where multiple attention heads operate in parallel to capture different types of relationships in the data.

4 termer
📂
underkategorier

Positional Encoding

Technique that incorporates sequential position information into embeddings to compensate for the absence of recurrence in transformers.

6 termer
📂
underkategorier

Encoder-Decoder Architecture

Fundamental structure of original transformers combining an encoder to process input and a decoder to generate output.

8 termer
📂
underkategorier

BERT (Bidirectional Encoder Representations)

Family of pre-trained models based on the encoder-only architecture with bidirectional context understanding.

10 termer
📂
underkategorier

GPT (Generative Pre-trained Transformer)

Decoder-only architecture optimized for autoregressive text generation, forming the basis of large language models.

5 termer
📂
underkategorier

Vision Transformers (ViT)

Application of transformer architectures to image processing by dividing images into patches and treating them as sequences.

11 termer
📂
underkategorier

Sparse Attention Mechanisms

Variants of attention reducing computational complexity by limiting connections between sequence elements.

2 termer
📂
underkategorier

Cross-Attention

Attention mechanism where queries come from one sequence while keys and values come from a different sequence.

2 termer
📂
underkategorier

Transformer Scaling Laws

Empirical principles describing how transformer performance evolves with model size, data, and computation.

18 termer
📂
underkategorier

Attention Head Analysis

Study of the specialized roles of different attention heads in transformers to understand their internal functioning.

19 termer
📂
underkategorier

Hierarchical Attention

Hierarchical attention architecture organized across multiple levels to process complex structured data.

9 termer
🔍

Inga resultat hittades