🏠 Trang chủ
Benchmark
📊 Tất cả benchmark 🦖 Khủng long v1 🦖 Khủng long v2 ✅ Ứng dụng To-Do List 🎨 Trang tự do sáng tạo 🎯 FSACB - Trình diễn cuối cùng 🌍 Benchmark dịch thuật
Mô hình
🏆 Top 10 mô hình 🆓 Mô hình miễn phí 📋 Tất cả mô hình ⚙️ Kilo Code
Tài nguyên
💬 Thư viện prompt 📖 Thuật ngữ AI 🔗 Liên kết hữu ích

Thuật ngữ AI

Từ điển đầy đủ về Trí tuệ nhân tạo

162
danh mục
2.032
danh mục con
23.060
thuật ngữ
📖
thuật ngữ

Post-LN Transformer

Original transformer architecture where layer normalization is applied after the attention and feed-forward layers, requiring more precise learning rate tuning.

📖
thuật ngữ

Gamma and Beta

Learnable parameters of layer normalization allowing respectively to scale and shift the normalized values to preserve the network's representational power.

📖
thuật ngữ

Zero Centering

Process of subtracting the mean of activations in layer normalization to center data around zero, facilitating gradient optimization.

📖
thuật ngữ

Unit Variance

Standardization of activations to have unit variance in layer normalization, ensuring numerical stability and constant gradients across layers.

📖
thuật ngữ

Gradient Stability

Property of layer normalization that maintains stable gradients during backpropagation, avoiding exploding or vanishing gradient problems in deep transformers.

📖
thuật ngữ

Epsilon Parameter

Small constant added to the denominator in layer normalization to prevent division by zero and ensure numerical stability when computing normalized variance.

📖
thuật ngữ

Activation Distribution

Distribution of activation values in a layer that layer normalization maintains constant, facilitating convergence and optimization of transformer networks.

📖
thuật ngữ

Scale Invariance

Property of layer normalization that makes the model insensitive to input scale changes, improving model robustness to data variations.

📖
thuật ngữ

Training Speed

Significant acceleration of transformer training through layer normalization, enabling higher learning rates and faster convergence.

📖
thuật ngữ

Hidden State Normalization

Application of layer normalization to transformer hidden states to maintain stable activations across different encoder and decoder layers.

🔍

Không tìm thấy kết quả