🏠 Trang chủ
Benchmark
📊 Tất cả benchmark 🦖 Khủng long v1 🦖 Khủng long v2 ✅ Ứng dụng To-Do List 🎨 Trang tự do sáng tạo 🎯 FSACB - Trình diễn cuối cùng 🌍 Benchmark dịch thuật
Mô hình
🏆 Top 10 mô hình 🆓 Mô hình miễn phí 📋 Tất cả mô hình ⚙️ Kilo Code
Tài nguyên
💬 Thư viện prompt 📖 Thuật ngữ AI 🔗 Liên kết hữu ích

Thuật ngữ AI

Từ điển đầy đủ về Trí tuệ nhân tạo

162
danh mục
2.032
danh mục con
23.060
thuật ngữ
📖
thuật ngữ

TF32 (TensorFloat-32)

NVIDIA's proprietary 19-bit hybrid format combining 8-bit exponent from FP16 and 10-bit mantissa from FP32, optimized for matrix operations on Ampere and Hopper GPU Tensor Cores.

📖
thuật ngữ

Dynamic Range

Range of representable values between the smallest normalized number and the largest floating point number, critical in precision selection as FP16 has a limited dynamic range (65504) compared to FP32 (3.4×10³⁸).

📖
thuật ngữ

Post-Training Quantization (PTQ)

Process of converting a pre-trained full-precision model to reduced precision (FP16, INT8, INT4) without retraining, using calibration techniques to determine optimal scale and bias factors.

📖
thuật ngữ

Fused Multiply-Add (FMA)

Hardware operation combining multiplication and addition into a single instruction (a×b+c) with single rounding, fundamental for accelerating matrix calculations in mixed precision and reducing cumulative rounding errors.

📖
thuật ngữ

Numerical Stability

Property of an algorithm to maintain calculation precision in the face of rounding errors and overflow/underflow, particularly critical in mixed precision where reduced dynamic range can destabilize certain calculations.

📖
thuật ngữ

INT8 Quantization

Technique for compressing neural weights and activations to 8-bit signed integers (-128 to 127) with scale factors and zero-points, offering up to 4x memory reduction and significant acceleration on compatible hardware.

📖
thuật ngữ

Precision Matrix Operations

Set of linear operations (GEMM, convolution) where different parts of the calculation use different precisions - typically accumulation in FP32 with multiplication in FP16/BF16 to optimize throughput on modern GPUs.

🔍

Không tìm thấy kết quả