🏠 Home
Benchmark Hub
📊 All Benchmarks 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List Applications 🎨 Creative Free Pages 🎯 FSACB - Ultimate Showcase 🌍 Translation Benchmark
Models
🏆 Top 10 Models 🆓 Free Models 📋 All Models ⚙️ Kilo Code
Resources
💬 Prompts Library 📖 AI Glossary 🔗 Useful Links

AI Glossary

The complete dictionary of Artificial Intelligence

162
categories
2,032
subcategories
23,060
terms
📖
terms

Structurally Constrained Quantization

Neural compression technique that applies specific quantization constraints while preserving the architecture and structural relationships of the neural network.

📖
terms

Quantization Aware Training (QAT)

Training method that simulates the effects of quantization during the learning process to minimize post-quantization accuracy loss.

📖
terms

Post-Training Quantization (PTQ)

Quantization process applied after the complete training of the model without requiring additional retraining.

📖
terms

Quantization Granularity

Level of detail at which parameters are grouped for quantization, ranging from tensor level to channel or group level.

📖
terms

Quantization Bias

Systematic error introduced when converting floating-point numbers to low-precision representation.

📖
terms

Quantization Calibration

Process of determining optimal quantization parameters (scale and zero-point) by analyzing the distribution of activations.

📖
terms

Weight Clustering

Technique grouping similar weights into clusters sharing a common representative value to reduce complexity.

📖
terms

Quantized Precision

Number of bits used to represent each quantized value, typically 8, 4, 2 or 1 bit in extreme methods.

📖
terms

Dynamic Range

Range of values that the quantized format can represent, crucial for preserving important information from the model.

📖
terms

Low-Precision Quantization

Extreme compression using less than 8 bits per parameter, requiring advanced techniques to maintain performance.

📖
terms

Weight Reconstruction

Optimization process aiming to minimize the error between original weights and their reconstructed quantized versions.

📖
terms

Quantization Structure Optimization

Algorithms automatically determining the optimal quantization configuration for each part of the network.

📖
terms

Quantization Sensitivity

Measure of the impact of quantization on the performance of a specific layer, guiding heterogeneous quantization strategies.

📖
terms

Quantization with Topology Constraints

Approach preserving the topological properties of the network while applying specific quantization constraints.

🔍

No results found