🏠 Home
Benchmark Hub
📊 All Benchmarks 🦖 Dinosaur v1 🦖 Dinosaur v2 ✅ To-Do List Applications 🎨 Creative Free Pages 🎯 FSACB - Ultimate Showcase 🌍 Translation Benchmark
Models
🏆 Top 10 Models 🆓 Free Models 📋 All Models ⚙️ Kilo Code
Resources
💬 Prompts Library 📖 AI Glossary 🔗 Useful Links

AI Glossary

The complete dictionary of Artificial Intelligence

162
categories
2,032
subcategories
23,060
terms
📖
terms

Activation Quantization

Process of reducing the precision of activation values propagated through the neural network, essential for minimizing memory usage and optimizing computations on resource-constrained microcontrollers.

📖
terms

Quantization-Aware Training

Approach where quantization is simulated during the training phase to minimize accuracy loss, resulting in more robust models once quantized for embedded devices.

📖
terms

8-bit Precision

Numerical representation format using 8 bits per parameter, offering an optimal balance between precision and efficiency for most deep learning applications on IoT devices.

📖
terms

Neural Network Pruning

Compression technique that selectively removes the least important weights or neurons from the network, significantly reducing model size while preserving essential performance.

📖
terms

Extreme Binarization

Extreme form of quantization that reduces all weights and activations to 1 bit (+1/-1), maximizing compression and drastically accelerating computations on specialized IoT hardware.

📖
terms

Fixed-Point Representation

Numerical format where numbers are represented with a fixed number of bits for the integer and fractional parts, preferred in IoT devices for its hardware simplicity and energy efficiency.

📖
terms

Edge AI Optimization

Set of techniques combining quantization, compression, and algorithmic optimization to efficiently adapt AI models to the strict constraints of edge and IoT devices.

📖
terms

Structured Weight Pruning

Pruning variant that removes entire structures (filters, channels, or attention heads) rather than individual weights, generating more efficient models on IoT hardware.

📖
terms

Sub-8-bit Quantization

Advanced techniques reducing precision below 8 bits (4, 2, or even 1 bit) for maximum compression, suitable for extremely constrained IoT applications.

📖
terms

Tensor Factorization

Mathematical technique decomposing large-dimensional weight tensors into products of smaller tensors, drastically reducing the number of parameters for IoT deployment.

📖
terms

Compressed Weight Encoding

Compression algorithm applied after quantization using techniques like Huffman or range encoding to further reduce model storage size on IoT devices.

🔍

No results found