🏠 홈
벤치마크
📊 모든 벤치마크 🦖 공룡 v1 🦖 공룡 v2 ✅ 할 일 목록 앱 🎨 창의적인 자유 페이지 🎯 FSACB - 궁극의 쇼케이스 🌍 번역 벤치마크
모델
🏆 톱 10 모델 🆓 무료 모델 📋 모든 모델 ⚙️ 킬로 코드 모드
리소스
💬 프롬프트 라이브러리 📖 AI 용어 사전 🔗 유용한 링크

AI 용어집

인공지능 완전 사전

162
카테고리
2,032
하위 카테고리
23,060
용어
📖
용어

Activation Quantization

Process of reducing the precision of activation values propagated through the neural network, essential for minimizing memory usage and optimizing computations on resource-constrained microcontrollers.

📖
용어

Quantization-Aware Training

Approach where quantization is simulated during the training phase to minimize accuracy loss, resulting in more robust models once quantized for embedded devices.

📖
용어

8-bit Precision

Numerical representation format using 8 bits per parameter, offering an optimal balance between precision and efficiency for most deep learning applications on IoT devices.

📖
용어

Neural Network Pruning

Compression technique that selectively removes the least important weights or neurons from the network, significantly reducing model size while preserving essential performance.

📖
용어

Extreme Binarization

Extreme form of quantization that reduces all weights and activations to 1 bit (+1/-1), maximizing compression and drastically accelerating computations on specialized IoT hardware.

📖
용어

Fixed-Point Representation

Numerical format where numbers are represented with a fixed number of bits for the integer and fractional parts, preferred in IoT devices for its hardware simplicity and energy efficiency.

📖
용어

Edge AI Optimization

Set of techniques combining quantization, compression, and algorithmic optimization to efficiently adapt AI models to the strict constraints of edge and IoT devices.

📖
용어

Structured Weight Pruning

Pruning variant that removes entire structures (filters, channels, or attention heads) rather than individual weights, generating more efficient models on IoT hardware.

📖
용어

Sub-8-bit Quantization

Advanced techniques reducing precision below 8 bits (4, 2, or even 1 bit) for maximum compression, suitable for extremely constrained IoT applications.

📖
용어

Tensor Factorization

Mathematical technique decomposing large-dimensional weight tensors into products of smaller tensors, drastically reducing the number of parameters for IoT deployment.

📖
용어

Compressed Weight Encoding

Compression algorithm applied after quantization using techniques like Huffman or range encoding to further reduce model storage size on IoT devices.

🔍

결과를 찾을 수 없습니다