🏠 Trang chủ
Benchmark
📊 Tất cả benchmark 🦖 Khủng long v1 🦖 Khủng long v2 ✅ Ứng dụng To-Do List 🎨 Trang tự do sáng tạo 🎯 FSACB - Trình diễn cuối cùng 🌍 Benchmark dịch thuật
Mô hình
🏆 Top 10 mô hình 🆓 Mô hình miễn phí 📋 Tất cả mô hình ⚙️ Kilo Code
Tài nguyên
💬 Thư viện prompt 📖 Thuật ngữ AI 🔗 Liên kết hữu ích

Thuật ngữ AI

Từ điển đầy đủ về Trí tuệ nhân tạo

162
danh mục
2.032
danh mục con
23.060
thuật ngữ
📖
thuật ngữ

Memory Coalescing

GPU optimization technique where contiguous memory accesses from threads are grouped into single transactions, reducing memory bandwidth and increasing throughput.

📖
thuật ngữ

Cache Blocking

Data partitioning strategy into cache-sized blocks to maximize local data reuse and minimize cache misses.

📖
thuật ngữ

NUMA-Aware Allocation

Memory allocation that considers Non-Uniform Memory Access architecture to place data near the cores that frequently use them, reducing access latency.

📖
thuật ngữ

Memory Pooling

Pre-allocation of a large memory block subdivided into reusable objects, eliminating the overhead of frequent dynamic allocations/deallocations.

📖
thuật ngữ

Zero-Copy Optimization

Technique allowing operations to directly access data without intermediate copying between memory spaces, reducing CPU consumption and bandwidth.

📖
thuật ngữ

Register Tiling

Use of processor registers to temporarily store data tiles, minimizing accesses to slower hierarchical memory.

📖
thuật ngữ

Prefetching Instructions

Special instructions that preload data into cache before actual use, hiding memory latency through computation/access overlap.

📖
thuật ngữ

Memory Footprint Reduction

Set of techniques (quantization, pruning, compression) aimed at reducing the memory size of AI models without significant performance degradation.

📖
thuật ngữ

Shared Memory Utilization

Optimization of GPU shared memory usage as a fast and reusable data space between threads of the same block.

📖
thuật ngữ

Memory Bandwidth Saturation

State where memory access demands exceed the capacity of the memory bus, becoming the main bottleneck of computing performance.

📖
thuật ngữ

Page Migration

Dynamic movement of memory pages between NUMA nodes based on access patterns to optimize data locality.

📖
thuật ngữ

Memory-Aware Scheduling

Task scheduling that takes into account memory constraints and access patterns to minimize contentions and maximize parallelism.

📖
thuật ngữ

Cache-Oblivious Algorithms

Algorithms designed to perform efficiently on any cache hierarchy without requiring specific cache size parameters.

📖
thuật ngữ

Memory Hierarchy Optimization

Global strategy for data placement according to their access frequency and temporal criticality across the levels of the memory hierarchy.

📖
thuật ngữ

Tensor Core Memory Layout

Specific organization of tensors in memory to maximize the efficiency of matrix operations on NVIDIA Tensor Cores.

📖
thuật ngữ

Memory Access Divergence

Phenomenon where threads in a GPU warp access non-contiguous memory addresses, degrading performance through serialization of accesses.

📖
thuật ngữ

HBM (High Bandwidth Memory) Integration

3D stacked memory architecture offering superior bandwidth for intensive AI workloads, with specific optimization of access patterns.

📖
thuật ngữ

Memory-Mapped I/O Optimization

Technique allowing peripheral devices to directly access system memory, reducing copies and CPU overhead in AI pipelines.

🔍

Không tìm thấy kết quả