AI Glossary

Complete dictionary of Artificial Intelligence

162 categories
2,032 subcategories
23,060 terms

PCIe Bandwidth

Maximum data transfer rate through the PCIe bus, crucial for communication speed between CPU and GPU in AI workloads.
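
As a rough illustration of where the maximum comes from, the theoretical per-direction figure can be derived from the per-lane signalling rate and the line encoding of each PCIe generation. The function name below is hypothetical, and real transfers achieve less than this because of protocol overhead.

```python
# Per-lane signalling rate (GT/s) and line-encoding efficiency per PCIe generation.
PCIE_GENERATIONS = {
    1: (2.5, 8 / 10),      # 8b/10b encoding
    2: (5.0, 8 / 10),      # 8b/10b encoding
    3: (8.0, 128 / 130),   # 128b/130b encoding
    4: (16.0, 128 / 130),
    5: (32.0, 128 / 130),
}


def pcie_bandwidth_gb_per_s(generation: int, lanes: int = 16) -> float:
    """Theoretical one-direction bandwidth in GB/s (10^9 bytes per second)."""
    gt_per_s, efficiency = PCIE_GENERATIONS[generation]
    usable_bits_per_s = gt_per_s * 1e9 * efficiency * lanes
    return usable_bits_per_s / 8 / 1e9


if __name__ == "__main__":
    for gen in sorted(PCIE_GENERATIONS):
        print(f"PCIe {gen}.0 x16: {pcie_bandwidth_gb_per_s(gen):.2f} GB/s")
```

An x16 link roughly doubles with each generation, which is why the slot a GPU sits in can matter as much as the GPU itself for transfer-bound workloads.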

NVMe over Fabrics

Protocol that extends the NVMe command set across a network fabric (such as RDMA, Fibre Channel, or TCP), giving remote storage latency close to that of local NVMe drives for massive AI datasets.

GPUDirect Storage

NVIDIA technology enabling direct data transfer from storage to GPU memory, bypassing the bounce buffer in CPU system memory.

Memory Pinning

Process of locking memory pages in RAM so the operating system cannot swap them out or move them, which lets the GPU reach them by DMA (Direct Memory Access) at a stable physical address.

Zero-Copy

Optimization technique where data is transferred directly between devices without intermediate copying in CPU memory.

Tensor Core Throughput

Rate at which GPU Tensor Cores execute matrix operations; the achieved rate is often limited by how fast memory can feed them data.
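
The memory limit can be made concrete with a roofline-style estimate, sketched here under stated assumptions: attainable throughput is the smaller of the compute peak and memory bandwidth multiplied by arithmetic intensity (FLOPs performed per byte moved). The hardware numbers in the demo are hypothetical placeholders, not the specification of any real GPU.

```python
def attainable_tflops(peak_tflops: float,
                      mem_bandwidth_tb_s: float,
                      flops_per_byte: float) -> float:
    """Roofline estimate: min(compute peak, bandwidth * arithmetic intensity)."""
    memory_bound_tflops = mem_bandwidth_tb_s * flops_per_byte
    return min(peak_tflops, memory_bound_tflops)


if __name__ == "__main__":
    # Hypothetical accelerator: 300 TFLOP/s peak, 2 TB/s memory bandwidth.
    peak, bandwidth = 300.0, 2.0
    for intensity in (10, 50, 150, 500):
        tflops = attainable_tflops(peak, bandwidth, intensity)
        bound = "memory-bound" if tflops < peak else "compute-bound"
        print(f"{intensity:>4} FLOP/byte -> {tflops:6.1f} TFLOP/s ({bound})")
```

With these placeholder figures, a kernel needs at least 150 FLOPs per byte before the Tensor Cores, rather than memory, become the limit.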

Data Pipeline Parallelism

Strategy where data loading, preprocessing, and transfer execute in parallel with GPU computation to hide latencies.
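
A minimal CPU-only sketch of the idea, assuming a background thread as the loader and a bounded queue as the prefetch buffer. `load_batch` and `compute` are hypothetical stand-ins that just sleep; in a real pipeline they would be the I/O plus preprocessing step and the GPU step.

```python
import queue
import threading
import time


def load_batch(index):
    """Stand-in for disk read + preprocessing (hypothetical workload)."""
    time.sleep(0.01)
    return [index] * 4


def compute(batch):
    """Stand-in for the GPU step (hypothetical workload)."""
    time.sleep(0.01)
    return sum(batch)


def run_pipelined(num_batches, prefetch_depth=2):
    """Load batches in a background thread while the main thread computes."""
    batches = queue.Queue(maxsize=prefetch_depth)  # bounded prefetch buffer
    sentinel = object()                            # marks the end of the data

    def producer():
        for i in range(num_batches):
            batches.put(load_batch(i))  # blocks when the buffer is full
        batches.put(sentinel)

    threading.Thread(target=producer, daemon=True).start()

    results = []
    while True:
        batch = batches.get()
        if batch is sentinel:
            break
        results.append(compute(batch))
    return results


if __name__ == "__main__":
    print(run_pipelined(5))
```

The bounded queue is the design choice that matters: it lets the loader run ahead by `prefetch_depth` batches without letting it consume unbounded memory.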

Prefetching

Loading data into GPU memory before the computation needs it, so the GPU never sits idle waiting for input.

Host-to-Device Latency

Time required to initiate and complete a data transfer from the CPU (host) to the GPU (device).
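
One simple way to reason about this is a fixed start-up cost plus a size-dependent term. The sketch below uses that model with hypothetical defaults (10 µs start-up latency, 25 GB/s sustained bandwidth); both values are assumptions for illustration and should be replaced with measurements from the target system.

```python
def transfer_time_us(num_bytes: int,
                     latency_us: float = 10.0,
                     bandwidth_gb_s: float = 25.0) -> float:
    """Estimated transfer time in microseconds: fixed latency + bytes / bandwidth."""
    return latency_us + num_bytes / (bandwidth_gb_s * 1e9) * 1e6


def effective_bandwidth_gb_s(num_bytes: int, **kwargs) -> float:
    """Bandwidth actually achieved once the fixed latency is included."""
    return num_bytes / (transfer_time_us(num_bytes, **kwargs) * 1e-6) / 1e9


if __name__ == "__main__":
    for size in (4 * 1024, 1024 ** 2, 256 * 1024 ** 2):
        print(f"{size:>11} bytes: {transfer_time_us(size):10.1f} us, "
              f"{effective_bandwidth_gb_s(size):6.2f} GB/s effective")
```

Under this model, small transfers are dominated by the fixed cost, which is the usual argument for batching many small copies into one large copy.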

CUDA Stream

Queue of GPU operations that execute in the order they were issued; operations placed in different streams may run concurrently, which allows transfers and computations to overlap.
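
The ordering rule can be illustrated with a toy CPU-only model. This is not the CUDA API: `FakeStream` is a hypothetical class that uses a single worker thread to mimic in-order execution within a stream, and two instances to mimic overlap across streams. Cross-stream dependencies (CUDA events) are deliberately left out.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


class FakeStream:
    """Toy stand-in for a stream: one worker, so work runs in submission order."""

    def __init__(self):
        self._pool = ThreadPoolExecutor(max_workers=1)
        self._last = None

    def enqueue(self, fn, *args):
        """Queue an operation and return immediately, like an async launch."""
        self._last = self._pool.submit(fn, *args)

    def synchronize(self):
        """Block until everything queued so far has finished."""
        if self._last is not None:
            self._last.result()

    def close(self):
        self._pool.shutdown(wait=True)


def run_demo(num_chunks=4, step_s=0.02):
    log, lock = [], threading.Lock()

    def op(label):
        time.sleep(step_s)  # pretend to copy or compute
        with lock:
            log.append(label)

    copy_stream, compute_stream = FakeStream(), FakeStream()
    for i in range(num_chunks):
        copy_stream.enqueue(op, f"copy{i}")
        compute_stream.enqueue(op, f"kernel{i}")
    copy_stream.synchronize()
    compute_stream.synchronize()
    copy_stream.close()
    compute_stream.close()
    return log


if __name__ == "__main__":
    print(run_demo())
```

In the resulting log, the copies always appear in issue order and so do the kernels, while the interleaving between the two streams is not fixed.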

NUMA Awareness

Optimization of memory allocations to respect the NUMA topology of multi-CPU servers, reducing access latencies.
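
On Linux, one hedged way to act on the topology is to restrict a process to the CPUs of a single NUMA node, so that memory it touches is typically allocated on that node under the default local-allocation policy. The sketch below parses the kernel's `cpulist` format and pins the current process. The helper names are hypothetical, and `pin_to_node` assumes a Linux system that exposes `/sys/devices/system/node`.

```python
import os


def parse_cpulist(text: str) -> set:
    """Parse a Linux cpulist string such as '0-3,8-11' into a set of CPU ids."""
    cpus = set()
    for part in text.strip().split(","):
        if not part:
            continue
        if "-" in part:
            low, high = part.split("-")
            cpus.update(range(int(low), int(high) + 1))
        else:
            cpus.add(int(part))
    return cpus


def pin_to_node(node: int) -> set:
    """Pin the current process to the CPUs of one NUMA node (Linux only)."""
    path = f"/sys/devices/system/node/node{node}/cpulist"
    with open(path) as f:
        cpus = parse_cpulist(f.read())
    os.sched_setaffinity(0, cpus)  # pid 0 means the calling process
    return cpus


if __name__ == "__main__":
    print(sorted(parse_cpulist("0-3,8-11")))
```

For a data-loading process that feeds a GPU, the node to choose is usually the one the GPU's PCIe slot is attached to.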

GPUDirect RDMA

Technology enabling direct data transfer between the GPU memories of different nodes via RDMA: the network adapter reads and writes GPU memory directly instead of staging data through CPU memory.

Asynchronous Data Transfer

Data transfer executed in parallel with GPU computations, using CUDA streams to hide latencies.

Page-Locked Memory

Non-pageable system memory, required for high-bandwidth asynchronous DMA transfers to the GPU.
