🏠 Beranda
Benchmark
📊 Semua Benchmark 🦖 Dinosaurus v1 🦖 Dinosaurus v2 ✅ Aplikasi To-Do List 🎨 Halaman Bebas Kreatif 🎯 FSACB - Showcase Utama 🌍 Benchmark Terjemahan
Model
🏆 Top 10 Model 🆓 Model Gratis 📋 Semua Model ⚙️ Kilo Code
Sumber Daya
💬 Perpustakaan Prompt 📖 Glosarium AI 🔗 Tautan Berguna

Glosarium AI

Kamus lengkap Kecerdasan Buatan

162
kategori
2.032
subkategori
23.060
istilah
📖
istilah

Textual Adversarial Attack

Technique consisting of subtly modifying an input text to mislead an NLP model while preserving semantics for a human reader.

📖
istilah

Character-Level Perturbation

Modification of individual characters in text (insertion, deletion, substitution) to create adversarial examples that are difficult to detect.

📖
istilah

Lexical Substitution Attack

Replacement of words with semantically close synonyms that change the NLP model's prediction in a targeted manner.

📖
istilah

Universal Adversarial Triggers

Specific sequences of words or characters that, when inserted into any text, systematically cause a classification error by the model.

📖
istilah

Black-Box Attack

Attack conducted without knowledge of the model's internal parameters, using only the model's predictions to construct adversarial examples.

📖
istilah

White-Box Attack

Attack exploiting complete knowledge of the model's architecture and gradients to generate optimal perturbations.

📖
istilah

Transfer Attack

Generation of adversarial examples on a source model that retain their effectiveness on unknown target models.

📖
istilah

Semantic Preservation

Constraint ensuring that textual perturbations do not alter the overall meaning of the text for a human reader.

📖
istilah

Data Poisoning Attack

Malicious insertion of corrupted examples into the training set to degrade model performance during its learning phase.

📖
istilah

Syntactic Perturbation

Modification of the grammatical or syntactic structure of a sentence while preserving its semantic meaning to deceive NLP models.

📖
istilah

Gradient Masking

Defense technique that modifies the model's gradient to prevent optimization-based attacks, without necessarily improving actual robustness.

📖
istilah

Query Attack

Black-box attack that optimizes perturbations by iteratively querying the model and analyzing its responses.

📖
istilah

Semantic Robustness

Ability of an NLP model to maintain consistent predictions in the face of textual variations preserving meaning but altering form.

📖
istilah

Adversarial Search Space

Set of all possible text modifications that can be applied to generate valid adversarial examples.

📖
istilah

Perturbation Score

Quantitative metric evaluating the magnitude of modification applied to the original text to create an adversarial example.

📖
istilah

Multi-objective Attack

Adversarial attack seeking simultaneously to deceive the model while optimizing multiple constraints such as readability or semantic preservation.

📖
istilah

Adversarial Attack Detection

Defensive mechanism identifying potentially adversarial inputs based on statistical or behavioral anomalies in predictions.

🔍

Tidak ada hasil ditemukan