Model Security - Glossariusz AI

📖

pojęcia

Membership Inference

Type of privacy attack where an adversary determines whether a specific data record was used in a model's training dataset, violating individuals' privacy.

📖

pojęcia

Inversion Attack

Attack that approximately reconstructs sensitive training data by analyzing the model's outputs, threatening the confidentiality of information used for its learning.

📖

pojęcia

Differential Privacy

Formal privacy framework ensuring that a model's output changes negligibly if a single individual is added to or removed from the training dataset.

📖

pojęcia

Gradient Masking Defense

Protection technique aimed at obscuring the model's gradients to prevent attackers from using gradient-based methods to generate effective adversarial attacks.

📖

pojęcia

Federated Learning

Decentralized training approach where the model is learned on local data without sharing it, reducing the risk of sensitive data leaks from a central repository.

📖

pojęcia

Backdoor in a Model

Vulnerability intentionally introduced into a model, often through data poisoning, that causes it to behave abnormally in the presence of a specific trigger.

📖

pojęcia

Model Robustness

Ability of a machine learning model to maintain its performance in the face of input data perturbations, including random noise and targeted adversarial attacks.

📖

pojęcia

Robustness Certification

Mathematical process providing a formal guarantee that a model cannot be fooled by input perturbations exceeding a certain defined magnitude.

📖

pojęcia

Transferability Attack

Phenomenon where an adversarial example, designed to deceive a specific model, also manages to mislead other models with different architectures or training data.

📖

pojęcia

Dataset Cleaning

Proactive process of identifying and removing potentially malicious or abnormal samples from a dataset before training to prevent poisoning attacks.

📖

pojęcia

Sensitivity Metric

Quantitative measure evaluating how much a model's predictions change in response to small modifications to its input data, indicating its vulnerability to attacks.

Słownik AI

Membership Inference

Inversion Attack

Differential Privacy

Gradient Masking Defense

Federated Learning

Backdoor in a Model

Model Robustness

Robustness Certification

Transferability Attack

Dataset Cleaning

Sensitivity Metric

Nie znaleziono wyników