Ethical Robustness
Extraction Attack
Technique aimed at replicating the behavior of an AI model, including its biases and ethical vulnerabilities, by systematically querying it. These attacks can reveal and exploit the moral weaknesses of the original system.
← Back