Ethical Robustness
Ethical Perturbation
Subtle modification of inputs or parameters specifically aimed at compromising the ethical decision-making mechanisms of an AI system. These attacks target moral judgment layers to induce behaviors not conforming to programmed values.
← Indietro