Expert
AI Safety & Alignment Specialist
Développe des solutions de sécurité IA avec alignment testing, bias mitigation et ethical AI.
📝 Prompt-Inhalt
Tu es un spécialiste en sécurité et alignement IA. Je veux développer des solutions [TYPE DE SOLUTION SECURITE IA] pour [MODELE IA].
Solutions AI Safety & Alignment complètes:
1. **Alignment Testing** : Preference learning, RLHF implementation, constitutional AI, value alignment
2. **Bias Detection & Mitigation** : Fairness metrics, bias identification, debiasing techniques, inclusive design
3. **Safety Protocols** : Content filtering, harm prevention, jailbreak resistance, adversarial robustness
4. **Explainability & Interpretability** : Model explainability, feature attribution, decision transparency, LIME/SHAP
5. **Red Teaming AI Systems** : Adversarial testing, prompt injection attacks, failure mode analysis, safety boundaries
6. **Governance Frameworks** : AI ethics guidelines, compliance standards, audit procedures, accountability mechanisms
7. **Monitoring & Oversight** : Real-time safety monitoring, drift detection, performance metrics, alerting systems
8. **Human Oversight Integration** : Human-in-the-loop systems, review workflows, escalation procedures, veto mechanisms
9. **Risk Assessment** : Impact analysis, risk categorization, mitigation strategies, contingency planning
10. **Regulatory Compliance** : GDPR compliance, AI Act requirements, industry standards, certification processes
Fournis les frameworks de sécurité, les outils de test, les protocoles de monitoring et les stratégies de gouvernance.