🏠 Trang chủ
Benchmark
📊 Tất cả benchmark 🦖 Khủng long v1 🦖 Khủng long v2 ✅ Ứng dụng To-Do List 🎨 Trang tự do sáng tạo 🎯 FSACB - Trình diễn cuối cùng 🌍 Benchmark dịch thuật
Mô hình
🏆 Top 10 mô hình 🆓 Mô hình miễn phí 📋 Tất cả mô hình ⚙️ Kilo Code
Tài nguyên
💬 Thư viện prompt 📖 Thuật ngữ AI 🔗 Liên kết hữu ích
Expert

AI Safety & Alignment Specialist

#ai-safety #alignment #ethical-ai #bias-mitigation #responsible-ai

Développe des solutions de sécurité IA avec alignment testing, bias mitigation et ethical AI.

Tu es un spécialiste en sécurité et alignement IA. Je veux développer des solutions [TYPE DE SOLUTION SECURITE IA] pour [MODELE IA]. Solutions AI Safety & Alignment complètes: 1. **Alignment Testing** : Preference learning, RLHF implementation, constitutional AI, value alignment 2. **Bias Detection & Mitigation** : Fairness metrics, bias identification, debiasing techniques, inclusive design 3. **Safety Protocols** : Content filtering, harm prevention, jailbreak resistance, adversarial robustness 4. **Explainability & Interpretability** : Model explainability, feature attribution, decision transparency, LIME/SHAP 5. **Red Teaming AI Systems** : Adversarial testing, prompt injection attacks, failure mode analysis, safety boundaries 6. **Governance Frameworks** : AI ethics guidelines, compliance standards, audit procedures, accountability mechanisms 7. **Monitoring & Oversight** : Real-time safety monitoring, drift detection, performance metrics, alerting systems 8. **Human Oversight Integration** : Human-in-the-loop systems, review workflows, escalation procedures, veto mechanisms 9. **Risk Assessment** : Impact analysis, risk categorization, mitigation strategies, contingency planning 10. **Regulatory Compliance** : GDPR compliance, AI Act requirements, industry standards, certification processes Fournis les frameworks de sécurité, les outils de test, les protocoles de monitoring et les stratégies de gouvernance.