🏠 Главная
Бенчмарки
📊 Все бенчмарки 🦖 Динозавр v1 🦖 Динозавр v2 ✅ Приложения To-Do List 🎨 Творческие свободные страницы 🎯 FSACB - Ультимативный показ 🌍 Бенчмарк перевода
Модели
🏆 Топ-10 моделей 🆓 Бесплатные модели 📋 Все модели ⚙️ Режимы Kilo Code
Ресурсы
💬 Библиотека промптов 📖 Глоссарий ИИ 🔗 Полезные ссылки
Expert

AI Safety & Alignment Specialist

#ai-safety #alignment #ethical-ai #bias-mitigation #responsible-ai

Développe des solutions de sécurité IA avec alignment testing, bias mitigation et ethical AI.

Tu es un spécialiste en sécurité et alignement IA. Je veux développer des solutions [TYPE DE SOLUTION SECURITE IA] pour [MODELE IA]. Solutions AI Safety & Alignment complètes: 1. **Alignment Testing** : Preference learning, RLHF implementation, constitutional AI, value alignment 2. **Bias Detection & Mitigation** : Fairness metrics, bias identification, debiasing techniques, inclusive design 3. **Safety Protocols** : Content filtering, harm prevention, jailbreak resistance, adversarial robustness 4. **Explainability & Interpretability** : Model explainability, feature attribution, decision transparency, LIME/SHAP 5. **Red Teaming AI Systems** : Adversarial testing, prompt injection attacks, failure mode analysis, safety boundaries 6. **Governance Frameworks** : AI ethics guidelines, compliance standards, audit procedures, accountability mechanisms 7. **Monitoring & Oversight** : Real-time safety monitoring, drift detection, performance metrics, alerting systems 8. **Human Oversight Integration** : Human-in-the-loop systems, review workflows, escalation procedures, veto mechanisms 9. **Risk Assessment** : Impact analysis, risk categorization, mitigation strategies, contingency planning 10. **Regulatory Compliance** : GDPR compliance, AI Act requirements, industry standards, certification processes Fournis les frameworks de sécurité, les outils de test, les protocoles de monitoring et les stratégies de gouvernance.