🏠 홈
벤치마크
📊 모든 벤치마크 🦖 공룡 v1 🦖 공룡 v2 ✅ 할 일 목록 앱 🎨 창의적인 자유 페이지 🎯 FSACB - 궁극의 쇼케이스 🌍 번역 벤치마크
모델
🏆 톱 10 모델 🆓 무료 모델 📋 모든 모델 ⚙️ 킬로 코드 모드
리소스
💬 프롬프트 라이브러리 📖 AI 용어 사전 🔗 유용한 링크
advanced

Establishing Constitutional AI Principles

#ai-safety #philosophy #ethics #alignment

Draft a set of constitutional principles to govern AGI behavior in edge-case moral dilemmas.

Define a comprehensive 'Constitution' for an Artificial General Intelligence system. The constitution must address the 'Alignment Problem' by explicitly weighting human rights, utilitarian outcomes, and the preservation of human agency. Create a decision-tree framework that the AI can use to resolve conflicts between these values in edge-case scenarios, such as the Trolley Problem or triage in medical resource scarcity. Explain how you would use RLHF (Reinforcement Learning from Human Feedback) to train the model to adhere to this constitution without overfitting to specific examples.