Establishing Constitutional AI Principles

#ai-safety #philosophy #ethics #alignment

Draft a set of constitutional principles to govern AGI behavior in edge-case moral dilemmas.

📝 Prompt content

Define a comprehensive 'Constitution' for an Artificial General Intelligence system. The constitution must address the 'Alignment Problem' by explicitly weighting human rights, utilitarian outcomes, and the preservation of human agency. Create a decision-tree framework that the AI can use to resolve conflicts between these values in edge-case scenarios, such as the Trolley Problem or triage in medical resource scarcity. Explain how you would use RLHF (Reinforcement Learning from Human Feedback) to train the model to adhere to this constitution without overfitting to specific examples.
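As a rough illustration of the kind of decision-tree framework the prompt asks for, the sketch below checks rights first, then agency, then falls back to a weighted score. Everything in it is an assumption made for illustration: the `Option` fields, the `WEIGHTS` values, the ordering of the checks, and the triage options are hypothetical and are not prescribed by the prompt.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Option:
    """One candidate action in a dilemma (all fields are hypothetical)."""
    name: str
    violates_rights: bool     # does the action violate a basic human right?
    expected_welfare: float   # utilitarian estimate, higher is better
    preserves_agency: bool    # do affected people keep a meaningful choice?


# Illustrative weights only; a real constitution would have to justify these.
WEIGHTS = {"rights": 0.5, "welfare": 0.3, "agency": 0.2}


def score(option: Option, max_welfare: float) -> float:
    """Weighted sum of the three values, each scaled to at most 1."""
    rights = 0.0 if option.violates_rights else 1.0
    welfare = option.expected_welfare / max_welfare if max_welfare > 0 else 0.0
    agency = 1.0 if option.preserves_agency else 0.0
    return (WEIGHTS["rights"] * rights
            + WEIGHTS["welfare"] * welfare
            + WEIGHTS["agency"] * agency)


def resolve(options: list[Option]) -> Option:
    """Walk the tree: rights filter, then agency filter, then weighted score."""
    if not options:
        raise ValueError("at least one option is required")
    # Node 1: keep only rights-respecting options, if any exist.
    permitted = [o for o in options if not o.violates_rights]
    candidates = permitted or list(options)
    # Node 2: among those, keep options that preserve agency, if any exist.
    agency_ok = [o for o in candidates if o.preserves_agency]
    candidates = agency_ok or candidates
    # Node 3: break remaining ties with the weighted score.
    max_welfare = max(o.expected_welfare for o in candidates)
    return max(candidates, key=lambda o: score(o, max_welfare))


# Hypothetical medical-triage options.
triage = [
    Option("first_come_first_served", False, 2.0, True),
    Option("max_lives_override_consent", True, 5.0, False),
    Option("max_lives_with_consent", False, 4.0, True),
]
print(resolve(triage).name)
```

Filtering before scoring means a high welfare estimate cannot buy back a rights violation when a rights-respecting option exists; a purely weighted sum would allow that trade. Which behavior is appropriate is exactly the kind of question the constitution would need to settle.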

General
