Alignment and Safety
Preference Learning
A machine learning domain in which models learn from comparisons between options, such as which of two candidate outputs a person prefers, in order to capture human preferences and align model behavior with them.
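
To make "learning from comparisons" concrete, here is a minimal sketch that fits one score per option from pairwise judgments. It assumes a Bradley-Terry model, a common choice for this kind of data but not one named in this entry. The function names, the toy comparison data, and the hyperparameters are all hypothetical and chosen only for illustration.

```python
import math


def fit_bradley_terry(comparisons, n_items, lr=0.1, epochs=500):
    """Fit one latent score per item from (winner, loser) index pairs.

    Assumption: the probability that item a is preferred over item b is
    sigmoid(score[a] - score[b]) (the Bradley-Terry model).
    """
    scores = [0.0] * n_items
    for _ in range(epochs):
        grads = [0.0] * n_items
        for winner, loser in comparisons:
            # Model's current probability that the observed winner is preferred.
            p = 1.0 / (1.0 + math.exp(-(scores[winner] - scores[loser])))
            # Gradient of log p with respect to the two scores involved.
            grads[winner] += 1.0 - p
            grads[loser] -= 1.0 - p
        # Plain gradient ascent on the log-likelihood of the comparisons.
        scores = [s + lr * g for s, g in zip(scores, grads)]
    return scores


def preference_probability(scores, a, b):
    """Probability, under the fitted scores, that item a is preferred over b."""
    return 1.0 / (1.0 + math.exp(-(scores[a] - scores[b])))


# Hypothetical toy data: each pair is (preferred option, rejected option).
comparisons = [(0, 1), (0, 1), (1, 0), (0, 2), (1, 2), (1, 2), (2, 1)]
scores = fit_bradley_terry(comparisons, n_items=3)
```

With this toy data, option 0 ends up with the highest score and option 2 with the lowest. In practice the per-item scores would typically be replaced by a learned function of the option itself, but the comparison-based objective keeps the same shape.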