Alignment and Safety
Human Preference Data
Dataset collected from comparative human evaluations between different model responses, serving as a basis for alignment training and optimization.
← Indietro