Alignment and Safety
Alignment Taxonomy
A structured classification of the types and dimensions of AI alignment, including value alignment, safety, robustness, and model interpretability.