Interpretability Metrics
Post-hoc Explainability Score
Quantitative evaluation of the quality of explanations generated after model training, combining fidelity, stability, and comprehensibility. This composite score allows different explanation techniques to be compared on the same model.
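As a rough illustration of how such a composite could be computed, the sketch below combines the three components as a weighted average. The entry does not specify an aggregation rule, so the weighted mean, the equal default weights, the assumption that each sub-score is already normalized to [0, 1], and the names `ExplanationScores` and `composite_score` are all hypothetical choices made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExplanationScores:
    """Sub-scores for one explanation technique (assumed normalized to [0, 1])."""
    fidelity: float
    stability: float
    comprehensibility: float


def composite_score(scores, weights=(1.0, 1.0, 1.0)):
    """Weighted mean of fidelity, stability, and comprehensibility.

    The weighted mean is an assumption of this sketch, not a standard definition.
    """
    values = (scores.fidelity, scores.stability, scores.comprehensibility)
    if any(not 0.0 <= v <= 1.0 for v in values):
        raise ValueError("each sub-score must lie in [0, 1]")
    if any(w < 0 for w in weights) or sum(weights) == 0:
        raise ValueError("weights must be non-negative and not all zero")
    return sum(w * v for w, v in zip(weights, values)) / sum(weights)


# Compare two hypothetical explanation techniques applied to the same model.
candidates = {
    "technique_a": ExplanationScores(fidelity=0.9, stability=0.6, comprehensibility=0.3),
    "technique_b": ExplanationScores(fidelity=0.7, stability=0.8, comprehensibility=0.6),
}
ranking = sorted(candidates, key=lambda name: composite_score(candidates[name]), reverse=True)
```

Because every technique is scored on the same scale, `ranking` orders them from best to worst under the chosen weights. Changing the weights (for example, emphasizing fidelity) can change the order, so the weights should be reported alongside the score.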