Contrastive interpretability
Diagnosis by Difference
Method that analyzes discrepancies in a model's behavior between two contexts or datasets (e.g., before and after drift) to understand changes in its decision logic.
← Wstecz