Local interpretation methods
Integrated Gradients
A local attribution method for differentiable models that calculates the importance of a feature by integrating the gradient of the output with respect to that feature along a path from a baseline to the input.
← Indietro