Attention Head Analysis
Head Interpretability
Research field aimed at developing methods to understand, quantify and visualize the specific function of each attention head in order to demystify the internal workings of Transformer models.
← Zurück