Knowledge Distillation
Attention-Based Distillation
Specific technique where the student learns to reproduce the teacher's attention maps, thus transferring knowledge about the important parts of the input data.
← Indietro