Knowledge Distillation
Heterogeneous Knowledge Distillation
Approach where teacher and student have different architectures (CNN to Transformer, for example), requiring specific adaptation techniques for knowledge transfer.
← Geri