Knowledge Distillation
Soft Targets
Output probabilities from the teacher model before applying the argmax function, containing information about inter-class relationships that hard labels don't capture.
← Quay lại