Evaluation and Metrics
BLEU (Bilingual Evaluation Understudy)
Automatic metric for evaluating the quality of machine translations by comparing the n-gram precision of the generated text against one or more human references. It measures the overlap of text segments between the model output and the reference.
← Kembali