Open
Description
In the "evaluation metrics" section, discuss how ROUGE was designed for summarization rather than machine translation, and the pitfalls of using it to score MT output.
https://stats.stackexchange.com/questions/301626/interpreting-rouge-scores
https://towardsdatascience.com/to-rouge-or-not-to-rouge-6a5f3552ea45
https://hyperskill.org/learn/step/29669
https://en.wikipedia.org/wiki/ROUGE_(metric)
https://aclanthology.org/P04-1077/ "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics": this is the paper that covers the ROUGE-L variant, and unlike the rest of ROUGE it is actually aimed at MT.
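A rough sketch of the LCS-based ROUGE-L F-score could make one of the pitfalls concrete for the write-up. This is not the reference implementation: the names `lcs_length` and `rouge_l`, the lowercased whitespace tokenization, and the default `beta=1.0` are all assumptions chosen for illustration.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(reference, candidate, beta=1.0):
    """LCS-based F-score; beta weights recall relative to precision.

    Assumption: tokens are lowercased, whitespace-separated words.
    """
    ref = reference.lower().split()
    cand = candidate.lower().split()
    if not ref or not cand:
        return 0.0
    lcs = lcs_length(ref, cand)
    if lcs == 0:
        return 0.0
    recall = lcs / len(ref)
    precision = lcs / len(cand)
    return (1 + beta**2) * recall * precision / (recall + beta**2 * precision)


# A candidate that reverses the meaning still scores highly,
# because only one inserted token breaks the overlap.
score = rouge_l("the cat is on the mat", "the cat is not on the mat")
print(round(score, 3))  # 0.923
```

Here the negated sentence gets about 0.92 even though it says the opposite of the reference, which is the kind of surface-overlap pitfall the section could call out.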