Nitika Mathur, Timothy Baldwin, Trevor Cohn. Tangled up in BLEU: Reevaluating the Evaluation of Automatic Machine Translation Evaluation Metrics. In Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel R. Tetreault, editors, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020. pages 4984-4997, Association for Computational Linguistics, 2020. [doi]
Abstract is missing.