Bayes Test of Precision, Recall, and F1 Measure for Comparison of Two Natural Language Processing Models

Ruibo Wang, Jihong Li. Bayes Test of Precision, Recall, and F1 Measure for Comparison of Two Natural Language Processing Models. In Anna Korhonen, David R. Traum, Lluís Màrquez, editors, Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, Florence, Italy, July 28 - August 2, 2019, Volume 1: Long Papers, pages 4135-4145. Association for Computational Linguistics, 2019.
