Agreement is overrated: A plea for correlation to assess human evaluation reliability

Jacopo Amidei, Paul Piwek, Alistair Willis. Agreement is overrated: A plea for correlation to assess human evaluation reliability. In Kees van Deemter, Chenghua Lin, Hiroya Takamura, editors, Proceedings of the 12th International Conference on Natural Language Generation, INLG 2019, Tokyo, Japan, October 29 - November 1, 2019. pages 344-354, Association for Computational Linguistics, 2019. [doi]

Authors

Jacopo Amidei

This author has not been identified. Look up 'Jacopo Amidei' in Google

Paul Piwek

This author has not been identified. Look up 'Paul Piwek' in Google

Alistair Willis

This author has not been identified. Look up 'Alistair Willis' in Google