BUMP: A Benchmark of Unfaithful Minimal Pairs for Meta-Evaluation of Faithfulness Metrics

Liang Ma, Shuyang Cao, Robert L. Logan IV, Di Lu, Shihao Ran, Ke Zhang, Joel R. Tetreault, Alejandro Jaimes. BUMP: A Benchmark of Unfaithful Minimal Pairs for Meta-Evaluation of Faithfulness Metrics. In Anna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki, editors, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pages 12788-12812. Association for Computational Linguistics, 2023.

Authors

Liang Ma

Shuyang Cao

Robert L. Logan IV

Di Lu

Shihao Ran

Ke Zhang

Joel R. Tetreault

Alejandro Jaimes
