BECEL: Benchmark for Consistency Evaluation of Language Models

Myeongjun Jang, Deuk Sin Kwon, Thomas Lukasiewicz. BECEL: Benchmark for Consistency Evaluation of Language Models. In Nicoletta Calzolari, Chu-Ren Huang, Hansaem Kim, James Pustejovsky, Leo Wanner, Key-Sun Choi, Pum-Mo Ryu, Hsin-Hsi Chen, Lucia Donatelli, Heng Ji, Sadao Kurohashi, Patrizia Paggio, Nianwen Xue, Seokhwan Kim, YoungGyun Hahm, Zhong He, Tony Kyungil Lee, Enrico Santus, Francis Bond, Seung-Hoon Na, editors, Proceedings of the 29th International Conference on Computational Linguistics, COLING 2022, Gyeongju, Republic of Korea, October 12-17, 2022. pages 3680-3696, International Committee on Computational Linguistics, 2022. [doi]

Authors

Myeongjun Jang

This author has not been identified. Look up 'Myeongjun Jang' in Google

Deuk Sin Kwon

This author has not been identified. Look up 'Deuk Sin Kwon' in Google

Thomas Lukasiewicz

This author has not been identified. Look up 'Thomas Lukasiewicz' in Google