Comparing Test Sets with Item Response Theory

Clara Vania, Phu Mon Htut, William Huang, Dhara A. Mungra, Richard Yuanzhe Pang, Jason Phang, Haokun Liu, KyungHyun Cho, Samuel R. Bowman. Comparing Test Sets with Item Response Theory. In Chengqing Zong, Fei Xia, Wenjie Li, Roberto Navigli, editors, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL/IJCNLP 2021 (Volume 1: Long Papers), Virtual Event, August 1-6, 2021, pages 1141-1158. Association for Computational Linguistics, 2021.
