Evaluation of Question Answering Systems: Complexity of Judging a Natural Language

Amer Farea, Zhen Yang, Kien Duong, Nadeesha Perera, Frank Emmert-Streib. Evaluation of Question Answering Systems: Complexity of Judging a Natural Language. ACM Computing Surveys, 58(1), January 2026.
