Holmes ⌕ A Benchmark to Assess the Linguistic Competence of Language Models

Andreas Waldis, Yotam Perlitz, Leshem Choshen, Yufang Hou 0001, Iryna Gurevych. Holmes ⌕ A Benchmark to Assess the Linguistic Competence of Language Models. TACL, 12:1616-1647, 2024. [doi]

Abstract

Abstract is missing.