Holmes ⌕ A Benchmark to Assess the Linguistic Competence of Language Models

Andreas Waldis, Yotam Perlitz, Leshem Choshen, Yufang Hou, Iryna Gurevych. Holmes ⌕ A Benchmark to Assess the Linguistic Competence of Language Models. TACL, 12:1616-1647, 2024.

