M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering

Anand Subramanian, Viktor Schlegel, Abhinav Ramesh Kashyap, Thanh Tung Nguyen, Vijay Prakash Dwivedi, Stefan Winkler. M-QALM: A Benchmark to Assess Clinical Reading Comprehension and Knowledge Recall in Large Language Models via Question Answering. In Lun-Wei Ku, Andre Martins, Vivek Srikumar, editors, Findings of the Association for Computational Linguistics, ACL 2024, Bangkok, Thailand and virtual meeting, August 11-16, 2024, pages 4002-4042. Association for Computational Linguistics, 2024.
