INVITE: a Testbed of Automatically Generated Invalid Questions to Evaluate Large Language Models for Hallucinations

Anil Ramakrishna, Rahul Gupta 0001, Jens Lehmann 0001, Morteza Ziyadi. INVITE: a Testbed of Automatically Generated Invalid Questions to Evaluate Large Language Models for Hallucinations. In Houda Bouamor, Juan Pino 0001, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. pages 5422-5429, Association for Computational Linguistics, 2023. [doi]

Abstract

Abstract is missing.