INVITE: a Testbed of Automatically Generated Invalid Questions to Evaluate Large Language Models for Hallucinations

Anil Ramakrishna, Rahul Gupta, Jens Lehmann, Morteza Ziyadi. INVITE: a Testbed of Automatically Generated Invalid Questions to Evaluate Large Language Models for Hallucinations. In Houda Bouamor, Juan Pino, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. Pages 5422-5429, Association for Computational Linguistics, 2023.

Authors

Anil Ramakrishna

Rahul Gupta

Jens Lehmann

Morteza Ziyadi
