Hi-ToM: A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models

Yufan Wu, Yinghui He, Yilin Jia, Rada Mihalcea, Yulong Chen, Naihao Deng. Hi-ToM: A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models. In Houda Bouamor, Juan Pino, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023, pages 10691-10706. Association for Computational Linguistics, 2023.
