Comparing the Evaluation and Production of Loophole Behavior in Humans and Large Language Models

Sonia Murthy, Kiera Parece, Sophie Bridgers, Peng Qian, Tomer D. Ullman. Comparing the Evaluation and Production of Loophole Behavior in Humans and Large Language Models. In Houda Bouamor, Juan Pino 0001, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. pages 4010-4025, Association for Computational Linguistics, 2023. [doi]

Abstract

Abstract is missing.