Comparing the Evaluation and Production of Loophole Behavior in Humans and Large Language Models - researchr publication

researchr

You are not signed in
Sign in
Sign up

Sonia Murthy, Kiera Parece, Sophie Bridgers, Peng Qian, Tomer D. Ullman. Comparing the Evaluation and Production of Loophole Behavior in Humans and Large Language Models. In Houda Bouamor, Juan Pino 0001, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. pages 4010-4025, Association for Computational Linguistics, 2023. [doi]

Abstract is missing.

runs on WebDSL