Zhiqiang Pi, Annapurna Vadaparty, Benjamin Bergen 0001, Cameron R. Jones. Dissecting the Ullman Variations with a SCALPEL: Why do LLMs fail at Trivial Alterations to the False Belief Task?. In David Barner, Neil Bramley, Azzurra Ruggeri, Caren M. Walker, editors, Proceedings of the 47th Annual Meeting of the Cognitive Science Society, CogSci 2025, San Francisco, CA, USA, July 30 - August 2, 2025. cognitivesciencesociety.org, 2025. [doi]
Abstract is missing.