Comparing the Evaluation and Production of Loophole Behavior in Humans and Large Language Models

Sonia Murthy, Kiera Parece, Sophie Bridgers, Peng Qian, Tomer D. Ullman. Comparing the Evaluation and Production of Loophole Behavior in Humans and Large Language Models. In Houda Bouamor, Juan Pino 0001, Kalika Bali, editors, Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023. pages 4010-4025, Association for Computational Linguistics, 2023. [doi]

@inproceedings{MurthyPBQU23,
  title = {Comparing the Evaluation and Production of Loophole Behavior in Humans and Large Language Models},
  author = {Sonia Murthy and Kiera Parece and Sophie Bridgers and Peng Qian and Tomer D. Ullman},
  year = {2023},
  url = {https://aclanthology.org/2023.findings-emnlp.264},
  researchr = {https://researchr.org/publication/MurthyPBQU23},
  cites = {0},
  citedby = {0},
  pages = {4010-4025},
  booktitle = {Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023},
  editor = {Houda Bouamor and Juan Pino 0001 and Kalika Bali},
  publisher = {Association for Computational Linguistics},
  isbn = {979-8-89176-061-5},
}