Katherine M. Collins, Catherine Wong, Jiahai Feng, Megan Wei, Josh Tenenbaum. Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behavior in out-of-distribution reasoning tasks. In Jennifer Culbertson, Hugh Rabagliati, Verónica C. Ramenzoni, Andrew Perfors, editors, Proceedings of the 44th Annual Meeting of the Cognitive Science Society, CogSci 2022, Toronto, ON, Canada, July 27-30, 2022. cognitivesciencesociety.org, 2022.