NovelHopQA: Diagnosing Multi-Hop Reasoning Failures in Long Narrative Contexts

Abhay Gupta, Kevin Zhu, Vasu Sharma, Sean O'Brien, Michael Lu. NovelHopQA: Diagnosing Multi-Hop Reasoning Failures in Long Narrative Contexts. In Christos Christodoulopoulos 0001, Tanmoy Chakraborty 0002, Carolyn Rose, Violet Peng, editors, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, EMNLP 2025, Suzhou, China, November 4-9, 2025. pages 26134-26151, Association for Computational Linguistics, 2025. [doi]

Abstract

Abstract is missing.