Hariram Veeramani, Surendrabikram Thapa, Usman Naseem. Measuring What Matters: Probing Transit Reasoning Consistency in Large Language Models. In Btissam Er Rahmadi, Sébastien Montella, Damien Graux, Hajira Jabeen, editors, Proceedings of the 1st Workshop on Knowledge Graphs & Agentic Systems Interplay (NORA 2025) co-located with the Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025), Mexico City, Mexico, December 1st, 2025. Volume 4162 of CEUR Workshop Proceedings, pages 95-108, CEUR-WS.org, 2025. [doi]
Abstract is missing.