ARBench: Algorithmic Reasoner or API Alchemist? Evaluating LLMs Beyond API Calls

Renbiao Liu, Chao-Zeng Ma, Anqi Li, Hui Sun 0003, Xin-Ye Li, Ming Li 0005. ARBench: Algorithmic Reasoner or API Alchemist? Evaluating LLMs Beyond API Calls. In Sven Koenig, Chad Jenkins, Matthew E. Taylor, editors, Fortieth AAAI Conference on Artificial Intelligence, Thirty-Eighth Conference on Innovative Applications of Artificial Intelligence, Sixteenth Symposium on Educational Advances in Artificial Intelligence, AAAI 2026, Singapore, January 20-27, 2026. pages 32105-32113, AAAI Press, 2026. [doi]

Abstract

Abstract is missing.