Yu Liu, Yanbing Liu 0007, Fangfang Yuan, Cong Cao 0001, Youbang Sun, Kun Peng, Weizhuo Chen, Jianjun Li 0010, Zhiyuan Ma 0005. OPERA: A Reinforcement Learning-Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval. In Sven Koenig, Chad Jenkins, Matthew E. Taylor, editors, Fortieth AAAI Conference on Artificial Intelligence, Thirty-Eighth Conference on Innovative Applications of Artificial Intelligence, Sixteenth Symposium on Educational Advances in Artificial Intelligence, AAAI 2026, Singapore, January 20-27, 2026. pages 32258-32266, AAAI Press, 2026. [doi]
Abstract is missing.