Kunbao Wu, Xinning Zhu, Yang Qin, Tieru Wang, Jianzhou Diao, Zheng Hu 0001. Beyond Return Conditioning: Multi-Scale Sequence Modeling and Advantage-Guided Policy Routing for Offline RL. In Meeyoung Cha, Chanyoung Park 0001, Noseong Park, Carl Yang 0001, Senjuti Basu Roy, Jessie Li, Jaap Kamps, Kijung Shin, Bryan Hooi, Lifang He 0001, editors, Proceedings of the 34th ACM International Conference on Information and Knowledge Management, CIKM 2025, Seoul, Republic of Korea, November 10-14, 2025. pages 3396-3405, ACM, 2025. [doi]
Abstract is missing.