LPPG-RL: Lexicographically Projected Policy Gradient Reinforcement Learning with Subproblem Exploration

Ruiyu Qiu, Rui Wang, Guanghui Yang, Xiang Li, Zhijiang Shao. LPPG-RL: Lexicographically Projected Policy Gradient Reinforcement Learning with Subproblem Exploration. In Sven Koenig, Chad Jenkins, Matthew E. Taylor, editors, Fortieth AAAI Conference on Artificial Intelligence, Thirty-Eighth Conference on Innovative Applications of Artificial Intelligence, Sixteenth Symposium on Educational Advances in Artificial Intelligence, AAAI 2026, Singapore, January 20-27, 2026. Pages 25009-25017, AAAI Press, 2026.

