Residual Policy Optimization With Trust Region Constraints: A Learning Framework for Stable and Agile Wheel-Legged Locomotion

Naifeng He, Zhong Yang, Xiaoliang Fan, Wenqiang Que, Siyang Liu, Hongyu Xu, Chunguang Bu, Bi Zhang. Residual Policy Optimization With Trust Region Constraints: A Learning Framework for Stable and Agile Wheel-Legged Locomotion. IEEE T. Automation Science and Engineering, 22:23352-23365, 2025. [doi]

Abstract

Abstract is missing.