Q-learning for POMDP: An application to learning locomotion gaits

Tixian Wang, Amirhossein Taghvaei, Prashant G. Mehta. Q-learning for POMDP: An application to learning locomotion gaits. In 58th IEEE Conference on Decision and Control, CDC 2019, Nice, France, December 11-13, 2019. pages 2758-2763, IEEE, 2019. [doi]

Abstract

Abstract is missing.