Q-learning for POMDP: An application to learning locomotion gaits

Tixian Wang, Amirhossein Taghvaei, Prashant G. Mehta. Q-learning for POMDP: An application to learning locomotion gaits. In 58th IEEE Conference on Decision and Control, CDC 2019, Nice, France, December 11-13, 2019, pages 2758-2763. IEEE, 2019.

Authors

Tixian Wang

Amirhossein Taghvaei

Prashant G. Mehta
