Learning parameterized policies for Markov decision processes through demonstrations

Manjesh Kumar Hanawal, Hao Liu, Henghui Zhu, Ioannis Ch. Paschalidis. Learning parameterized policies for Markov decision processes through demonstrations. In 55th IEEE Conference on Decision and Control, CDC 2016, Las Vegas, NV, USA, December 12-14, 2016. pages 7087-7092, IEEE, 2016. [doi]

Abstract

Abstract is missing.