Learning Policies for Markov Decision Processes From Data

Manjesh Kumar Hanawal, Hao Liu 0023, Henghui Zhu, Ioannis Ch. Paschalidis. Learning Policies for Markov Decision Processes From Data. IEEE Trans. Automat. Contr., 64(6):2298-2309, 2019. [doi]

Abstract

Abstract is missing.