An online primal-dual method for discounted Markov decision processes

Mengdi Wang, Yichen Chen. An online primal-dual method for discounted Markov decision processes. In 55th IEEE Conference on Decision and Control, CDC 2016, Las Vegas, NV, USA, December 12-14, 2016. pages 4516-4521, IEEE, 2016. [doi]

Abstract

Abstract is missing.