Sridhar Mahadevan, Bo Liu. Sparse Q-learning with Mirror Descent. In Nando de Freitas, Kevin P. Murphy, editors, Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, Catalina Island, CA, USA, August 14-18, 2012. pages 564-573, AUAI Press, 2012. [doi]
Abstract is missing.