Nonparametric Stochastic Compositional Gradient Descent for Q-Learning in Continuous Markov Decision Problems

Ekaterina V. Tolstaya, Alec Koppel, Ethan Stump, Alejandro Ribeiro. Nonparametric Stochastic Compositional Gradient Descent for Q-Learning in Continuous Markov Decision Problems. In 2018 Annual American Control Conference, ACC 2018, Milwaukee, WI, USA, June 27-29, 2018, pages 6608-6615. IEEE, 2018.

Abstract

Abstract is missing.