A new convergent variant of Q-learning with linear function approximation

Diogo Carvalho, Francisco S. Melo, Pedro Santos 0001. A new convergent variant of Q-learning with linear function approximation. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020. [doi]

Authors

Diogo Carvalho

This author has not been identified. Look up 'Diogo Carvalho' in Google

Francisco S. Melo

This author has not been identified. Look up 'Francisco S. Melo' in Google

Pedro Santos 0001

This author has not been identified. Look up 'Pedro Santos 0001' in Google