An operator view of policy gradient methods

Dibya Ghosh, Marlos C. Machado, Nicolas Le Roux. An operator view of policy gradient methods. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020.

Authors

Dibya Ghosh

Marlos C. Machado

Nicolas Le Roux