The Pitfalls of Regularization in Off-Policy TD Learning

Gaurav Manek, J. Zico Kolter. The Pitfalls of Regularization in Off-Policy TD Learning. In Sanmi Koyejo, S. Mohamed, A. Agarwal, Danielle Belgrave, K. Cho, A. Oh, editors, Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, NeurIPS 2022, New Orleans, LA, USA, November 28 - December 9, 2022.

Authors

Gaurav Manek

J. Zico Kolter
