Trust Region Policy Optimization with Optimal Transport Discrepancies: Duality and Algorithm for Continuous Actions

Antonio Terpin, Nicolas Lanzetti, Batuhan Yardim, Florian Dörfler, Giorgia Ramponi. Trust Region Policy Optimization with Optimal Transport Discrepancies: Duality and Algorithm for Continuous Actions. In Sanmi Koyejo, S. Mohamed, A. Agarwal, Danielle Belgrave, K. Cho, A. Oh, editors, Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, NeurIPS 2022, New Orleans, LA, USA, November 28 - December 9, 2022. 2022. [doi]

Authors

Antonio Terpin

This author has not been identified. Look up 'Antonio Terpin' in Google

Nicolas Lanzetti

This author has not been identified. Look up 'Nicolas Lanzetti' in Google

Batuhan Yardim

This author has not been identified. Look up 'Batuhan Yardim' in Google

Florian Dörfler

This author has not been identified. Look up 'Florian Dörfler' in Google

Giorgia Ramponi

This author has not been identified. Look up 'Giorgia Ramponi' in Google