Trust Region Policy Optimization with Optimal Transport Discrepancies: Duality and Algorithm for Continuous Actions

Antonio Terpin, Nicolas Lanzetti, Batuhan Yardim, Florian Dörfler, Giorgia Ramponi. Trust Region Policy Optimization with Optimal Transport Discrepancies: Duality and Algorithm for Continuous Actions. In Sanmi Koyejo, S. Mohamed, A. Agarwal, Danielle Belgrave, K. Cho, A. Oh, editors, Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, NeurIPS 2022, New Orleans, LA, USA, November 28 - December 9, 2022. 2022.
