Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy

Boyi Liu, Qi Cai, Zhuoran Yang, Zhaoran Wang. Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d'Alché-Buc, Edward A. Fox, Roman Garnett, editors, Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, 8-14 December 2019, Vancouver, BC, Canada. pages 10564-10575, 2019.

Authors

Boyi Liu

Qi Cai

Zhuoran Yang

Zhaoran Wang
