V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

H. Francis Song, Abbas Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, Jack W. Rae, Seb Noury, Arun Ahuja, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Dan Belov, Martin A. Riedmiller, Matthew M. Botvinick. V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control. In 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020. OpenReview.net, 2020.

