Offline Actor-Critic Reinforcement Learning Scales to Large Models

Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang 0001, Oliver Groth, Michael Bloesch, Thomas Lampe, Philemon Brakel, Sarah Bechtle, Steven Kapturowski, Roland Hafner, Nicolas Heess, Martin A. Riedmiller. Offline Actor-Critic Reinforcement Learning Scales to Large Models. In Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024. OpenReview.net, 2024. [doi]

Authors

Jost Tobias Springenberg

This author has not been identified. Look up 'Jost Tobias Springenberg' in Google

Abbas Abdolmaleki

This author has not been identified. Look up 'Abbas Abdolmaleki' in Google

Jingwei Zhang 0001

This author has not been identified. Look up 'Jingwei Zhang 0001' in Google

Oliver Groth

This author has not been identified. Look up 'Oliver Groth' in Google

Michael Bloesch

This author has not been identified. Look up 'Michael Bloesch' in Google

Thomas Lampe

This author has not been identified. Look up 'Thomas Lampe' in Google

Philemon Brakel

This author has not been identified. Look up 'Philemon Brakel' in Google

Sarah Bechtle

This author has not been identified. Look up 'Sarah Bechtle' in Google

Steven Kapturowski

This author has not been identified. Look up 'Steven Kapturowski' in Google

Roland Hafner

This author has not been identified. Look up 'Roland Hafner' in Google

Nicolas Heess

This author has not been identified. Look up 'Nicolas Heess' in Google

Martin A. Riedmiller

This author has not been identified. Look up 'Martin A. Riedmiller' in Google