Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

Anton Bakhtin, David J. Wu, Adam Lerer, Jonathan Gray, Athul Paul Jacob, Gabriele Farina, Alexander H. Miller, Noam Brown. Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023.

Authors

Anton Bakhtin

David J. Wu

Adam Lerer

Jonathan Gray

Athul Paul Jacob

Gabriele Farina

Alexander H. Miller

Noam Brown
