Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

researchr

You are not signed in
Sign in
Sign up

Anton Bakhtin, David J. Wu 0002, Adam Lerer, Jonathan Gray, Athul Paul Jacob, Gabriele Farina, Alexander H. Miller, Noam Brown. Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net, 2023. [doi]

@inproceedings{Bakhtin0LGJFMB23,
  title = {Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning},
  author = {Anton Bakhtin and David J. Wu 0002 and Adam Lerer and Jonathan Gray and Athul Paul Jacob and Gabriele Farina and Alexander H. Miller and Noam Brown},
  year = {2023},
  url = {https://openreview.net/pdf?id=F61FwJTZhb},
  researchr = {https://researchr.org/publication/Bakhtin0LGJFMB23},
  cites = {0},
  citedby = {0},
  booktitle = {The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023},
  publisher = {OpenReview.net},
}

External Links

Cite Key

Statistics

PDF

Researchr

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning