Iterative Empirical Game Solving via Single Policy Best Response

Max Olan Smith, Thomas Anthony, Michael P. Wellman. Iterative Empirical Game Solving via Single Policy Best Response. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021. [doi]

Abstract

Abstract is missing.