Chao Gao, Martin Müller 0003, Ryan Hayward. Adversarial Policy Gradient for Alternating Markov Games. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Workshop Track Proceedings. OpenReview.net, 2018. [doi]
Abstract is missing.