Learning to Play No-Press Diplomacy with Best Response Policy Iteration


Thomas W. Anthony, Tom Eccles, Andrea Tacchetti, János Kramár, Ian M. Gemp, Thomas C. Hudson, Nicolas Porcel, Marc Lanctot, Julien Pérolat, Richard Everett, Satinder Singh, Thore Graepel, Yoram Bachrach. Learning to Play No-Press Diplomacy with Best Response Policy Iteration. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020.
