Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Khanh Nguyen, Hal Daumé III, Jordan L. Boyd-Graber. Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback. In Martha Palmer, Rebecca Hwa, Sebastian Riedel, editors, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017, Copenhagen, Denmark, September 9-11, 2017. pages 1465-1475, Association for Computational Linguistics, 2017. [doi]

This author has not been identified. Look up 'Khanh Nguyen' in GoogleThis author has not been identified. Look up 'Hal Daumé III' in GoogleThis author has not been identified. Look up 'Jordan L. Boyd-Graber' in Google

runs on WebDSL