Expert-based reward function training: the novel method to train sequence generators

Joji Toyama, Yusuke Iwasawa, Kotaro Nakayama, Yutaka Matsuo. Expert-based reward function training: the novel method to train sequence generators. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Workshop Track Proceedings. OpenReview.net, 2018. [doi]

Abstract

Abstract is missing.