Batch Policy Gradient Methods for Improving Neural Conversation Models

Kirthevasan Kandasamy, Yoram Bachrach, Ryota Tomioka, Daniel Tarlow, David Carter. Batch Policy Gradient Methods for Improving Neural Conversation Models. In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24-26, 2017, Conference Track Proceedings. OpenReview.net, 2017.

