Batch Policy Gradient Methods for Improving Neural Conversation Models

Kirthevasan Kandasamy, Yoram Bachrach, Ryota Tomioka, Daniel Tarlow, David Carter. Batch Policy Gradient Methods for Improving Neural Conversation Models. In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24-26, 2017, Conference Track Proceedings. OpenReview.net, 2017.

