Deep Learning with Logged Bandit Feedback

Thorsten Joachims, Adith Swaminathan, Maarten de Rijke. Deep Learning with Logged Bandit Feedback. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings. OpenReview.net, 2018.
