Top-K Off-Policy Correction for a REINFORCE Recommender System

Minmin Chen, Alex Beutel, Paul Covington, Sagar Jain, Francois Belletti, Ed H. Chi. Top-K Off-Policy Correction for a REINFORCE Recommender System. In J. Shane Culpepper, Alistair Moffat, Paul N. Bennett, Kristina Lerman, editors, Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, WSDM 2019, Melbourne, VIC, Australia, February 11-15, 2019. pages 456-464, ACM, 2019. [doi]

Authors

Minmin Chen

This author has not been identified. Look up 'Minmin Chen' in Google

Alex Beutel

This author has not been identified. Look up 'Alex Beutel' in Google

Paul Covington

This author has not been identified. Look up 'Paul Covington' in Google

Sagar Jain

This author has not been identified. Look up 'Sagar Jain' in Google

Francois Belletti

This author has not been identified. Look up 'Francois Belletti' in Google

Ed H. Chi

This author has not been identified. Look up 'Ed H. Chi' in Google