Christy D. Bergman, Kourosh Hakhamaneshi. Hands-on Reinforcement Learning for Recommender Systems - From Bandits to SlateQ to Offline RL with Ray RLlib. In Jennifer Golbeck, F. Maxwell Harper, Vanessa Murdock 0001, Michael D. Ekstrand, Bracha Shapira, Justin Basilico, Keld T. Lundgaard, Even Oldridge, editors, RecSys '22: Sixteenth ACM Conference on Recommender Systems, Seattle, WA, USA, September 18 - 23, 2022. pages 700-701, ACM, 2022. [doi]
Abstract is missing.