Cold-Start Reinforcement Learning with Softmax Policy Gradient

Nan Ding, Radu Soricut. Cold-Start Reinforcement Learning with Softmax Policy Gradient. In Isabelle Guyon, Ulrike von Luxburg, Samy Bengio, Hanna M. Wallach, Rob Fergus, S. V. N. Vishwanathan, Roman Garnett, editors, Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 4-9 December 2017, Long Beach, CA, USA. pages 2814-2823, 2017.
