Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Aviral Kumar, Justin Fu, Matthew Soh, George Tucker, Sergey Levine. Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d'Alché-Buc, Edward A. Fox, Roman Garnett, editors, Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, 8-14 December 2019, Vancouver, BC, Canada. pages 11761-11771, 2019. [doi]

This author has not been identified. Look up 'Aviral Kumar' in GoogleThis author has not been identified. Look up 'Justin Fu' in GoogleThis author has not been identified. Look up 'Matthew Soh' in GoogleThis author has not been identified. Look up 'George Tucker' in GoogleThis author has not been identified. Look up 'Sergey Levine' in Google

runs on WebDSL