Conservative Q-Learning for Offline Reinforcement Learning

Aviral Kumar, Aurick Zhou, George Tucker, Sergey Levine. Conservative Q-Learning for Offline Reinforcement Learning. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020.

Authors

Aviral Kumar

Aurick Zhou

George Tucker

Sergey Levine
