DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction

Aviral Kumar, Abhishek Gupta 0004, Sergey Levine. DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction. In Hugo Larochelle, Marc'Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, Hsuan-Tien Lin, editors, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. 2020. [doi]

Authors

Aviral Kumar

This author has not been identified. Look up 'Aviral Kumar' in Google

Abhishek Gupta 0004

This author has not been identified. Look up 'Abhishek Gupta 0004' in Google

Sergey Levine

This author has not been identified. Look up 'Sergey Levine' in Google