Bellman-consistent Pessimism for Offline Reinforcement Learning

Tengyang Xie, Ching-An Cheng, Nan Jiang 0008, Paul Mineiro, Alekh Agarwal. Bellman-consistent Pessimism for Offline Reinforcement Learning. In Marc'Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, Jennifer Wortman Vaughan, editors, Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. pages 6683-6694, 2021. [doi]

Authors

Tengyang Xie

This author has not been identified. Look up 'Tengyang Xie' in Google

Ching-An Cheng

This author has not been identified. Look up 'Ching-An Cheng' in Google

Nan Jiang 0008

This author has not been identified. Look up 'Nan Jiang 0008' in Google

Paul Mineiro

This author has not been identified. Look up 'Paul Mineiro' in Google

Alekh Agarwal

This author has not been identified. Look up 'Alekh Agarwal' in Google