A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

Yinmin Zhang, Jie Liu, Chuming Li, Yazhe Niu, Yaodong Yang 0001, Yu Liu, Wanli Ouyang. A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning. In Michael J. Wooldridge, Jennifer G. Dy, Sriraam Natarajan, editors, Thirty-Eigth AAAI Conference on Artificial Intelligence, AAAI 2024, Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence, IAAI 2024, Fourteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2014, February 20-27, 2024, Vancouver, Canada. pages 16908-16916, AAAI Press, 2024. [doi]

Authors

Yinmin Zhang

This author has not been identified. Look up 'Yinmin Zhang' in Google

Jie Liu

This author has not been identified. Look up 'Jie Liu' in Google

Chuming Li

This author has not been identified. Look up 'Chuming Li' in Google

Yazhe Niu

This author has not been identified. Look up 'Yazhe Niu' in Google

Yaodong Yang 0001

This author has not been identified. Look up 'Yaodong Yang 0001' in Google

Yu Liu

This author has not been identified. Look up 'Yu Liu' in Google

Wanli Ouyang

This author has not been identified. Look up 'Wanli Ouyang' in Google