Is RLHF More Difficult than Standard RL? A Theoretical Perspective

Yuanhao Wang 0001, Qinghua Liu, Chi Jin 0001. Is RLHF More Difficult than Standard RL? A Theoretical Perspective. In Alice Oh, Tristan Naumann, Amir Globerson, Kate Saenko, Moritz Hardt, Sergey Levine, editors, Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, NeurIPS 2023, New Orleans, LA, USA, December 10 - 16, 2023. 2023. [doi]

Authors

Yuanhao Wang 0001

This author has not been identified. Look up 'Yuanhao Wang 0001' in Google

Qinghua Liu

This author has not been identified. Look up 'Qinghua Liu' in Google

Chi Jin 0001

This author has not been identified. Look up 'Chi Jin 0001' in Google