Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF

Han Shen, Zhuoran Yang, Tianyi Chen. Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF. In Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024, pages 44774-44799. OpenReview.net, 2024.

Abstract

Abstract is missing.