Muhan Lin, Shuyang Shi, Yue Guo, Behdad Chalaki, Vaishnav Tadiparthi, Ehsan Moradi-Pari, Simon Stepputtis, Joseph Campbell, Katia P. Sycara. Navigating Noisy Feedback: Enhancing Reinforcement Learning with Error-Prone Language Models. In Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen, editors, Findings of the Association for Computational Linguistics: EMNLP 2024, Miami, Florida, USA, November 12-16, 2024, pages 16002-16014. Association for Computational Linguistics, 2024.