Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement

Weimin Xiong, Yifan Song, Xiutian Zhao, Wenhao Wu, Xun Wang, Ke Wang, Cheng Li, Wei Peng, Sujian Li. Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement. In Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen, editors, Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, FL, USA, November 12-16, 2024. pages 1556-1572, Association for Computational Linguistics, 2024. [doi]

Authors

Weimin Xiong

This author has not been identified. Look up 'Weimin Xiong' in Google

Yifan Song

This author has not been identified. Look up 'Yifan Song' in Google

Xiutian Zhao

This author has not been identified. Look up 'Xiutian Zhao' in Google

Wenhao Wu

This author has not been identified. Look up 'Wenhao Wu' in Google

Xun Wang

This author has not been identified. Look up 'Xun Wang' in Google

Ke Wang

This author has not been identified. Look up 'Ke Wang' in Google

Cheng Li

This author has not been identified. Look up 'Cheng Li' in Google

Wei Peng

This author has not been identified. Look up 'Wei Peng' in Google

Sujian Li

This author has not been identified. Look up 'Sujian Li' in Google