Self-Explore: Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewards

Hyeonbin Hwang, Doyoung Kim, Seungone Kim, Seonghyeon Ye, Minjoon Seo. Self-Explore: Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewards. In Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen, editors, Findings of the Association for Computational Linguistics: EMNLP 2024, Miami, Florida, USA, November 12-16, 2024. pages 1444-1466, Association for Computational Linguistics, 2024. [doi]

Authors

Hyeonbin Hwang

This author has not been identified. Look up 'Hyeonbin Hwang' in Google

Doyoung Kim

This author has not been identified. Look up 'Doyoung Kim' in Google

Seungone Kim

This author has not been identified. Look up 'Seungone Kim' in Google

Seonghyeon Ye

This author has not been identified. Look up 'Seonghyeon Ye' in Google

Minjoon Seo

This author has not been identified. Look up 'Minjoon Seo' in Google