MathFish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula

Li Lucy, Tal August, Rose E. Wang, Luca Soldaini, Courtney Allison, Kyle Lo. MathFish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula. In Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen, editors, Findings of the Association for Computational Linguistics: EMNLP 2024, Miami, Florida, USA, November 12-16, 2024, pages 5644-5673. Association for Computational Linguistics, 2024.

Authors

Li Lucy

Tal August

Rose E. Wang

Luca Soldaini

Courtney Allison

Kyle Lo
