Zihao Zhou, Shudong Liu 0004, Maizhen Ning, Wei Liu, Jindong Wang, Derek F. Wong, Xiaowei Huang, Qiufeng Wang, Kaizhu Huang. Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist. In The Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025. OpenReview.net, 2025. [doi]
Abstract is missing.