MDPO: Multi-Granularity Direct Preference Optimization for Mathematical Reasoning

Yunze Lin. MDPO: Multi-Granularity Direct Preference Optimization for Mathematical Reasoning. In International Joint Conference on Neural Networks, IJCNN 2025, Rome, Italy, June 30 - July 5, 2025. Pages 1-7, IEEE, 2025.
