MDPO: Multi-Granularity Direct Preference Optimization for Mathematical Reasoning

Yunze Lin. MDPO: Multi-Granularity Direct Preference Optimization for Mathematical Reasoning. In International Joint Conference on Neural Networks, IJCNN 2025, Rome, Italy, June 30 - July 5, 2025. pages 1-7, IEEE, 2025. [doi]

Authors

Yunze Lin

This author has not been identified. Look up 'Yunze Lin' in Google