Mixup-Tuning: Enhancing Multimodal Reasoning via Attention-Aware Token Alignment and Consistency Regularization

Chenghao Ma, Junpeng Ding. Mixup-Tuning: Enhancing Multimodal Reasoning via Attention-Aware Token Alignment and Consistency Regularization. In Proceedings of the 2025 International Conference on Embodied Intelligence and Large Models, EILM 2025, Chengdu, China, December 19-21, 2025. pages 142-146, ACM, 2025. [doi]

Abstract

Abstract is missing.