M3ixup: A multi-modal data augmentation approach for image captioning

Yinan Li, Jiayi Ji, Xiaoshuai Sun, Yiyi Zhou, Yunpeng Luo, Rongrong Ji. M3ixup: A multi-modal data augmentation approach for image captioning. Pattern Recognition, 158:110941, 2025. [doi]

Authors

Yinan Li

This author has not been identified. Look up 'Yinan Li' in Google

Jiayi Ji

This author has not been identified. Look up 'Jiayi Ji' in Google

Xiaoshuai Sun

This author has not been identified. Look up 'Xiaoshuai Sun' in Google

Yiyi Zhou

This author has not been identified. Look up 'Yiyi Zhou' in Google

Yunpeng Luo

This author has not been identified. Look up 'Yunpeng Luo' in Google

Rongrong Ji

This author has not been identified. Look up 'Rongrong Ji' in Google