Multi-modal data augmentation based on masked modeling for image-text retrieval

Mingyu Wang, Guoqing Chao, Xi Wang, Yongyong Chen, Xijiong Xie, Dianhui Chu. Multi-modal data augmentation based on masked modeling for image-text retrieval. Knowl.-Based Syst., 324:113821, 2025. [doi]

Abstract

Abstract is missing.