Multi-modal data augmentation based on masked modeling for image-text retrieval

Mingyu Wang, Guoqing Chao, Xi Wang, Yongyong Chen, Xijiong Xie, Dianhui Chu. Multi-modal data augmentation based on masked modeling for image-text retrieval. Knowl.-Based Syst., 324:113821, 2025. [doi]

Possibly Related Publications

The following publications are possibly variants of this publication: