Multi-modal data augmentation based on masked modeling for image-text retrieval

Mingyu Wang, Guoqing Chao, Xi Wang, Yongyong Chen, Xijiong Xie, Dianhui Chu. Multi-modal data augmentation based on masked modeling for image-text retrieval. Knowl.-Based Syst., 324:113821, 2025.

@article{WangCWCXC25,
  title = {Multi-modal data augmentation based on masked modeling for image-text retrieval},
  author = {Mingyu Wang and Guoqing Chao and Xi Wang and Yongyong Chen and Xijiong Xie and Dianhui Chu},
  year = {2025},
  doi = {10.1016/j.knosys.2025.113821},
  url = {https://doi.org/10.1016/j.knosys.2025.113821},
  researchr = {https://researchr.org/publication/WangCWCXC25},
  cites = {0},
  citedby = {0},
  journal = {Knowl.-Based Syst.},
  volume = {324},
  pages = {113821},
}