Changrong Xiao, Sean Xin Xu, Kunpeng Zhang. Multimodal Data Augmentation for Image Captioning using Diffusion Models. In Zheng Wang, Cheng Long, Shihao Xu, Bingzheng Gan, Wei Shi, Zhao Cao, Tat-Seng Chua, editors, Proceedings of the 1st Workshop on Large Generative Models Meet Multimodal Applications, LGM3A 2023, Ottawa ON, Canada, 2 November 2023. pages 23-33, ACM, 2023. [doi]
Abstract is missing.