Multimodal Data Augmentation for Image Captioning using Diffusion Models

Changrong Xiao, Sean Xin Xu, Kunpeng Zhang. Multimodal Data Augmentation for Image Captioning using Diffusion Models. In Zheng Wang, Cheng Long, Shihao Xu, Bingzheng Gan, Wei Shi, Zhao Cao, Tat-Seng Chua, editors, Proceedings of the 1st Workshop on Large Generative Models Meet Multimodal Applications, LGM3A 2023, Ottawa ON, Canada, 2 November 2023. Pages 23-33, ACM, 2023.

Authors

Changrong Xiao

Sean Xin Xu

Kunpeng Zhang
