Enhancing Sentence Representation with Visually-supervised Multimodal Pre-training

Zhe Li, Laurence T. Yang, Xin Nie, Bocheng Ren, Xianjun Deng. Enhancing Sentence Representation with Visually-supervised Multimodal Pre-training. In Abdulmotaleb El-Saddik, Tao Mei, Rita Cucchiara, Marco Bertini 0001, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, M. Shamim Hossain, editors, Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023. pages 5686-5695, ACM, 2023. [doi]

Authors

Zhe Li

This author has not been identified. Look up 'Zhe Li' in Google

Laurence T. Yang

This author has not been identified. Look up 'Laurence T. Yang' in Google

Xin Nie

This author has not been identified. Look up 'Xin Nie' in Google

Bocheng Ren

This author has not been identified. Look up 'Bocheng Ren' in Google

Xianjun Deng

This author has not been identified. Look up 'Xianjun Deng' in Google