The following publications are possibly variants of this publication:
- CISum: Learning Cross-modality Interaction to Enhance Multimodal Semantic Coverage for Multimodal Summarization. Litian Zhang, Xiaoming Zhang 0001, Ziming Guo, ZhiPeng Liu. SDM 2023: 370-378
- Multimodal Data Enhanced Representation Learning for Knowledge Graphs. Zikang Wang, Linjing Li, Qiudan Li, Daniel Zeng. IJCNN 2019: 1-8
- Representation Learning for Compressed Video Action Recognition via Attentive Cross-modal Interaction with Motion Enhancement. Bing Li, Jiaxin Chen, Dongming Zhang, Xiuguo Bao, Di Huang 0001. IJCAI 2022: 1060-1066
- Detach and Enhance: Learning Disentangled Cross-modal Latent Representation for Efficient Face-Voice Association and Matching. Zhenning Yu, Xin Liu, Yiu-ming Cheung, Minghang Zhu, Xing Xu, Nannan Wang, Taihao Li. ICDM 2022: 648-655