The following publications are possibly variants of this publication:
- Vision Language Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation. Chaoya Jiang, Wei Ye 0004, Haiyang Xu, Songfang Huang, Fei Huang 0004, Shikun Zhang. ACL 2023: 14660-14679
- UC2: Universal Cross-Lingual Cross-Modal Vision-and-Language Pre-Training. Mingyang Zhou, Luowei Zhou, Shuohang Wang, Yu Cheng 0001, Linjie Li, Zhou Yu, Jingjing Liu. CVPR 2021: 4155-4165
- Unicoder-VL: A Universal Encoder for Vision and Language by Cross-Modal Pre-Training. Gen Li, Nan Duan, Yuejian Fang, Ming Gong, Daxin Jiang. AAAI 2020: 11336-11344
- Cross-modal Semantic Alignment Pre-training for Vision-and-Language Navigation. Siying Wu, Xueyang Fu, Feng Wu 0001, Zheng-Jun Zha. MM 2022: 4233-4241
- A Hierarchical Self-organizing Associative Memory for Machine Learning. Janusz A. Starzyk, Haibo He, Yue Li. ISNN 2007: 413-423