Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning

Qian Jiang, Changyou Chen, Han Zhao 0002, Liqun Chen, Qing-ping, Son Dinh Tran, Yi Xu, Belinda Zeng, Trishul Chilimbi. Understanding and Constructing Latent Modality Structures in Multi-Modal Representation Learning. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023, Vancouver, BC, Canada, June 17-24, 2023. pages 7661-7671, IEEE, 2023. [doi]

Abstract

Abstract is missing.