Detach and Enhance: Learning Disentangled Cross-modal Latent Representation for Efficient Face-Voice Association and Matching

Zhenning Yu, Xin Liu, Yiu-ming Cheung, Minghang Zhu, Xing Xu, Nannan Wang, Taihao Li. Detach and Enhance: Learning Disentangled Cross-modal Latent Representation for Efficient Face-Voice Association and Matching. In Xingquan Zhu, Sanjay Ranka, My T. Thai, Takashi Washio, Xindong Wu, editors, IEEE International Conference on Data Mining, ICDM 2022, Orlando, FL, USA, November 28 - December 1, 2022, pages 648-655. IEEE, 2022.

Abstract

Abstract is missing.