A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification

Qing Wang 0008, Jun Du, SiYuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee. A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification. In Kong-Aik Lee, Hung-yi Lee, Yanfeng Lu, Minghui Dong, editors, 13th International Symposium on Chinese Spoken Language Processing, ISCSLP 2022, Singapore, December 11-14, 2022. pages 453-457, IEEE, 2022. [doi]

Abstract

Abstract is missing.