Learning and Fusing Multimodal Deep Features for Acoustic Scene Categorization

Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann. Learning and Fusing Multimodal Deep Features for Acoustic Scene Categorization. In Susanne Boll, Kyoung Mu Lee, Jiebo Luo, Wenwu Zhu, Hyeran Byun, Chang Wen Chen, Rainer Lienhart, Tao Mei, editors, 2018 ACM Multimedia Conference on Multimedia Conference, MM 2018, Seoul, Republic of Korea, October 22-26, 2018. Pages 1892-1900, ACM, 2018.
