Learning and Fusing Multimodal Deep Features for Acoustic Scene Categorization

Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann. Learning and Fusing Multimodal Deep Features for Acoustic Scene Categorization. In Susanne Boll, Kyoung Mu Lee, Jiebo Luo, Wenwu Zhu 0001, Hyeran Byun, Chang Wen Chen, Rainer Lienhart, Tao Mei, editors, 2018 ACM Multimedia Conference on Multimedia Conference, MM 2018, Seoul, Republic of Korea, October 22-26, 2018. pages 1892-1900, ACM, 2018. [doi]

@inproceedings{YinSZ18,
  title = {Learning and Fusing Multimodal Deep Features for Acoustic Scene Categorization},
  author = {Yifang Yin and Rajiv Ratn Shah and Roger Zimmermann},
  year = {2018},
  doi = {10.1145/3240508.3240631},
  url = {https://doi.org/10.1145/3240508.3240631},
  researchr = {https://researchr.org/publication/YinSZ18},
  cites = {0},
  citedby = {0},
  pages = {1892-1900},
  booktitle = {2018 ACM Multimedia Conference on Multimedia Conference, MM 2018, Seoul, Republic of Korea, October 22-26, 2018},
  editor = {Susanne Boll and Kyoung Mu Lee and Jiebo Luo and Wenwu Zhu 0001 and Hyeran Byun and Chang Wen Chen and Rainer Lienhart and Tao Mei},
  publisher = {ACM},
  isbn = {978-1-4503-5665-7},
}