Fusing Music and Video Modalities Using Multi-timescale Shared Representations

Bing Xu, Xiaogang Wang, Xiaoou Tang. Fusing Music and Video Modalities Using Multi-timescale Shared Representations. In Kien A. Hua, Yong Rui, Ralf Steinmetz, Alan Hanjalic, Apostol Natsev, Wenwu Zhu, editors, Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 3-7, 2014, pages 1073-1076. ACM, 2014.
