Yan-Bo Lin, Yu Tian, Linjie Yang, Gedas Bertasius, Heng Wang. VMAs: Video-to-Music Generation via Semantic Alignment in Web Music Videos. In IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025, Tucson, AZ, USA, February 26 - March 6, 2025, pages 1155-1165. IEEE, 2025.