Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang 0001. CM-PIE: Cross-Modal Perception for Interactive-Enhanced Audio-Visual Video Parsing. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024, Seoul, Republic of Korea, April 14-19, 2024. pages 8421-8425, IEEE, 2024. [doi]
Abstract is missing.