CM-PIE: Cross-Modal Perception for Interactive-Enhanced Audio-Visual Video Parsing

Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang 0001. CM-PIE: Cross-Modal Perception for Interactive-Enhanced Audio-Visual Video Parsing. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024, Seoul, Republic of Korea, April 14-19, 2024. pages 8421-8425, IEEE, 2024. [doi]

@inproceedings{ChenGLWLL024,
  title = {CM-PIE: Cross-Modal Perception for Interactive-Enhanced Audio-Visual Video Parsing},
  author = {Yaru Chen and Ruohao Guo and Xubo Liu and Peipei Wu and Guangyao Li and Zhenbo Li and Wenwu Wang 0001},
  year = {2024},
  doi = {10.1109/ICASSP48485.2024.10446092},
  url = {https://doi.org/10.1109/ICASSP48485.2024.10446092},
  researchr = {https://researchr.org/publication/ChenGLWLL024},
  cites = {0},
  citedby = {0},
  pages = {8421-8425},
  booktitle = {IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024, Seoul, Republic of Korea, April 14-19, 2024},
  publisher = {IEEE},
  isbn = {979-8-3503-4485-1},
}