Graph-Based Video-Language Learning with Multi-Grained Audio-Visual Alignment

Chenyang Lyu, Wenxi Li, Tianbo Ji, Longyue Wang, Liting Zhou, Cathal Gurrin, Linyi Yang, Yi Yu, Yvette Graham, Jennifer Foster. Graph-Based Video-Language Learning with Multi-Grained Audio-Visual Alignment. In Abdulmotaleb El-Saddik, Tao Mei, Rita Cucchiara, Marco Bertini 0001, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, M. Shamim Hossain, editors, Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023. pages 3975-3984, ACM, 2023. [doi]

@inproceedings{LyuLJWZGYYGF23,
  title = {Graph-Based Video-Language Learning with Multi-Grained Audio-Visual Alignment},
  author = {Chenyang Lyu and Wenxi Li and Tianbo Ji and Longyue Wang and Liting Zhou and Cathal Gurrin and Linyi Yang and Yi Yu and Yvette Graham and Jennifer Foster},
  year = {2023},
  doi = {10.1145/3581783.3612132},
  url = {https://doi.org/10.1145/3581783.3612132},
  researchr = {https://researchr.org/publication/LyuLJWZGYYGF23},
  cites = {0},
  citedby = {0},
  pages = {3975-3984},
  booktitle = {Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023},
  editor = {Abdulmotaleb El-Saddik and Tao Mei and Rita Cucchiara and Marco Bertini 0001 and Diana Patricia Tobon Vallejo and Pradeep K. Atrey and M. Shamim Hossain},
  publisher = {ACM},
}