UniAVLM: Unified Large Audio-Visual Language Models for Comprehensive Video Understanding - researchr publication

researchr

You are not signed in
Sign in
Sign up

Lecheng Yan, Chenyang Lyu, Wenxi Li, Younes Samih, Shaochen Jiang. UniAVLM: Unified Large Audio-Visual Language Models for Comprehensive Video Understanding. In Yi Mei 0001, Chao Qian 0001, Quan Bai 0001, Bing Xue 0001, Sankalp Khanna, editors, PRICAI 2025: Trends in Artificial Intelligence - 22nd Pacific Rim International Conference on Artificial Intelligence, PRICAI 2025, Wellington, New Zealand, November 17-21, 2025, Proceedings, Part V. Volume 16455 of Lecture Notes in Computer Science, pages 103-118, Springer, 2025. [doi]

Abstract is missing.

runs on WebDSL