Multi-Modal Cross-Attention-Guided Network for Audio-Visual Quality Evaluation via Visual Saliency and Mel-Spectrum Features

Junhao Lin, Yueli Cui, Chenli Fang, Binghong Pan, Chencheng Pan, Gangyi Jiang, Shiqing Zhang, Siwei Ma 0001, Qi Tian 0001. Multi-Modal Cross-Attention-Guided Network for Audio-Visual Quality Evaluation via Visual Saliency and Mel-Spectrum Features. IEEE Trans. Circuits Syst. Video Techn., 36(5):6783-6798, May 2026. [doi]

Abstract

Abstract is missing.