Multi-speaker Direction of Arrival Estimation Using Audio and Visual Modalities with Convolutional Neural Network

Yulin Wu, Ruimin Hu, Xiaochen Wang. Multi-speaker Direction of Arrival Estimation Using Audio and Visual Modalities with Convolutional Neural Network. In IEEE International Conference on Multimedia and Expo, ICME 2023, Brisbane, Australia, July 10-14, 2023, pages 636-641. IEEE, 2023.

Abstract

Abstract is missing.