The following publications are possibly variants of this publication:
- Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss FunctionQing Wang 0008, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee. iscslp 2022: 250-254 [doi]
- Multi-speaker Direction of Arrival Estimation Using Audio and Visual Modalities with Convolutional Neural NetworkYulin Wu, Ruimin Hu, Xiaochen Wang. icmcs 2023: 636-641 [doi]
- Multi-Target DoA Estimation with an Audio-Visual Fusion MechanismXinyuan Qian, Maulik C. Madhavi, Zexu Pan, Jiadong Wang, Haizhou Li 0001. icassp 2021: 4280-4284 [doi]