Speaker Localization Based on Audio-Visual Bimodal Fusion

Yingxin Zhu, Hao-Ran Jin. Speaker Localization Based on Audio-Visual Bimodal Fusion. JACIII, 25(3):375-382, 2021.

Authors

Yingxin Zhu

Hao-Ran Jin
