Audio-Visual Multi-Speaker Tracking Based on the GLMB Framework

Shoufeng Lin, Xinyuan Qian. Audio-Visual Multi-Speaker Tracking Based on the GLMB Framework. In Helen Meng, Bo Xu, Thomas Fang Zheng, editors, Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020, pages 3082-3086. ISCA, 2020.
