Rethinking the Visual Cues in Audio-Visual Speaker Extraction

Junjie Li, Meng Ge, Zexu Pan, Rui Cao, Longbiao Wang, Jianwu Dang, Shiliang Zhang. Rethinking the Visual Cues in Audio-Visual Speaker Extraction. In Naomi Harte, Julie Carson-Berndsen, Gareth Jones, editors, 24th Annual Conference of the International Speech Communication Association, Interspeech 2023, Dublin, Ireland, August 20-24, 2023. Pages 3754-3758, ISCA, 2023.

Authors

Junjie Li

Meng Ge

Zexu Pan

Rui Cao

Longbiao Wang

Jianwu Dang

Shiliang Zhang
