VCSE: Time-Domain Visual-Contextual Speaker Extraction Network

Junjie Li, Meng Ge, Zexu Pan, Longbiao Wang, Jianwu Dang. VCSE: Time-Domain Visual-Contextual Speaker Extraction Network. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. Pages 906-910, ISCA, 2022.

Authors

Junjie Li

Meng Ge

Zexu Pan

Longbiao Wang

Jianwu Dang
