Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition

Guinan Li, Jiajun Deng, Youjun Chen, Mengzhe Geng, Shujie Hu, Zhe Li, Zengrui Jin, Tianzi Wang, Xurong Xie, Helen Meng, Xunying Liu. Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition. In Itshak Lapidot, Sharon Gannot, editors, 25th Annual Conference of the International Speech Communication Association, Interspeech 2024, Kos, Greece, September 1-5, 2024. ISCA, 2024. [doi]

Authors

Guinan Li

This author has not been identified. Look up 'Guinan Li' in Google

Jiajun Deng

This author has not been identified. Look up 'Jiajun Deng' in Google

Youjun Chen

This author has not been identified. Look up 'Youjun Chen' in Google

Mengzhe Geng

This author has not been identified. Look up 'Mengzhe Geng' in Google

Shujie Hu

This author has not been identified. Look up 'Shujie Hu' in Google

Zhe Li

This author has not been identified. Look up 'Zhe Li' in Google

Zengrui Jin

This author has not been identified. Look up 'Zengrui Jin' in Google

Tianzi Wang

This author has not been identified. Look up 'Tianzi Wang' in Google

Xurong Xie

This author has not been identified. Look up 'Xurong Xie' in Google

Helen Meng

This author has not been identified. Look up 'Helen Meng' in Google

Xunying Liu

This author has not been identified. Look up 'Xunying Liu' in Google