Av-Sepformer: Cross-Attention Sepformer for Audio-Visual Target Speaker Extraction

Jiuxin Lin, Xinyu Cai, Heinrich Dinkel, Jun Chen, Zhiyong Yan, Yongqing Wang, Junbo Zhang, Zhiyong Wu 0001, Yujun Wang, Helen Meng. Av-Sepformer: Cross-Attention Sepformer for Audio-Visual Target Speaker Extraction. In IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP 2023, Rhodes Island, Greece, June 4-10, 2023. pages 1-5, IEEE, 2023. [doi]

Authors

Jiuxin Lin

This author has not been identified. Look up 'Jiuxin Lin' in Google

Xinyu Cai

This author has not been identified. Look up 'Xinyu Cai' in Google

Heinrich Dinkel

This author has not been identified. Look up 'Heinrich Dinkel' in Google

Jun Chen

This author has not been identified. Look up 'Jun Chen' in Google

Zhiyong Yan

This author has not been identified. Look up 'Zhiyong Yan' in Google

Yongqing Wang

This author has not been identified. Look up 'Yongqing Wang' in Google

Junbo Zhang

This author has not been identified. Look up 'Junbo Zhang' in Google

Zhiyong Wu 0001

This author has not been identified. Look up 'Zhiyong Wu 0001' in Google

Yujun Wang

This author has not been identified. Look up 'Yujun Wang' in Google

Helen Meng

This author has not been identified. Look up 'Helen Meng' in Google