$C^{2}$AV-TSE: Context and Confidence-Aware Audio Visual Target Speaker Extraction

Wenxuan Wu, Xueyuan Chen, Shuai Wang 0016, Jiadong Wang, Lingwei Meng, Xixin Wu, Helen Meng, Haizhou Li 0001. $C^{2}$AV-TSE: Context and Confidence-Aware Audio Visual Target Speaker Extraction. J. Sel. Topics Signal Processing, 19(4):646-657, May 2025. [doi]

Authors

Wenxuan Wu

This author has not been identified. Look up 'Wenxuan Wu' in Google

Xueyuan Chen

This author has not been identified. Look up 'Xueyuan Chen' in Google

Shuai Wang 0016

This author has not been identified. Look up 'Shuai Wang 0016' in Google

Jiadong Wang

This author has not been identified. Look up 'Jiadong Wang' in Google

Lingwei Meng

This author has not been identified. Look up 'Lingwei Meng' in Google

Xixin Wu

This author has not been identified. Look up 'Xixin Wu' in Google

Helen Meng

This author has not been identified. Look up 'Helen Meng' in Google

Haizhou Li 0001

This author has not been identified. Look up 'Haizhou Li 0001' in Google