Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings

I-Chun Chern, Kuo-Hsuan Hung, Yi-Ting Chen, Tassadaq Hussain, Mandar Gogate, Amir Hussain, Yu Tsao, Jen-Cheng Hou. Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings. In IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2023 - Workshops, Rhodes Island, Greece, June 4-10, 2023. Pages 1-5, IEEE, 2023.

Authors

I-Chun Chern

Kuo-Hsuan Hung

Yi-Ting Chen

Tassadaq Hussain

Mandar Gogate

Amir Hussain

Yu Tsao

Jen-Cheng Hou
