Multimodal Attention Fusion for Target Speaker Extraction

Hiroshi Sato, Tsubasa Ochiai, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Shoko Araki. Multimodal Attention Fusion for Target Speaker Extraction. In IEEE Spoken Language Technology Workshop, SLT 2021, Shenzhen, China, January 19-22, 2021, pages 778-784. IEEE, 2021.

Authors

Hiroshi Sato

Tsubasa Ochiai

Keisuke Kinoshita

Marc Delcroix

Tomohiro Nakatani

Shoko Araki
