Multimodal Attention Fusion for Target Speaker Extraction

Hiroshi Sato, Tsubasa Ochiai, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Shoko Araki. Multimodal Attention Fusion for Target Speaker Extraction. In IEEE Spoken Language Technology Workshop, SLT 2021, Shenzhen, China, January 19-22, 2021. pages 778-784, IEEE, 2021.
