Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Atsunori Ogawa, Tomohiro Nakatani. Multimodal SpeakerBeam: Single Channel Target Speech Extraction with Audio-Visual Speaker Clues. In Gernot Kubin, Zdravko Kacic, editors, Interspeech 2019, 20th Annual Conference of the International Speech Communication Association, Graz, Austria, 15-19 September 2019. pages 2718-2722, ISCA, 2019.
@inproceedings{OchiaiDKON19-0,
  title = {Multimodal SpeakerBeam: Single Channel Target Speech Extraction with Audio-Visual Speaker Clues},
  author = {Tsubasa Ochiai and Marc Delcroix and Keisuke Kinoshita and Atsunori Ogawa and Tomohiro Nakatani},
  year = {2019},
  doi = {10.21437/Interspeech.2019-1513},
  url = {https://doi.org/10.21437/Interspeech.2019-1513},
  researchr = {https://researchr.org/publication/OchiaiDKON19-0},
  cites = {0},
  citedby = {0},
  pages = {2718--2722},
  booktitle = {Interspeech 2019, 20th Annual Conference of the International Speech Communication Association, Graz, Austria, 15-19 September 2019},
  editor = {Gernot Kubin and Zdravko Kacic},
  publisher = {ISCA},
}