Multi-modal Voice Activity Detection by Embedding Image Features into Speech Signal

Yohei Abe, Akinori Ito. Multi-modal Voice Activity Detection by Embedding Image Features into Speech Signal. In Kebin Jia, Jeng-Shyang Pan, Yao Zhao, Lakhmi C. Jain, editors, Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2013, Beijing, China, October 16-18, 2013. pages 271-274, IEEE, 2013. [doi]

@inproceedings{AbeI13,
  title = {Multi-modal Voice Activity Detection by Embedding Image Features into Speech Signal},
  author = {Yohei Abe and Akinori Ito},
  year = {2013},
  doi = {10.1109/IIH-MSP.2013.76},
  url = {http://dx.doi.org/10.1109/IIH-MSP.2013.76},
  researchr = {https://researchr.org/publication/AbeI13},
  cites = {0},
  citedby = {0},
  pages = {271-274},
  booktitle = {Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2013, Beijing, China, October 16-18, 2013},
  editor = {Kebin Jia and Jeng-Shyang Pan and Yao Zhao and Lakhmi C. Jain},
  publisher = {IEEE},
}