Multi-modal Voice Activity Detection by Embedding Image Features into Speech Signal

Yohei Abe, Akinori Ito. Multi-modal Voice Activity Detection by Embedding Image Features into Speech Signal. In Kebin Jia, Jeng-Shyang Pan, Yao Zhao, Lakhmi C. Jain, editors, Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, IIH-MSP 2013, Beijing, China, October 16-18, 2013. Pages 271-274, IEEE, 2013.

Abstract

Abstract is missing.