Efficient Multi-angle Audio-visual Speech Recognition using Parallel WaveGAN based Scene Classifier

Shinnosuke Isobe, Satoshi Tamura, Yuuto Gotoh, Masaki Nose. Efficient Multi-angle Audio-visual Speech Recognition using Parallel WaveGAN based Scene Classifier. In Maria De Marsico, Gabriella Sanniti di Baja, Ana L. N. Fred, editors, Proceedings of the 11th International Conference on Pattern Recognition Applications and Methods, ICPRAM 2022, Online Streaming, February 3-5, 2022. pages 449-460, SCITEPRESS, 2022. [doi]

Abstract

Abstract is missing.