Transfer Learning from Audio-Visual Grounding to Speech Recognition

Wei-Ning Hsu, David Harwath, James R. Glass. Transfer Learning from Audio-Visual Grounding to Speech Recognition. In Gernot Kubin, Zdravko Kacic, editors, Interspeech 2019, 20th Annual Conference of the International Speech Communication Association, Graz, Austria, 15-19 September 2019, pages 3242-3246. ISCA, 2019.
