A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition

Jin Li, Rongfeng Su, Xurong Xie, Lan Wang, Nan Yan. A Multi-level Acoustic Feature Extraction Framework for Transformer Based End-to-End Speech Recognition. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. pages 3173-3177, ISCA, 2022. [doi]

Abstract

Abstract is missing.