Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition

Wenyong Huang, Wenchao Hu, Yu Ting Yeung, Xiao Chen. Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition. In Helen Meng, Bo Xu 0011, Thomas Fang Zheng, editors, Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020. pages 5001-5005, ISCA, 2020. [doi]

Authors

Wenyong Huang

This author has not been identified. Look up 'Wenyong Huang' in Google

Wenchao Hu

This author has not been identified. Look up 'Wenchao Hu' in Google

Yu Ting Yeung

This author has not been identified. Look up 'Yu Ting Yeung' in Google

Xiao Chen

This author has not been identified. Look up 'Xiao Chen' in Google