All-in-One Transformer: Unifying Speech Recognition, Audio Tagging, and Event Detection

Niko Moritz, Gordon Wichern, Takaaki Hori, Jonathan Le Roux. All-in-One Transformer: Unifying Speech Recognition, Audio Tagging, and Event Detection. In Helen Meng, Bo Xu, Thomas Fang Zheng, editors, Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020. Pages 3112-3116, ISCA, 2020.
