All-in-One Transformer: Unifying Speech Recognition, Audio Tagging, and Event Detection

Niko Moritz, Gordon Wichern, Takaaki Hori, Jonathan Le Roux. All-in-One Transformer: Unifying Speech Recognition, Audio Tagging, and Event Detection. In Helen Meng, Bo Xu 0011, Thomas Fang Zheng, editors, Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020. pages 3112-3116, ISCA, 2020. [doi]

@inproceedings{MoritzWHR20,
  title = {All-in-One Transformer: Unifying Speech Recognition, Audio Tagging, and Event Detection},
  author = {Niko Moritz and Gordon Wichern and Takaaki Hori and Jonathan Le Roux},
  year = {2020},
  doi = {10.21437/Interspeech.2020-2757},
  url = {https://doi.org/10.21437/Interspeech.2020-2757},
  researchr = {https://researchr.org/publication/MoritzWHR20},
  cites = {0},
  citedby = {0},
  pages = {3112-3116},
  booktitle = {Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020},
  editor = {Helen Meng and Bo Xu 0011 and Thomas Fang Zheng},
  publisher = {ISCA},
}