AST: Audio Spectrogram Transformer

Yuan Gong, Yu-An Chung, James R. Glass. AST: Audio Spectrogram Transformer. In Hynek Hermansky, Honza Cernocký, Lukás Burget, Lori Lamel, Odette Scharenborg, Petr Motlícek, editors, Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021. pages 571-575, ISCA, 2021. [doi]

@inproceedings{GongCG21-0,
  title = {AST: Audio Spectrogram Transformer},
  author = {Yuan Gong and Yu-An Chung and James R. Glass},
  year = {2021},
  doi = {10.21437/Interspeech.2021-698},
  url = {https://doi.org/10.21437/Interspeech.2021-698},
  researchr = {https://researchr.org/publication/GongCG21-0},
  cites = {0},
  citedby = {0},
  pages = {571-575},
  booktitle = {Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021},
  editor = {Hynek Hermansky and Honza Cernocký and Lukás Burget and Lori Lamel and Odette Scharenborg and Petr Motlícek},
  publisher = {ISCA},
}