Temporal Sub-Sampling of Audio Feature Sequences for Automated Audio Captioning

Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen. Temporal Sub-Sampling of Audio Feature Sequences for Automated Audio Captioning. In Nobutaka Ono, Noboru Harada, Yohei Kawaguchi, Annamaria Mesaros, Keisuke Imoto, Yuma Koizumi, Tatsuya Komatsu, editors, Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), Tokyo, Japan (full virtual), November 2-4, 2020. pages 110-114, 2020.

@inproceedings{NguyenDV20-0,
  title = {Temporal Sub-Sampling of Audio Feature Sequences for Automated Audio Captioning},
  author = {Khoa Nguyen and Konstantinos Drossos and Tuomas Virtanen},
  year = {2020},
  url = {http://dcase.community/documents/workshop2020/proceedings/DCASE2020Workshop_Nguyen_45.pdf},
  researchr = {https://researchr.org/publication/NguyenDV20-0},
  cites = {0},
  citedby = {0},
  pages = {110--114},
  booktitle = {Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), Tokyo, Japan (full virtual), November 2-4, 2020},
  editor = {Nobutaka Ono and Noboru Harada and Yohei Kawaguchi and Annamaria Mesaros and Keisuke Imoto and Yuma Koizumi and Tatsuya Komatsu},
  isbn = {978-4-600-00566-5},
}