Temporal Sub-Sampling of Audio Feature Sequences for Automated Audio Captioning

Khoa Nguyen, Konstantinos Drossos, Tuomas Virtanen. Temporal Sub-Sampling of Audio Feature Sequences for Automated Audio Captioning. In Nobutaka Ono, Noboru Harada, Yohei Kawaguchi, Annamaria Mesaros, Keisuke Imoto, Yuma Koizumi, Tatsuya Komatsu, editors, Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), Tokyo, Japan (full virtual), November 2-4, 2020, pages 110-114.

Abstract

Abstract is missing.