Listen Carefully and Tell: An Audio Captioning System Based on Residual Learning and Gammatone Audio Representation

Sergi Perez-Castanos, Javier Naranjo-Alcazar, Pedro Zuccarello, Maximo Cobos. Listen Carefully and Tell: An Audio Captioning System Based on Residual Learning and Gammatone Audio Representation. In Nobutaka Ono, Noboru Harada, Yohei Kawaguchi, Annamaria Mesaros, Keisuke Imoto, Yuma Koizumi, Tatsuya Komatsu, editors, Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), Tokyo, Japan (full virtual), November 2-4, 2020. pages 150-154, 2020. [doi]

Abstract

Abstract is missing.