Transfer Learning followed by Transformer for Automated Audio Captioning

Baekseung Kim, Hyejin Won, Il-Youp Kwak, Changwon Lim. Transfer Learning followed by Transformer for Automated Audio Captioning. In Frederic Font, Annamaria Mesaros, Daniel P. W. Ellis, Eduardo Fonseca, Magdalena Fuentes, Benjamin Elizalde, editors, Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), Online, November 15-19, 2021, pages 221-225.
