Neural Audio Captioning Based on Conditional Sequence-to-Sequence Model

Shota Ikawa, Kunio Kashino. Neural Audio Captioning Based on Conditional Sequence-to-Sequence Model. In Michael I. Mandel, Justin Salamon, Daniel P. W. Ellis, editors, Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), New York University, NY, USA, October 2019. pages 99-103, 2019. [doi]

Abstract

Abstract is missing.