Crowdsourcing a Dataset of Audio Captions

Samuel Lipping, Konstantinos Drossos, Tuomas Virtanen. Crowdsourcing a Dataset of Audio Captions. In Michael I. Mandel, Justin Salamon, Daniel P. W. Ellis, editors, Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), New York University, NY, USA, October 2019. pages 139-143, 2019. [doi]

Abstract

Abstract is missing.