Fatemeh Pishdadian, Prem Seetharaman, Bongjun Kim, Bryan Pardo. Classifying Non-speech Vocals: Deep vs Signal Processing Representations. In Michael I. Mandel, Justin Salamon, Daniel P. W. Ellis, editors, Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), New York University, NY, USA, October 2019. pages 194-198, 2019. [doi]
Abstract is missing.