End-to-end audio-scene classification from raw audio: Multi time-frequency resolution CNN architecture for efficient representation learning

T. Vijaya Kumar, R. Shunmuga Sundar, Tilak Purohit, V. Ramasubramanian. End-to-end audio-scene classification from raw audio: Multi time-frequency resolution CNN architecture for efficient representation learning. In International Conference on Signal Processing and Communications, SPCOM 2020, Bangalore, India, July 19-24, 2020. pages 1-5, IEEE, 2020. [doi]

Abstract

Abstract is missing.