Khaled Koutini, Shahed Masoudian, Florian Schmid, Hamid Eghbal-zadeh, Jan Schlüter, Gerhard Widmer. Learning General Audio Representations With Large-Scale Training of Patchout Audio Transformers. In Joseph Turian, Björn W. Schuller, Dorien Herremans, Katrin Kirchoff, L. Paola García-Perera, Philippe Esling, editors, HEAR: Holistic Evaluation of Audio Representations, Virtual Event, December 13-14, 2021. Volume 166 of Proceedings of Machine Learning Research, pages 65-89, PMLR, 2021. [doi]
Abstract is missing.