Learning a cross-domain embedding space of vocal and mixed audio with a structure-preserving triplet loss

Keunhyoung Luke Kim, Jongpil Lee, Sangeun Kum, Juhan Nam. Learning a cross-domain embedding space of vocal and mixed audio with a structure-preserving triplet loss. In Jin Ha Lee 0001, Alexander Lerch 0001, Zhiyao Duan, Juhan Nam, Preeti Rao, Peter van Kranenburg, Ajay Srinivasamurthy, editors, Proceedings of the 22nd International Society for Music Information Retrieval Conference, ISMIR 2021, Online, November 7-12, 2021. pages 334-341, 2021. [doi]

Abstract

Abstract is missing.