CLAPSep: Leveraging Contrastive Pre-Trained Model for Multi-Modal Query-Conditioned Target Sound Extraction

Hao Ma, Zhiyuan Peng, Xu Li, Mingjie Shao, Xixin Wu, Ju Liu. CLAPSep: Leveraging Contrastive Pre-Trained Model for Multi-Modal Query-Conditioned Target Sound Extraction. IEEE Transactions on Audio, Speech & Language Processing, 32:4945-4960, 2024.

Abstract

Abstract is missing.