Disentangling The Prosody And Semantic Information With Pre-Trained Model For In-Context Learning Based Zero-Shot Voice Conversion

Zhengyang Chen, Shuai Wang, Mingyang Zhang, Xuechen Liu, Junichi Yamagishi, Yanmin Qian. Disentangling The Prosody And Semantic Information With Pre-Trained Model For In-Context Learning Based Zero-Shot Voice Conversion. In IEEE Spoken Language Technology Workshop, SLT 2024, Macao, December 2-5, 2024, pages 698-704. IEEE, 2024.
