DiTVC: One-Shot Voice Conversion via Diffusion Transformer with Environment and Speaking Rate Cloning

Yunyun Wang, Jiaqi Su, Adam Finkelstein, Rithesh Kumar, Ke Chen 0021, Zeyu Jin. DiTVC: One-Shot Voice Conversion via Diffusion Transformer with Environment and Speaking Rate Cloning. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2025, Tahoe City, CA, USA, October 12-15, 2025. pages 1-5, IEEE, 2025. [doi]

Abstract

Abstract is missing.