Disfluency Disentanglement Enhancement in Spoken-Text-Style Transfer for Spontaneous Speech Synthesis

Yuuto Nakata, Daiki Yoshioka, Wen-Chin Huang, Tomoki Toda. Disfluency Disentanglement Enhancement in Spoken-Text-Style Transfer for Spontaneous Speech Synthesis. In Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2025, Singapore, October 22-24, 2025. pages 1098-1103, IEEE, 2025.

Abstract

Abstract is missing.