Emotion-Rich Cross-Speaker TTS via Contrastive Prosody Enhancement

Jen-Tzung Chien, Bryan Gautama Ngo. Emotion-Rich Cross-Speaker TTS via Contrastive Prosody Enhancement. In Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2025, Singapore, October 22-24, 2025. Pages 1110-1115, IEEE, 2025.

Abstract

Abstract is missing.