EFTTS: Zero-Shot Emotional Speech Synthesis via Conditional Flow Matching and Self-Supervised Representations

Haoyu Wang, Jiale Chen, Jiaxun Li, Sizhe Shan, Yuehai Wang. EFTTS: Zero-Shot Emotional Speech Synthesis via Conditional Flow Matching and Self-Supervised Representations. In Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2025, Singapore, October 22-24, 2025. pages 795-800, IEEE, 2025. [doi]

Authors

Haoyu Wang

This author has not been identified. Look up 'Haoyu Wang' in Google

Jiale Chen

This author has not been identified. Look up 'Jiale Chen' in Google

Jiaxun Li

This author has not been identified. Look up 'Jiaxun Li' in Google

Sizhe Shan

This author has not been identified. Look up 'Sizhe Shan' in Google

Yuehai Wang

This author has not been identified. Look up 'Yuehai Wang' in Google