Haoyu Wang, Jiale Chen, Jiaxun Li, Sizhe Shan, Yuehai Wang. EFTTS: Zero-Shot Emotional Speech Synthesis via Conditional Flow Matching and Self-Supervised Representations. In Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2025, Singapore, October 22-24, 2025. pages 795-800, IEEE, 2025. [doi]
Abstract is missing.