Haibin Wu, Xiaofei Wang, Sefik Emre Eskimez, Manthan Thakker, Daniel Tompkins, Chung-Hsien Tsai, Canrun Li, Zhen Xiao, Sheng Zhao, Jinyu Li, Naoyuki Kanda. Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-To-Speech. In IEEE Spoken Language Technology Workshop, SLT 2024, Macao, December 2-5, 2024. Pages 690-697, IEEE, 2024.