Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis

Dong Yang, Yiyi Cai, Yuki Saito 0001, Lixu Wang, Hiroshi Saruwatari. Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis. In Danielle Belgrave, Cheng Zhang 0005, Laura N. Montoya, Hsuan-Tien Lin, Razvan Pascanu, Piotr Koniusz, Marzyeh Ghassemi, Nancy Chen, Iván Vladimir Meza Ruíz, Arturo Loaiza-Bonilla, editors, Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, NeurIPS 2025, San Diago, CA, USA, December 2-7, 2025 / Mexico City, Mexico, November 30 - December 5, 2025. 2025. [doi]

Authors

Dong Yang

This author has not been identified. Look up 'Dong Yang' in Google

Yiyi Cai

This author has not been identified. Look up 'Yiyi Cai' in Google

Yuki Saito 0001

This author has not been identified. Look up 'Yuki Saito 0001' in Google

Lixu Wang

This author has not been identified. Look up 'Lixu Wang' in Google

Hiroshi Saruwatari

This author has not been identified. Look up 'Hiroshi Saruwatari' in Google