Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis

Minsu Kim, Pingchuan Ma 0001, Honglie Chen, Stavros Petridis, Maja Pantic. Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis. In Odette Scharenborg, Catharine Oertel, Khiet Truong, editors, 26th Annual Conference of the International Speech Communication Association, Interspeech 2025, Rotterdam, The Netherlands, 17-21 August 2025. ISCA, 2025. [doi]

Authors

Minsu Kim

This author has not been identified. Look up 'Minsu Kim' in Google

Pingchuan Ma 0001

This author has not been identified. Look up 'Pingchuan Ma 0001' in Google

Honglie Chen

This author has not been identified. Look up 'Honglie Chen' in Google

Stavros Petridis

This author has not been identified. Look up 'Stavros Petridis' in Google

Maja Pantic

This author has not been identified. Look up 'Maja Pantic' in Google