Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis

Minsu Kim, Pingchuan Ma 0001, Honglie Chen, Stavros Petridis, Maja Pantic. Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis. In Odette Scharenborg, Catharine Oertel, Khiet Truong, editors, 26th Annual Conference of the International Speech Communication Association, Interspeech 2025, Rotterdam, The Netherlands, 17-21 August 2025. ISCA, 2025. [doi]

Possibly Related Publications

The following publications are possibly variants of this publication: