Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis - researchr publication related

researchr

You are not signed in
Sign in
Sign up

Minsu Kim, Pingchuan Ma 0001, Honglie Chen, Stavros Petridis, Maja Pantic. Revival with Voice: Multi-modal Controllable Text-to-Speech Synthesis. In Odette Scharenborg, Catharine Oertel, Khiet Truong, editors, 26th Annual Conference of the International Speech Communication Association, Interspeech 2025, Rotterdam, The Netherlands, 17-21 August 2025. ISCA, 2025. [doi]

The following publications are possibly variants of this publication:

VCVTS: Multi-Speaker Video-to-Speech Synthesis Via Cross-Modal Knowledge Transfer from Voice ConversionDisong Wang, Shan Yang, Dan Su 0002, Xunying Liu, Dong Yu 0001, Helen Meng. icassp 2022: 7252-7256 [doi]

runs on WebDSL