Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-to-Speech

Takaaki Saeki, Heiga Zen, Zhehuai Chen, Nobuyuki Morioka, Gary Wang, Yu Zhang 0033, Ankur Bapna, Andrew Rosenberg, Bhuvana Ramabhadran. Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-to-Speech. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023, Rhodes Island, Greece, June 4-10, 2023. Pages 1-5, IEEE, 2023.

Authors

Takaaki Saeki

Heiga Zen

Zhehuai Chen

Nobuyuki Morioka

Gary Wang

Yu Zhang 0033

Ankur Bapna

Andrew Rosenberg

Bhuvana Ramabhadran
