Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image

Shunsuke Goto, Kotaro Onishi, Yuki Saito, Kentaro Tachibana, Koichiro Mori. Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image. In Helen Meng, Bo Xu, Thomas Fang Zheng, editors, Interspeech 2020, 21st Annual Conference of the International Speech Communication Association, Virtual Event, Shanghai, China, 25-29 October 2020. Pages 1321-1325, ISCA, 2020.

Authors

Shunsuke Goto

Kotaro Onishi

Yuki Saito

Kentaro Tachibana

Koichiro Mori
