Hierarchical Timbre-Cadence Speaker Encoder for Zero-shot Speech Synthesis

Joun Yeop Lee, Jae-Sung Bae, Seongkyu Mun, Jihwan Lee, Ji-Hyun Lee, Hoon-Young Cho, Chanwoo Kim 0001. Hierarchical Timbre-Cadence Speaker Encoder for Zero-shot Speech Synthesis. In Naomi Harte, Julie Carson-Berndsen, Gareth Jones, editors, 24th Annual Conference of the International Speech Communication Association, Interspeech 2023, Dublin, Ireland, August 20-24, 2023. pages 4334-4338, ISCA, 2023. [doi]

Possibly Related Publications

The following publications are possibly variants of this publication: