Perceptually based automatic prosody labeling and prosodically enriched unit selection improve concatenative text-to-speech synthesis

Colin W. Wightman, Ann K. Syrdal, Georg Stemmer, Alistair Conkie, Mark C. Beutnagel. Perceptually based automatic prosody labeling and prosodically enriched unit selection improve concatenative text-to-speech synthesis. In Sixth International Conference on Spoken Language Processing, ICSLP 2000 / INTERSPEECH 2000, Beijing, China, October 16-20, 2000. pages 71-74, ISCA, 2000. [doi]