Two-stage prosody prediction for emotional text-to-speech synthesis

Hao Tang, Xi Zhou, Matthias Odisio, Mark Hasegawa-Johnson, Thomas S. Huang. Two-stage prosody prediction for emotional text-to-speech synthesis. In INTERSPEECH 2008, 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008, pages 2138-2141. ISCA, 2008.

@inproceedings{TangZOHH08,
  title = {Two-stage prosody prediction for emotional text-to-speech synthesis},
  author = {Hao Tang and Xi Zhou and Matthias Odisio and Mark Hasegawa-Johnson and Thomas S. Huang},
  year = {2008},
  url = {http://www.isca-speech.org/archive/interspeech_2008/i08_2138.html},
  researchr = {https://researchr.org/publication/TangZOHH08},
  cites = {0},
  citedby = {0},
  pages = {2138--2141},
  booktitle = {INTERSPEECH 2008, 9th Annual Conference of the International Speech Communication Association, Brisbane, Australia, September 22-26, 2008},
  publisher = {ISCA},
}