MsEmoTTS: Multi-Scale Emotion Transfer, Prediction, and Control for Emotional Speech Synthesis

Yi Lei, Shan Yang, Xinsheng Wang, Lei Xie. MsEmoTTS: Multi-Scale Emotion Transfer, Prediction, and Control for Emotional Speech Synthesis. IEEE Transactions on Audio, Speech & Language Processing, 30:853-864, 2022.

@article{LeiYWX22,
  title = {MsEmoTTS: Multi-Scale Emotion Transfer, Prediction, and Control for Emotional Speech Synthesis},
  author = {Yi Lei and Shan Yang and Xinsheng Wang and Lei Xie},
  year = {2022},
  doi = {10.1109/TASLP.2022.3145293},
  url = {https://doi.org/10.1109/TASLP.2022.3145293},
  researchr = {https://researchr.org/publication/LeiYWX22},
  cites = {0},
  citedby = {0},
  journal = {IEEE Transactions on Audio, Speech \& Language Processing},
  volume = {30},
  pages = {853--864},
}