High quality voice conversion based on Gaussian mixture model with dynamic frequency warping

Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano. High quality voice conversion based on Gaussian mixture model with dynamic frequency warping. In Paul Dalsgaard, Børge Lindberg, Henrik Benner, Zheng-Hua Tan, editors, EUROSPEECH 2001 Scandinavia, 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, Aalborg, Denmark, September 3-7, 2001. pages 349-352, ISCA, 2001. [doi]

@inproceedings{TodaSS01,
  title = {High quality voice conversion based on Gaussian mixture model with dynamic frequency warping},
  author = {Tomoki Toda and Hiroshi Saruwatari and Kiyohiro Shikano},
  year = {2001},
  url = {http://www.isca-speech.org/archive/eurospeech_2001/e01_0349.html},
  tags = {rule-based},
  researchr = {https://researchr.org/publication/TodaSS01},
  cites = {0},
  citedby = {0},
  pages = {349-352},
  booktitle = {EUROSPEECH 2001 Scandinavia, 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, Aalborg, Denmark, September 3-7, 2001},
  editor = {Paul Dalsgaard and Børge Lindberg and Henrik Benner and Zheng-Hua Tan},
  publisher = {ISCA},
}