Fine-Tuning Pre-Trained Voice Conversion Model for Adding New Target Speakers with Limited Data

Takeshi Koshizuka, Hidefumi Ohmura, Kouichi Katsurada. Fine-Tuning Pre-Trained Voice Conversion Model for Adding New Target Speakers with Limited Data. In Hynek Hermansky, Honza Cernocký, Lukás Burget, Lori Lamel, Odette Scharenborg, Petr Motlícek, editors, Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021. pages 1339-1343, ISCA, 2021. [doi]

@inproceedings{KoshizukaOK21,
  title = {Fine-Tuning Pre-Trained Voice Conversion Model for Adding New Target Speakers with Limited Data},
  author = {Takeshi Koshizuka and Hidefumi Ohmura and Kouichi Katsurada},
  year = {2021},
  doi = {10.21437/Interspeech.2021-244},
  url = {https://doi.org/10.21437/Interspeech.2021-244},
  researchr = {https://researchr.org/publication/KoshizukaOK21},
  cites = {0},
  citedby = {0},
  pages = {1339-1343},
  booktitle = {Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021},
  editor = {Hynek Hermansky and Honza Cernocký and Lukás Burget and Lori Lamel and Odette Scharenborg and Petr Motlícek},
  publisher = {ISCA},
}