VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature

Chenpeng Du, Yiwei Guo, Xie Chen, Kai Yu 0004. VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. pages 1596-1600, ISCA, 2022. [doi]

@inproceedings{DuGC022,
  title = {VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature},
  author = {Chenpeng Du and Yiwei Guo and Xie Chen and Kai Yu 0004},
  year = {2022},
  doi = {10.21437/Interspeech.2022-489},
  url = {https://doi.org/10.21437/Interspeech.2022-489},
  researchr = {https://researchr.org/publication/DuGC022},
  cites = {0},
  citedby = {0},
  pages = {1596-1600},
  booktitle = {Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022},
  editor = {Hanseok Ko and John H. L. Hansen},
  publisher = {ISCA},
}