Using Vaes and Normalizing Flows for One-Shot Text-To-Speech Synthesis of Expressive Speech

Vatsal Aggarwal, Marius Cotescu, Nishant Prateek, Jaime Lorenzo-Trueba, Roberto Barra-Chicote. Using Vaes and Normalizing Flows for One-Shot Text-To-Speech Synthesis of Expressive Speech. In 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020, Barcelona, Spain, May 4-8, 2020. pages 6179-6183, IEEE, 2020. [doi]

@inproceedings{AggarwalCPLB20,
  title = {Using Vaes and Normalizing Flows for One-Shot Text-To-Speech Synthesis of Expressive Speech},
  author = {Vatsal Aggarwal and Marius Cotescu and Nishant Prateek and Jaime Lorenzo-Trueba and Roberto Barra-Chicote},
  year = {2020},
  doi = {10.1109/ICASSP40776.2020.9053678},
  url = {https://doi.org/10.1109/ICASSP40776.2020.9053678},
  researchr = {https://researchr.org/publication/AggarwalCPLB20},
  cites = {0},
  citedby = {0},
  pages = {6179-6183},
  booktitle = {2020 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020, Barcelona, Spain, May 4-8, 2020},
  publisher = {IEEE},
  isbn = {978-1-5090-6631-5},
}