Triple M: A Practical Text-to-Speech Synthesis System with Multi-Guidance Attention and Multi-Band Multi-Time LPCNet

Shilun Lin, Fenglong Xie, Li Meng, Xinhui Li, Li Lu. Triple M: A Practical Text-to-Speech Synthesis System with Multi-Guidance Attention and Multi-Band Multi-Time LPCNet. In Hynek Hermansky, Honza Cernocký, Lukás Burget, Lori Lamel, Odette Scharenborg, Petr Motlícek, editors, Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021. pages 3640-3644, ISCA, 2021. [doi]

@inproceedings{LinXMLL21,
  title = {Triple M: A Practical Text-to-Speech Synthesis System with Multi-Guidance Attention and Multi-Band Multi-Time LPCNet},
  author = {Shilun Lin and Fenglong Xie and Li Meng and Xinhui Li and Li Lu},
  year = {2021},
  doi = {10.21437/Interspeech.2021-851},
  url = {https://doi.org/10.21437/Interspeech.2021-851},
  researchr = {https://researchr.org/publication/LinXMLL21},
  cites = {0},
  citedby = {0},
  pages = {3640-3644},
  booktitle = {Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August - 3 September 2021},
  editor = {Hynek Hermansky and Honza Cernocký and Lukás Burget and Lori Lamel and Odette Scharenborg and Petr Motlícek},
  publisher = {ISCA},
}