Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis

Kun Zhou, Shengkui Zhao, Yukun Ma, Chong Zhang, Hao Wang, Dianwen Ng, Chongjia Ni, Trung Hieu Nguyen, Jia Qi Yip, Bin Ma. Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis. In Itshak Lapidot, Sharon Gannot, editors, 25th Annual Conference of the International Speech Communication Association, Interspeech 2024, Kos, Greece, September 1-5, 2024. ISCA, 2024.
