Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

Yuxuan Wang, Daisy Stanton, Yu Zhang, R. J. Skerry-Ryan, Eric Battenberg, Joel Shor, Ying Xiao, Ye Jia, Fei Ren, Rif A. Saurous. Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis. In Jennifer G. Dy, Andreas Krause, editors, Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018. Volume 80 of JMLR Workshop and Conference Proceedings, pages 5167-5176, JMLR.org, 2018.

@inproceedings{WangSZRBSXJRS18,
  title = {Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis},
  author = {Yuxuan Wang and Daisy Stanton and Yu Zhang and R. J. Skerry-Ryan and Eric Battenberg and Joel Shor and Ying Xiao and Ye Jia and Fei Ren and Rif A. Saurous},
  year = {2018},
  url = {http://proceedings.mlr.press/v80/wang18h.html},
  researchr = {https://researchr.org/publication/WangSZRBSXJRS18},
  pages = {5167--5176},
  booktitle = {Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018},
  editor = {Jennifer G. Dy and Andreas Krause},
  volume = {80},
  series = {JMLR Workshop and Conference Proceedings},
  publisher = {JMLR.org},
}