SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping

Yuma Koizumi, Heiga Zen, Kohei Yatabe, Nanxin Chen, Michiel Bacchiani. SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. pages 803-807, ISCA, 2022. [doi]

@inproceedings{KoizumiZYCB22,
  title = {SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping},
  author = {Yuma Koizumi and Heiga Zen and Kohei Yatabe and Nanxin Chen and Michiel Bacchiani},
  year = {2022},
  doi = {10.21437/Interspeech.2022-301},
  url = {https://doi.org/10.21437/Interspeech.2022-301},
  researchr = {https://researchr.org/publication/KoizumiZYCB22},
  cites = {0},
  citedby = {0},
  pages = {803-807},
  booktitle = {Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022},
  editor = {Hanseok Ko and John H. L. Hansen},
  publisher = {ISCA},
}