Mellotron: Multispeaker Expressive Voice Synthesis by Conditioning on Rhythm, Pitch and Global Style Tokens - researchr publication

researchr

You are not signed in
Sign in
Sign up

Rafael Valle, Jason Li, Ryan Prenger, Bryan Catanzaro. Mellotron: Multispeaker Expressive Voice Synthesis by Conditioning on Rhythm, Pitch and Global Style Tokens. In 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020, Barcelona, Spain, May 4-8, 2020. pages 6189-6193, IEEE, 2020. [doi]

Abstract is missing.

runs on WebDSL