Masashi Aso, Shinnosuke Takamichi, Norihiro Takamune, Hiroshi Saruwatari. Acoustic model-based subword tokenization and prosodic-context extraction without language knowledge for text-to-speech synthesis. Speech Communication, 125:53-60, 2020. [doi]
Abstract is missing.