Transduce and Speak: Neural Transducer for Text-To-Speech with Semantic Token Prediction

Minchan Kim, Myeonghun Jeong, Byoung Jin Choi, Dongjune Lee, Nam Soo Kim. Transduce and Speak: Neural Transducer for Text-To-Speech with Semantic Token Prediction. In IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023, Taipei, Taiwan, December 16-20, 2023. pages 1-7, IEEE, 2023.

Authors

Minchan Kim

Myeonghun Jeong

Byoung Jin Choi

Dongjune Lee

Nam Soo Kim
