Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation

Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae Min Kim, Kentaro Tachibana. Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. pages 3018-3022, ISCA, 2022. [doi]

Authors

Ryo Terashima

This author has not been identified. Look up 'Ryo Terashima' in Google

Ryuichi Yamamoto

This author has not been identified. Look up 'Ryuichi Yamamoto' in Google

Eunwoo Song

This author has not been identified. Look up 'Eunwoo Song' in Google

Yuma Shirahata

This author has not been identified. Look up 'Yuma Shirahata' in Google

Hyun-Wook Yoon

This author has not been identified. Look up 'Hyun-Wook Yoon' in Google

Jae Min Kim

This author has not been identified. Look up 'Jae Min Kim' in Google

Kentaro Tachibana

This author has not been identified. Look up 'Kentaro Tachibana' in Google