Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation

Sravya Popuri, Peng-Jen Chen, Changhan Wang, Juan Pino, Yossi Adi, Jiatao Gu, Wei-Ning Hsu, Ann Lee. Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation. In Hanseok Ko, John H. L. Hansen, editors, Interspeech 2022, 23rd Annual Conference of the International Speech Communication Association, Incheon, Korea, 18-22 September 2022. Pages 5195-5199, ISCA, 2022.

Authors

Sravya Popuri

Peng-Jen Chen

Changhan Wang

Juan Pino

Yossi Adi

Jiatao Gu

Wei-Ning Hsu

Ann Lee
