P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting

Sungwon Kim, Kevin J. Shih, Rohan Badlani, João Felipe Santos, Evelina Bakhturina, Mikyas Desta, Rafael Valle, Sungroh Yoon, Bryan Catanzaro. P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting. In Alice Oh, Tristan Naumann, Amir Globerson, Kate Saenko, Moritz Hardt, Sergey Levine, editors, Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, NeurIPS 2023, New Orleans, LA, USA, December 10 - 16, 2023. 2023. [doi]

Authors

Sungwon Kim

This author has not been identified. Look up 'Sungwon Kim' in Google

Kevin J. Shih

This author has not been identified. Look up 'Kevin J. Shih' in Google

Rohan Badlani

This author has not been identified. Look up 'Rohan Badlani' in Google

João Felipe Santos

This author has not been identified. Look up 'João Felipe Santos' in Google

Evelina Bakhturina

This author has not been identified. Look up 'Evelina Bakhturina' in Google

Mikyas Desta

This author has not been identified. Look up 'Mikyas Desta' in Google

Rafael Valle

This author has not been identified. Look up 'Rafael Valle' in Google

Sungroh Yoon

This author has not been identified. Look up 'Sungroh Yoon' in Google

Bryan Catanzaro

This author has not been identified. Look up 'Bryan Catanzaro' in Google