SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training

Ziqiang Zhang, Long Zhou, Junyi Ao, Shujie Liu 0001, Lirong Dai, Jinyu Li 0001, Furu Wei. SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training. In Yoav Goldberg, Zornitsa Kozareva, Yue Zhang, editors, Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022, Abu Dhabi, United Arab Emirates, December 7-11. pages 1663-1676, Association for Computational Linguistics, 2022. [doi]

Authors

Ziqiang Zhang

This author has not been identified. Look up 'Ziqiang Zhang' in Google

Long Zhou

This author has not been identified. Look up 'Long Zhou' in Google

Junyi Ao

This author has not been identified. Look up 'Junyi Ao' in Google

Shujie Liu 0001

This author has not been identified. Look up 'Shujie Liu 0001' in Google

Lirong Dai

This author has not been identified. Look up 'Lirong Dai' in Google

Jinyu Li 0001

This author has not been identified. Look up 'Jinyu Li 0001' in Google

Furu Wei

This author has not been identified. Look up 'Furu Wei' in Google