Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation

Renjie Zheng, Junkun Chen, Mingbo Ma, Liang Huang 0001. Fused Acoustic and Text Encoding for Multimodal Bilingual Pretraining and Speech Translation. In Marina Meila, Tong Zhang 0001, editors, Proceedings of the 38th International Conference on Machine Learning, ICML 2021, 18-24 July 2021, Virtual Event. Volume 139 of Proceedings of Machine Learning Research, pages 12736-12746, PMLR, 2021. [doi]

Abstract

Abstract is missing.