Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations

Dan Oneata, Horia Cucu. Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations. In IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops 2022, New Orleans, LA, USA, June 19-20, 2022. pages 4578-4587, IEEE, 2022.
