Cheng Luo, Yiguang Liu, Wenhui Sun, Zhoujian Sun. Multi-Modality Speech Recognition Driven by Background Visual Scenes. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2024, Seoul, Republic of Korea, April 14-19, 2024. pages 10926-10930, IEEE, 2024. [doi]
Abstract is missing.