Extending Pre-trained ASR Models to Cross-Modal and Cross-Lingual Speech-Text Retrieval

YuKai Li, Xiaohang Li, Xu Cao, Yuan Si, Jing Li, Ju Liu. Extending Pre-trained ASR Models to Cross-Modal and Cross-Lingual Speech-Text Retrieval. In Ying Tan 0002, Yuhui Shi 0001, editors, Advances in Swarm Intelligence - 16th International Conference on Swarm Intelligence, ICSI 2025, Yokohama, Japan, July 11-15, 2025, Proceedings, Part I. Volume 16011 of Lecture Notes in Computer Science, pages 205-216, Springer, 2025. [doi]

Abstract

Abstract is missing.