Extending Pre-trained ASR Models to Cross-Modal and Cross-Lingual Speech-Text Retrieval

YuKai Li, Xiaohang Li, Xu Cao, Yuan Si, Jing Li, Ju Liu. Extending Pre-trained ASR Models to Cross-Modal and Cross-Lingual Speech-Text Retrieval. In Ying Tan 0002, Yuhui Shi 0001, editors, Advances in Swarm Intelligence - 16th International Conference on Swarm Intelligence, ICSI 2025, Yokohama, Japan, July 11-15, 2025, Proceedings, Part I. Volume 16011 of Lecture Notes in Computer Science, pages 205-216, Springer, 2025. [doi]

Authors

YuKai Li

This author has not been identified. Look up 'YuKai Li' in Google

Xiaohang Li

This author has not been identified. Look up 'Xiaohang Li' in Google

Xu Cao

This author has not been identified. Look up 'Xu Cao' in Google

Yuan Si

This author has not been identified. Look up 'Yuan Si' in Google

Jing Li

This author has not been identified. Look up 'Jing Li' in Google

Ju Liu

This author has not been identified. Look up 'Ju Liu' in Google