Speech-Text Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment

Tianshu Yu, Haoyu Gao, Ting-En Lin, Min Yang, Yuchuan Wu, Wentao Ma, Chao Wang, Fei Huang, Yongbin Li. Speech-Text Pre-training for Spoken Dialog Understanding with Explicit Cross-Modal Alignment. In Anna Rogers, Jordan L. Boyd-Graber, Naoaki Okazaki, editors, Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023. pages 7900-7913, Association for Computational Linguistics, 2023. [doi]

Authors

Tianshu Yu

This author has not been identified. Look up 'Tianshu Yu' in Google

Haoyu Gao

This author has not been identified. Look up 'Haoyu Gao' in Google

Ting-En Lin

This author has not been identified. Look up 'Ting-En Lin' in Google

Min Yang

This author has not been identified. Look up 'Min Yang' in Google

Yuchuan Wu

This author has not been identified. Look up 'Yuchuan Wu' in Google

Wentao Ma

This author has not been identified. Look up 'Wentao Ma' in Google

Chao Wang

This author has not been identified. Look up 'Chao Wang' in Google

Fei Huang

This author has not been identified. Look up 'Fei Huang' in Google

Yongbin Li

This author has not been identified. Look up 'Yongbin Li' in Google