Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments

Sara Papi, Peidong Wang, Junkun Chen, Jian Xue, Jinyu Li 0001, Yashesh Gaur. Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments. In IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023, Taipei, Taiwan, December 16-20, 2023. pages 1-8, IEEE, 2023. [doi]

Authors

Sara Papi

This author has not been identified. Look up 'Sara Papi' in Google

Peidong Wang

This author has not been identified. Look up 'Peidong Wang' in Google

Junkun Chen

This author has not been identified. Look up 'Junkun Chen' in Google

Jian Xue

This author has not been identified. Look up 'Jian Xue' in Google

Jinyu Li 0001

This author has not been identified. Look up 'Jinyu Li 0001' in Google

Yashesh Gaur

This author has not been identified. Look up 'Yashesh Gaur' in Google