Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data

Yifan Peng, Jinchuan Tian, Brian Yan, Dan Berrebbi, Xuankai Chang, Xinjian Li, Jiatong Shi, Siddhant Arora, William Chen, Roshan S. Sharma, Wangyou Zhang, Yui Sudo, Muhammad Shakeel 0001, Jee-weon Jung, Soumi Maiti, Shinji Watanabe 0001. Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data. In IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023, Taipei, Taiwan, December 16-20, 2023. pages 1-8, IEEE, 2023. [doi]

Authors

Yifan Peng

This author has not been identified. Look up 'Yifan Peng' in Google

Jinchuan Tian

This author has not been identified. Look up 'Jinchuan Tian' in Google

Brian Yan

This author has not been identified. Look up 'Brian Yan' in Google

Dan Berrebbi

This author has not been identified. Look up 'Dan Berrebbi' in Google

Xuankai Chang

This author has not been identified. Look up 'Xuankai Chang' in Google

Xinjian Li

This author has not been identified. Look up 'Xinjian Li' in Google

Jiatong Shi

This author has not been identified. Look up 'Jiatong Shi' in Google

Siddhant Arora

This author has not been identified. Look up 'Siddhant Arora' in Google

William Chen

This author has not been identified. Look up 'William Chen' in Google

Roshan S. Sharma

This author has not been identified. Look up 'Roshan S. Sharma' in Google

Wangyou Zhang

This author has not been identified. Look up 'Wangyou Zhang' in Google

Yui Sudo

This author has not been identified. Look up 'Yui Sudo' in Google

Muhammad Shakeel 0001

This author has not been identified. Look up 'Muhammad Shakeel 0001' in Google

Jee-weon Jung

This author has not been identified. Look up 'Jee-weon Jung' in Google

Soumi Maiti

This author has not been identified. Look up 'Soumi Maiti' in Google

Shinji Watanabe 0001

This author has not been identified. Look up 'Shinji Watanabe 0001' in Google