On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models

Jinchuan Tian, Yifan Peng, William Chen, KwangHee Choi, Karen Livescu, Shinji Watanabe 0001. On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models. In Itshak Lapidot, Sharon Gannot, editors, 25th Annual Conference of the International Speech Communication Association, Interspeech 2024, Kos, Greece, September 1-5, 2024. ISCA, 2024. [doi]

Authors

Jinchuan Tian

This author has not been identified. Look up 'Jinchuan Tian' in Google

Yifan Peng

This author has not been identified. Look up 'Yifan Peng' in Google

William Chen

This author has not been identified. Look up 'William Chen' in Google

KwangHee Choi

This author has not been identified. Look up 'KwangHee Choi' in Google

Karen Livescu

This author has not been identified. Look up 'Karen Livescu' in Google

Shinji Watanabe 0001

This author has not been identified. Look up 'Shinji Watanabe 0001' in Google