On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models

Jinchuan Tian, Yifan Peng, William Chen, KwangHee Choi, Karen Livescu, Shinji Watanabe. On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models. In Itshak Lapidot, Sharon Gannot, editors, 25th Annual Conference of the International Speech Communication Association, Interspeech 2024, Kos, Greece, September 1-5, 2024. ISCA, 2024.

@inproceedings{TianPCCL024,
  title = {On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models},
  author = {Jinchuan Tian and Yifan Peng and William Chen and KwangHee Choi and Karen Livescu and Shinji Watanabe},
  year = {2024},
  doi = {10.21437/Interspeech.2024-1938},
  url = {https://doi.org/10.21437/Interspeech.2024-1938},
  researchr = {https://researchr.org/publication/TianPCCL024},
  booktitle = {25th Annual Conference of the International Speech Communication Association, Interspeech 2024, Kos, Greece, September 1-5, 2024},
  editor = {Itshak Lapidot and Sharon Gannot},
  publisher = {ISCA},
}