Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper

Zijian Fan, Xinwei Cao, Giampiero Salvi, Torbjørn Svendsen. Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper. In 34th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2024, London, UK, September 22-25, 2024. Pages 1-6, IEEE, 2024. doi:10.1109/MLSP58920.2024.10734799

@inproceedings{FanCSS24,
  title = {Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper},
  author = {Zijian Fan and Xinwei Cao and Giampiero Salvi and Torbjørn Svendsen},
  year = {2024},
  doi = {10.1109/MLSP58920.2024.10734799},
  url = {https://doi.org/10.1109/MLSP58920.2024.10734799},
  researchr = {https://researchr.org/publication/FanCSS24},
  pages = {1--6},
  booktitle = {34th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2024, London, UK, September 22-25, 2024},
  publisher = {IEEE},
  isbn = {979-8-3503-7225-0},
}