Zijian Fan, Xinwei Cao, Giampiero Salvi, Torbjørn Svendsen. Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper. In 34th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2024, London, UK, September 22-25, 2024. pages 1-6, IEEE, 2024.