Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper

Zijian Fan, Xinwei Cao, Giampiero Salvi, Torbjørn Svendsen. Towards Better Recognition of Spontaneous Children's Speech: Speaker-Clustering Fine-Tuning of Whisper. In 34th IEEE International Workshop on Machine Learning for Signal Processing, MLSP 2024, London, UK, September 22-25, 2024. pages 1-6, IEEE, 2024. [doi]

Authors

Zijian Fan

This author has not been identified. Look up 'Zijian Fan' in Google

Xinwei Cao

This author has not been identified. Look up 'Xinwei Cao' in Google

Giampiero Salvi

This author has not been identified. Look up 'Giampiero Salvi' in Google

Torbjørn Svendsen

This author has not been identified. Look up 'Torbjørn Svendsen' in Google