Segmentation of TV shows into scenes using speaker diarization and speech recognition

Hervé Bredin. Segmentation of TV shows into scenes using speaker diarization and speech recognition. In 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2012, Kyoto, Japan, March 25-30, 2012. pages 2377-2380, IEEE, 2012. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.