End-To-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings

Soumi Maiti, Hakan Erdogan, Kevin W. Wilson, Scott Wisdom, Shinji Watanabe, John R. Hershey. End-To-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings. In IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2021, Toronto, ON, Canada, June 6-11, 2021, pages 7183-7187. IEEE, 2021.

Authors

Soumi Maiti

Hakan Erdogan

Kevin W. Wilson

Scott Wisdom

Shinji Watanabe

John R. Hershey