Aggregating Frame-Level Information in the Spectral Domain With Self-Attention for Speaker Embedding

Youzhi Tu, Man-Wai Mak. Aggregating Frame-Level Information in the Spectral Domain With Self-Attention for Speaker Embedding. IEEE Transactions on Audio, Speech & Language Processing, 30:944-957, 2022. [doi]

Abstract

Abstract is missing.