Tran The Anh, Azmat Adnan, Yihao Wu, Chng Eng Siong. Ts-Vad+: Modularized Target-Speaker Voice Activity Detection for Robust Speaker Diarization. In Asia Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2025, Singapore, October 22-24, 2025. pages 630-635, IEEE, 2025. [doi]
Abstract is missing.