Sahil Kumar, Jialu Li 0004, Youshan Zhang. Vision Transformer Segmentation for Visual Bird Sound Denoising. In Itshak Lapidot, Sharon Gannot, editors, 25th Annual Conference of the International Speech Communication Association, Interspeech 2024, Kos, Greece, September 1-5, 2024. ISCA, 2024. [doi]
Abstract is missing.