Vision Transformer Segmentation for Visual Bird Sound Denoising

Sahil Kumar, Jialu Li 0004, Youshan Zhang. Vision Transformer Segmentation for Visual Bird Sound Denoising. In Itshak Lapidot, Sharon Gannot, editors, 25th Annual Conference of the International Speech Communication Association, Interspeech 2024, Kos, Greece, September 1-5, 2024. ISCA, 2024. [doi]

Possibly Related Publications

The following publications are possibly variants of this publication: