The following publications are possibly variants of this publication:
- SSLNet: A network for cross-modal sound source localization in visual scenesFan Feng, Yue Ming 0001, Nannan Hu. ijon, 500:1052-1062, 2022. [doi]
- Learning to Localize Sound Source in Visual ScenesArda Senocak, Tae Hyun Oh, Jun-Sik Kim, Ming-Hsuan Yang 0001, In-So Kweon. cvpr 2018: 4358-4366 [doi]
- Learning to Localize Sound Sources in Visual Scenes: Analysis and ApplicationsArda Senocak, Tae Hyun Oh, Junsik Kim 0001, Ming-Hsuan Yang 0001, In-So Kweon. pami, 43(5):1605-1619, 2021. [doi]
- Visual to Sound: Generating Natural Sound for Videos in the WildYipin Zhou, Zhaowen Wang, Chen Fang, Trung Bui, Tamara L. Berg. cvpr 2018: 2500-2503 [doi]
- Visual to Sound: Generating Natural Sound for Videos in the WildYipin Zhou, Zhaowen Wang, Chen Fang, Trung Bui, Tamara L. Berg. cvpr 2018: 3550-3558 [doi]
- Sound Source Localization is All about Cross-Modal AlignmentArda Senocak, Hyeonggon Ryu, Junsik Kim 0001, Tae Hyun Oh, Hanspeter Pfister, Joon Son Chung. iccv 2023: 7743-7753 [doi]
- Localizing Visual Sounds the Easy WayShentong Mo, Pedro Morgado 0001. eccv 2022: 218-234 [doi]