Mengyu Yang, Yiming Chen, Haozheng Pei, Siddhant Agarwal, Arun Balajee Vasudevan, James Hays. Clink! Chop! Thud! - Learning Object Sounds From Real-World Interactions. In IEEE/CVF International Conference on Computer Vision, ICCV 2025, Honolulu, HI, USA, October 19-25, 2025. pages 14549-14558, IEEE, 2025. [doi]
Abstract is missing.