Ziyang Chen, Prem Seetharaman, Bryan C. Russell, Oriol Nieto, David Bourgin, Andrew Owens, Justin Salamon. Video-Guided Foley Sound Generation with Multimodal Controls. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025, Nashville, TN, USA, June 11-15, 2025. pages 18770-18781, Computer Vision Foundation / IEEE, 2025. [doi]
Abstract is missing.