VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation

Shoubin Yu, Difan Liu, Ziqiao Ma, Yicong Hong, Yang Zhou, Hao Tan, Joyce Chai, Mohit Bansal. VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation. In IEEE/CVF International Conference on Computer Vision, ICCV 2025, Honolulu, HI, USA, October 19-25, 2025. Pages 15147-15158, IEEE, 2025.
