Joint Global-Local Frames Modeling to enhance semantic alignment for zero-shot long video editing

Zewen Yu, Pengchong Qiao, Jie Chen, Xiaoqin Zhang. Joint Global-Local Frames Modeling to enhance semantic alignment for zero-shot long video editing. Neurocomputing, 650:130836, 2025.
