Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation - researchr publication authors

researchr

You are not signed in
Sign in
Sign up

Yuying Ge, Yizhuo Li 0001, Yixiao Ge, Ying Shan. Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025, Nashville, TN, USA, June 11-15, 2025. pages 13606-13617, Computer Vision Foundation / IEEE, 2025. [doi]

This author has not been identified. Look up 'Yuying Ge' in GoogleThis author has not been identified. Look up 'Yizhuo Li 0001' in GoogleThis author has not been identified. Look up 'Yixiao Ge' in GoogleThis author has not been identified. Look up 'Ying Shan' in Google

runs on WebDSL