DrVideo: Document Retrieval Based Long Video Understanding

Ziyu Ma, Chenhui Gou, Hengcan Shi, Bin Sun 0001, Shutao Li, Hamid Rezatofighi, Jianfei Cai 0001. DrVideo: Document Retrieval Based Long Video Understanding. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025, Nashville, TN, USA, June 11-15, 2025. pages 18936-18946, Computer Vision Foundation / IEEE, 2025. [doi]

Authors

Ziyu Ma

This author has not been identified. Look up 'Ziyu Ma' in Google

Chenhui Gou

This author has not been identified. Look up 'Chenhui Gou' in Google

Hengcan Shi

This author has not been identified. Look up 'Hengcan Shi' in Google

Bin Sun 0001

This author has not been identified. Look up 'Bin Sun 0001' in Google

Shutao Li

This author has not been identified. Look up 'Shutao Li' in Google

Hamid Rezatofighi

This author has not been identified. Look up 'Hamid Rezatofighi' in Google

Jianfei Cai 0001

This author has not been identified. Look up 'Jianfei Cai 0001' in Google