Ziyu Ma, Chenhui Gou, Hengcan Shi, Bin Sun, Shutao Li, Hamid Rezatofighi, Jianfei Cai. DrVideo: Document Retrieval Based Long Video Understanding. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025, Nashville, TN, USA, June 11-15, 2025, pages 18936-18946. Computer Vision Foundation / IEEE, 2025.