EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone

Shraman Pramanick, Yale Song, Sayan Nag, Kevin Qinghong Lin, Hardik Shah, Mike Zheng Shou, Rama Chellappa, Pengchuan Zhang. EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone. In IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, October 1-6, 2023. pages 5262-5274, IEEE, 2023. [doi]

Authors

Shraman Pramanick

This author has not been identified. Look up 'Shraman Pramanick' in Google

Yale Song

This author has not been identified. Look up 'Yale Song' in Google

Sayan Nag

This author has not been identified. Look up 'Sayan Nag' in Google

Kevin Qinghong Lin

This author has not been identified. Look up 'Kevin Qinghong Lin' in Google

Hardik Shah

This author has not been identified. Look up 'Hardik Shah' in Google

Mike Zheng Shou

This author has not been identified. Look up 'Mike Zheng Shou' in Google

Rama Chellappa

This author has not been identified. Look up 'Rama Chellappa' in Google

Pengchuan Zhang

This author has not been identified. Look up 'Pengchuan Zhang' in Google