End-to-end Generative Pretraining for Multimodal Video Captioning


Paul Hongsuck Seo, Arsha Nagrani, Anurag Arnab, Cordelia Schmid. End-to-end Generative Pretraining for Multimodal Video Captioning. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, New Orleans, LA, USA, June 18-24, 2022. Pages 17938-17947, IEEE, 2022.

