VideoBERT: A Joint Model for Video and Language Representation Learning

Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy 0002, Cordelia Schmid. VideoBERT: A Joint Model for Video and Language Representation Learning. In 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Korea (South), October 27 - November 2, 2019. pages 7463-7472, IEEE, 2019. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.