VideoBERT: A Joint Model for Video and Language Representation Learning

Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy 0002, Cordelia Schmid. VideoBERT: A Joint Model for Video and Language Representation Learning. In 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Korea (South), October 27 - November 2, 2019. pages 7463-7472, IEEE, 2019. [doi]

Authors

Chen Sun

This author has not been identified. Look up 'Chen Sun' in Google

Austin Myers

This author has not been identified. Look up 'Austin Myers' in Google

Carl Vondrick

This author has not been identified. Look up 'Carl Vondrick' in Google

Kevin Murphy 0002

This author has not been identified. Look up 'Kevin Murphy 0002' in Google

Cordelia Schmid

This author has not been identified. Look up 'Cordelia Schmid' in Google