UNITER: UNiversal Image-TExt Representation Learning

Yen-Chun Chen 0001, Linjie Li, Licheng Yu, Ahmed El Kholy, Faisal Ahmed 0001, Zhe Gan, Yu Cheng 0001, Jingjing Liu 0001. UNITER: UNiversal Image-TExt Representation Learning. In Andrea Vedaldi, Horst Bischof, Thomas Brox, Jan-Michael Frahm, editors, Computer Vision - ECCV 2020 - 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part XXX. Volume 12375 of Lecture Notes in Computer Science, pages 104-120, Springer, 2020. [doi]

Abstract

Abstract is missing.