LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision

Zhijian Liu, Simon Stent, Jie Li, John Gideon, Song Han. LocTex: Learning Data-Efficient Visual Representations from Localized Textual Supervision. In 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021, pages 2147-2156. IEEE, 2021.

Abstract

Abstract is missing.