Parrot Captions Teach CLIP to Spot Text

Yiqi Lin, Conghui He, Alex Jinpeng Wang, Bin Wang 0065, Weijia Li, Mike Zheng Shou. Parrot Captions Teach CLIP to Spot Text. In Ales Leonardis, Elisa Ricci 0001, Stefan Roth 0001, Olga Russakovsky, Torsten Sattler, Gül Varol, editors, Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XLII. Volume 15100 of Lecture Notes in Computer Science, pages 368-385, Springer, 2024. [doi]

Abstract

Abstract is missing.