Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding

Wujian Peng, Sicheng Xie, Zuyao You, Shiyi Lan, Zuxuan Wu. Synthesize, Diagnose, and Optimize: Towards Fine-Grained Vision-Language Understanding. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. pages 13279-13288, IEEE, 2024. [doi]

Abstract

Abstract is missing.