Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models

Xin Li, Yunfei Wu, Xinghua Jiang, Zhihao Guo, Mingming Gong, Haoyu Cao, Yinsong Liu, Deqiang Jiang, Xing Sun. Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, WA, USA, June 16-22, 2024. pages 15546-15555, IEEE, 2024. [doi]

Authors

Xin Li

This author has not been identified. Look up 'Xin Li' in Google

Yunfei Wu

This author has not been identified. Look up 'Yunfei Wu' in Google

Xinghua Jiang

This author has not been identified. Look up 'Xinghua Jiang' in Google

Zhihao Guo

This author has not been identified. Look up 'Zhihao Guo' in Google

Mingming Gong

This author has not been identified. Look up 'Mingming Gong' in Google

Haoyu Cao

This author has not been identified. Look up 'Haoyu Cao' in Google

Yinsong Liu

This author has not been identified. Look up 'Yinsong Liu' in Google

Deqiang Jiang

This author has not been identified. Look up 'Deqiang Jiang' in Google

Xing Sun

This author has not been identified. Look up 'Xing Sun' in Google