VLCDoC: Vision-Language contrastive pre-training model for cross-Modal document classification

Souhail Bakkali, Zuheng Ming, Mickaël Coustaty, Marçal Rusiñol, Oriol Ramos Terrades. VLCDoC: Vision-Language contrastive pre-training model for cross-Modal document classification. Pattern Recognition, 139:109419, July 2023. [doi]

Authors

Souhail Bakkali

This author has not been identified. Look up 'Souhail Bakkali' in Google

Zuheng Ming

This author has not been identified. Look up 'Zuheng Ming' in Google

Mickaël Coustaty

This author has not been identified. Look up 'Mickaël Coustaty' in Google

Marçal Rusiñol

This author has not been identified. Look up 'Marçal Rusiñol' in Google

Oriol Ramos Terrades

This author has not been identified. Look up 'Oriol Ramos Terrades' in Google