VLCDoC: Vision-Language contrastive pre-training model for cross-Modal document classification

Souhail Bakkali, Zuheng Ming, Mickaël Coustaty, Marçal Rusiñol, Oriol Ramos Terrades. VLCDoC: Vision-Language contrastive pre-training model for cross-Modal document classification. Pattern Recognition, 139:109419, July 2023. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.