VLCDoC: Vision-Language contrastive pre-training model for cross-Modal document classification

Souhail Bakkali, Zuheng Ming, Mickaël Coustaty, Marçal Rusiñol, Oriol Ramos Terrades. VLCDoC: Vision-Language contrastive pre-training model for cross-Modal document classification. Pattern Recognition, 139:109419, July 2023. [doi]

@article{BakkaliMCRT23,
  title = {VLCDoC: Vision-Language contrastive pre-training model for cross-Modal document classification},
  author = {Souhail Bakkali and Zuheng Ming and Mickaël Coustaty and Marçal Rusiñol and Oriol Ramos Terrades},
  year = {2023},
  month = {July},
  doi = {10.1016/j.patcog.2023.109419},
  url = {https://doi.org/10.1016/j.patcog.2023.109419},
  researchr = {https://researchr.org/publication/BakkaliMCRT23},
  cites = {0},
  citedby = {0},
  journal = {Pattern Recognition},
  volume = {139},
  pages = {109419},
}