Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval

Mustafa Shukor, Nicolas Thome, Matthieu Cord. Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval. Computer Vision and Image Understanding, 247:104071, 2024. [doi]

Abstract

Abstract is missing.