Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval - researchr publication

researchr

You are not signed in
Sign in
Sign up

Mustafa Shukor, Nicolas Thome, Matthieu Cord. Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval. Computer Vision and Image Understanding, 247:104071, 2024. [doi]

Abstract is missing.

runs on WebDSL