Augmenting Vision-Language Retrieval: The Role of Multimodal LLMs as Synthetic Data Generators

Aidan Bell, James Gore, Behrooz Mansouri. Augmenting Vision-Language Retrieval: The Role of Multimodal LLMs as Synthetic Data Generators. In Nicola Ferro 0001, Maria Maistro, Gabriella Pasi, Omar Alonso, Andrew Trotman, Suzan Verberne, editors, Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2025, Padua, Italy, July 13-18, 2025. pages 3050-3054, ACM, 2025. [doi]

Abstract

Abstract is missing.