MIVC: Multiple Instance Visual Component for Visual-Language Models

Wenyi Wu, Qi Li, Wenliang Zhong, JunZhou Huang. MIVC: Multiple Instance Visual Component for Visual-Language Models. In IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2024, Waikoloa, HI, USA, January 3-8, 2024, pages 8102-8111. IEEE, 2024.

Abstract

Abstract is missing.