ProVLA: Compositional Image Search with Progressive Vision-Language Alignment and Multimodal Fusion

Zhizhang Hu, Xinliang Zhu, Son Tran, René Vidal, Arnab Dhua. ProVLA: Compositional Image Search with Progressive Vision-Language Alignment and Multimodal Fusion. In IEEE/CVF International Conference on Computer Vision, ICCV 2023 - Workshops, Paris, France, October 2-6, 2023. pages 2764-2769, IEEE, 2023. [doi]

Authors

Zhizhang Hu

This author has not been identified. Look up 'Zhizhang Hu' in Google

Xinliang Zhu

This author has not been identified. Look up 'Xinliang Zhu' in Google

Son Tran

This author has not been identified. Look up 'Son Tran' in Google

René Vidal

This author has not been identified. Look up 'René Vidal' in Google

Arnab Dhua

This author has not been identified. Look up 'Arnab Dhua' in Google