ClipCrop: Conditioned Cropping Driven by Vision-Language Model

Zhihang Zhong, Mingxi Cheng, Zhirong Wu, Yuhui Yuan, Yinqiang Zheng, Ji Li, Han Hu, Stephen Lin, Yoichi Sato, Imari Sato. ClipCrop: Conditioned Cropping Driven by Vision-Language Model. In IEEE/CVF International Conference on Computer Vision, ICCV 2023 - Workshops, Paris, France, October 2-6, 2023, pages 294-304. IEEE, 2023.
