Telling the What while Pointing to the Where: Multimodal Queries for Image Retrieval

Soravit Changpinyo, Jordi Pont-Tuset, Vittorio Ferrari, Radu Soricut. Telling the What while Pointing to the Where: Multimodal Queries for Image Retrieval. In 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021, Montreal, QC, Canada, October 10-17, 2021. pages 12116-12126, IEEE, 2021. [doi]

Abstract

Abstract is missing.