Video Referring Expression Comprehension via Transformer with Content-conditioned Query

Jiang Ji, Meng Cao, Tengtao Song, Long Chen, Yi Wang, Yuexian Zou. Video Referring Expression Comprehension via Transformer with Content-conditioned Query. In Wei Ji 0008, Yinwei Wei, Zhedong Zheng, Hao Fei 0001, Tat-Seng Chua, editors, Proceedings of the 1st International Workshop on Deep Multimodal Learning for Information Retrieval, MMIR 2023, Ottawa ON, Canada, 2 November 2023. pages 39-48, ACM, 2023. [doi]

Abstract

Abstract is missing.