The following publications are possibly variants of this publication:
- TransVG++: End-to-End Visual Grounding With Language Conditioned Vision TransformerJiajun Deng, Zhengyuan Yang, Daqing Liu, Tianlang Chen, Wengang Zhou, Yanyong Zhang, Houqiang Li, Wanli Ouyang. pami, 45(11):13636-13652, November 2023. [doi]
- SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual GroundingMengxue Qu, Yu Wu 0011, Wu Liu, Qiqi Gong, Xiaodan Liang, Olga Russakovsky, Yao Zhao 0001, Yunchao Wei. eccv 2022: 546-562 [doi]