Hierarchical cross-modal contextual attention network for visual grounding

Xin Xu, Gang Lv, Yining Sun, Yuxia Hu, Fudong Nian. Hierarchical cross-modal contextual attention network for visual grounding. Multimedia Syst., 29(4):2073-2083, August 2023. [doi]

Abstract

Abstract is missing.