Image-Text Alignment using Adaptive Cross-attention with Transformer Encoder for Scene Graphs

Juyong Song, Sunghyun Choi 0001. Image-Text Alignment using Adaptive Cross-attention with Transformer Encoder for Scene Graphs. In 32nd British Machine Vision Conference 2021, BMVC 2021, Online, November 22-25, 2021. pages 343, BMVA Press, 2021. [doi]

Abstract

Abstract is missing.