Without detection: Two-step clustering features with local-global attention for image captioning

Xuan Li, Wenkai Zhang, Xian Sun, Xin Gao. Without detection: Two-step clustering features with local-global attention for image captioning. IET Computer Vision, 16(3):280-294, 2022. [doi]

Abstract

Abstract is missing.