Aligning Where to See and What to Tell: Image Captioning with Region-Based Attention and Scene-Specific Contexts

Kun Fu, Junqi Jin, Runpeng Cui, Fei Sha, Changshui Zhang. Aligning Where to See and What to Tell: Image Captioning with Region-Based Attention and Scene-Specific Contexts. IEEE Trans. Pattern Anal. Mach. Intell., 39(12):2321-2334, 2017. [doi]

Abstract

Abstract is missing.