Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

Peter Anderson, Xiaodong He, Chris Buehler, Damien Teney, Mark Johnson, Stephen Gould, Lei Zhang. Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, June 18-22, 2018, pages 6077-6086. IEEE Computer Society, 2018.