- Interactively Learning Visually Grounded Word Meanings from a Human Tutor. Yanchao Yu, Arash Eshghi, Oliver Lemon.
- "Look, some Green Circles!": Learning to Quantify from Images. Ionut Sorodoc, Angeliki Lazaridou, Gemma Boleda, Aurélie Herbelot, Sandro Pezzelle, Raffaella Bernardi.
- Building a Bagpipe with a Bag and a Pipe: Exploring Conceptual Combination in Vision. Sandro Pezzelle, Ravi Shekhar, Raffaella Bernardi.
- Detecting Visually Relevant Sentences for Fine-Grained Classification. Olivia Winn, Madhavan Kavanur Kidambi, Smaranda Muresan.
- Leveraging Captions in the Wild to Improve Object Detection. Mert Kilickaya, Nazli Ikizler-Cinbis, Erkut Erdem, Aykut Erdem.
- Focused Evaluation for Image Description with Binary Forced-Choice Tasks. Micah Hodosh, Julia Hockenmaier.
- Text2voronoi: An Image-driven Approach to Differential Diagnosis. Alexander Mehler, Tolga Uslu, Wahed Hemati.
- Multi30K: Multilingual English-German Image Descriptions. Desmond Elliott, Stella Frank, Khalil Sima'an, Lucia Specia.
- Natural Language Descriptions of Human Activities Scenes: Corpus Generation and Analysis. Nouf Al Harbi, Yoshihiko Gotoh.
- Pragmatic Factors in Image Description: The Case of Negations. Emiel van Miltenburg, Roser Morante, Desmond Elliott.
- Combining Lexical and Spatial Knowledge to Predict Spatial Relations between Objects in Images. Manuela Hürlimann, Johan Bos.
- Exploring Different Preposition Sets, Models and Feature Sets in Automatic Generation of Spatial Image Descriptions. Anja Belz, Adrian Muscat, Brandon Birmingham.
- Automatic Annotation of Structured Facts in Images. Mohamed Elhoseiny, Scott Cohen, Walter Chang, Brian L. Price, Ahmed M. Elgammal.