- The BURCHAK corpus: a Challenge Data Set for Interactive Learning of Visually Grounded Word Meanings. Yanchao Yu, Arash Eshghi, Gregory Mills, Oliver Lemon. 1-10
- The Use of Object Labels and Spatial Prepositions as Keywords in a Web-Retrieval-Based Image Caption Generation System. Brandon Birmingham, Adrian Muscat. 11-20
- Learning to Recognize Animals by Watching Documentaries: Using Subtitles as Weak Supervision. Aparna Nurani Venkitasubramanian, Tinne Tuytelaars, Marie-Francine Moens. 21-30
- Human Evaluation of Multi-modal Neural Machine Translation: A Case-Study on E-Commerce Listing Titles. Iacer Calixto, Daniel Stein, Evgeny Matusov, Sheila Castilho, Andy Way. 31-37
- The BreakingNews Dataset. Arnau Ramisa, Fei Yan, Francesc Moreno-Noguer, Krystian Mikolajczyk. 38-39
- Automatic identification of head movements in video-recorded conversations: can words help? Patrizia Paggio, Costanza Navarretta, Bart Jongejan. 40-42
- Multi-Modal Fashion Product Retrieval. Antonio Rubio, LongLong Yu, Edgar Simo-Serra, Francesc Moreno-Noguer. 43-45