- Exploiting Scene Context for Image Captioning. Rakshith Shetty, Hamed Rezazadegan Tavakoli, Jorma Laaksonen. 1-8
- News Event Understanding by Mining Latent Factors From Multimodal Tensors. Chun-Yu Tsai, Ruilin Xu, Robert E. Colgan, John R. Kender. 9-16
- Cross-modal Classification by Completing Unimodal Representations. Thi Quynh Nhi Tran, Hervé Le Borgne, Michel Crucianu. 17-25
- Semantic Indexing of Wearable Camera Images: Kids'Cam Concepts. Alan F. Smeaton, Kevin McGuinness, Cathal Gurrin, Jiang Zhou, Noel E. O'Connor, Peng Wang, Brian Davis, Lucas Azevedo, André Freitas, Louise Signal, Moira Smith, James Stanley, Michelle Barr, Tim Chambers, Cliona Ní Mhurchu. 27-34
- Jointly Representing Images and Text: Dependency Graphs, Word Senses, and Multimodal Embeddings. Frank Keller. 35-36
- Multimodal and Crossmodal Representation Learning from Textual and Visual Features with Bidirectional Deep Neural Networks for Video Hyperlinking. Vedran Vukotic, Christian Raymond, Guillaume Gravier. 37-44
- User Video Summarization Based on Joint Visual and Semantic Affinity Graph. Zhuo Lei, Ke Sun, Qian Zhang, Guoping Qiu. 45-52
- Disinformation in Multimedia Annotation: Misleading Metadata Detection on YouTube. Payal Bajaj, Mridul Kavidayal, Priyanshu Srivastava, Md Nadeem Akhtar, Ponnurangam Kumaraguru. 53-61
- Beyond Language and Vision, Towards Truly Multimedia Integration. Tat-Seng Chua. 63