Abstract is missing.
- Keynote: Towards Explainability in AI and Multimedia ResearchTat-Seng Chua. 1 [doi]
- Interactive Video Retrieval in the Age of Deep LearningJakub Lokoc, Klaus Schoeffmann, Werner Bailer, Luca Rossetto, Cathal Gurrin. 2-4 [doi]
- Similarity Search in 3D Human Motion DataJan Sedmidubský, Pavel Zezula. 5-6 [doi]
- A Geographical-Temporal Awareness Hierarchical Attention Network for Next Point-of-Interest RecommendationTongcun Liu, Jianxin Liao, Zhigen Wu, Yulong Wang, Jingyu Wang. 7-15 [doi]
- The Focus-Aspect-Value Model for Explainable Prediction of Subjective Visual InterpretationTushar Karayil, Philipp Blandfort, Jörn Hees, Andreas Dengel. 16-24 [doi]
- Context-Aware Embeddings for Automatic Art AnalysisNoa Garcia, Benjamin Renoust, Yuta Nakashima. 25-33 [doi]
- Methods of Multi-Modal Data ExplorationTomás Grosup. 34-37 [doi]
- Benchmarking Search and Annotation in Continuous Human Skeleton SequencesJan Sedmidubský, Petr Elias, Pavel Zezula. 38-42 [doi]
- A Genetic Programming Approach for Searching on Nearest Neighbors GraphsJavier Alvaro Vargas Muñoz, Zanoni Dias, Ricardo da Silva Torres. 43-47 [doi]
- Weakly Supervised Image Retrieval via Coarse-scale Feature Fusion and Multi-level Attention BlocksXinyao Nie, Hong Lu 0001, Zijian Wang, Jingyuan Liu, Zehua Guo. 48-52 [doi]
- Integrity Verification in Medical Image Retrieval Systems using Spread Spectrum SteganographyPeter U. Eze, Udaya Parampalli, Robin J. Evans, Dongxi Liu. 53-57 [doi]
- An Unsupervised Genetic Algorithm Framework for Rank Selection and Fusion on Image RetrievalLucas Pascotti Valem, Daniel Carlos Guimarães Pedronette. 58-62 [doi]
- Collaborating CNN and SVM for Automatic Image AnnotationZhixin Li, Lan Lin, Canlong Zhang, Huifang Ma, Weizhong Zhao. 63-67 [doi]
- Relationship Detection Based on Object Semantic Inference and Attention MechanismsLiang Zhang 0010, Shuai Zhang, Peiyi Shen, Guangming Zhu, Syed Afaq Ali Shah, Mohammed Bennamoun. 68-72 [doi]
- 3D Human Tracking with Catadioptric Omnidirectional CameraFakhreddine Ababsa, Hicham Hadj-Abdelkader, Marouane Boui. 73-77 [doi]
- Learning Task Relatedness in Multi-Task Learning for Images in ContextGjorgji Strezoski, Nanne van Noord, Marcel Worring. 78-86 [doi]
- High-Capacity Convolutional Video Steganography with Temporal Residual ModelingXinyu Weng, Yongzhi Li, Lu Chi, Yadong Mu. 87-95 [doi]
- Learning Discriminative Features for Image RetrievalYinghao Wang, Chen Chen, Jiong Wang, Yingying Zhu. 96-104 [doi]
- DeepMarks: A Secure Fingerprinting Framework for Digital Rights Management of Deep Learning ModelsHuili Chen, Bita Darvish Rouhani, Cheng Fu, Jishen Zhao, Farinaz Koushanfar. 105-113 [doi]
- Feature Pyramid HashingYifan Yang, Libing Geng, Hanjiang Lai, Yan Pan, Jian Yin. 114-122 [doi]
- Deep Policy Hashing Network with Listwise SupervisionShaoying Wang, Hanjiang Lai, Yifan Yang, Jian Yin. 123-131 [doi]
- RobustiQ: A Robust ANN Search Method for Billion-scale Similarity Search on GPUsWei Chen 0003, Jincai Chen, Fuhao Zou, Yuan-Fang Li, Ping Lu, Wei Zhao. 132-140 [doi]
- Triplet Fusion Network Hashing for Unpaired Cross-Modal RetrievalZhikai Hu, Xin Liu, Xingzhi Wang, Yiu-ming Cheung, Nannan Wang, Yewang Chen. 141-149 [doi]
- A Hierarchical Attentive Deep Neural Network Model for Semantic Music Annotation Integrating Multiple Music RepresentationsQianqian Wang, Feng Su, Yuyang Wang. 150-158 [doi]
- Adversary Guided Asymmetric Hashing for Cross-Modal RetrievalWen Gu, Xiaoyan Gu, Jingzi Gu, Bo Li, Zhi Xiong, Weiping Wang. 159-167 [doi]
- Understanding, Categorizing and Predicting Semantic Image-Text RelationsChristian Otto, Matthias Springstein, Avishek Anand, Ralph Ewerth. 168-176 [doi]
- VIRET: A Video Retrieval Tool for Interactive Known-item SearchJakub Lokoc, Gregor Kovalcík, Tomás Soucek, Jaroslav Moravec, Premysl Cech. 177-181 [doi]
- Self-Supervised Visual Representations for Cross-Modal RetrievalYash Patel, Lluís Gómez, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar. 182-186 [doi]
- Multimodal Dialog for Browsing Large Visual Catalogs using Exploration-Exploitation Paradigm in a Joint Embedding SpaceIndrani Bhattacharya, Arkabandhu Chowdhury, Vikas C. Raykar. 187-191 [doi]
- Unsupervised Rank-Preserving Hashing for Large-Scale Image RetrievalSvebor Karaman, Xudong Lin, Xuefeng Hu, Shih-Fu Chang. 192-196 [doi]
- PhonoNet: Multi-Stage Deep Neural Networks for Raga Identification in Hindustani Classical MusicSauhaarda Chowdhuri. 197-201 [doi]
- Hierarchical Variational Network for User-Diversified & Query-Focused Video SummarizationPin Jiang, Yahong Han. 202-206 [doi]
- Stacked Self-Attention Networks for Visual Question AnsweringQiang Sun, Yanwei Fu. 207-211 [doi]
- Joint Cluster Unary Loss for Efficient Cross-Modal HashingShifeng Zhang, Jianmin Li, Bo Zhang. 212-216 [doi]
- Cross-Modal Video Moment Retrieval with Spatial and Language-Temporal AttentionBin Jiang, Xin Huang, Chao Yang 0015, Junsong Yuan. 217-225 [doi]
- Deep Semantic Space with Intra-class Low-rank Constraint for Cross-modal RetrievalPeipei Kang, Zehang Lin, Zhenguo Yang, Xiaozhao Fang, Qing Li, Wenyin Liu. 226-234 [doi]
- Assist Users' Interactions in Font Search with Unexpected but Useful Concepts Generated by Multimodal LearningSaemi Choi, Shun Matsumura, Kiyoharu Aizawa. 235-243 [doi]
- Improving What Cross-Modal Retrieval Models Learn through Object-Oriented Inter- and Intra-Modal Attention NetworksPo-Yao Huang, Vaibhav, Xiaojun Chang, Alexander G. Hauptmann. 244-252 [doi]
- Deep Association: End-to-end Graph-Based Learning for Multiple Object Tracking with Conv-Graph Neural NetworkCong Ma, Yuan Li, Fan Yang, Ziwei Zhang, Yueqing Zhuang, Huizhu Jia, Xiaodong Xie. 253-261 [doi]
- Multi-shot Person Re-identification through Set Distance with Visual Distributional RepresentationTing-Yao Hu, Alexander G. Hauptmann. 262-270 [doi]
- Take Goods from Shelves: A Dataset for Class-Incremental Object DetectionYu Hao, Yanwei Fu, Yu-Gang Jiang. 271-278 [doi]
- Annotating Objects and Relations in User-Generated VideosXindi Shang, Donglin Di, Junbin Xiao, Yu Cao, Xun Yang, Tat-Seng Chua. 279-287 [doi]
- Towards Cloud Distributed Image Indexing by Sparse HashingAndré Mourão, João Magalhães. 288-296 [doi]
- Emotion Reinforced Visual StorytellingNanxing Li, Bei Liu, Zhizhong Han, Yu-Shen Liu, Jianlong Fu. 297-305 [doi]
- Who's Afraid of Adversarial Queries?: The Impact of Image Modifications on Content-based Image RetrievalZhuoran Liu, Zhengyu Zhao, Martha Larson. 306-314 [doi]
- RACKNet: Robust Allocation of Convolutional Kernels in Neural Networks for Image ClassificationYash Garg, K. Selçuk Candan. 315-323 [doi]
- A Benchmark of Visual Storytelling in Social MediaGonçalo Marcelino, David Semedo, André Mourão, Saverio G. Blasi, Marta Mrak, João Magalhães. 324-328 [doi]
- qwLSH: Cache-conscious Indexing for Processing Similarity Search Query Workloads in High-Dimensional SpacesOmid Jafari, John Ossorgin, Parth Nagarkar. 329-333 [doi]
- V3C1 Dataset: An Evaluation of Content CharacteristicsFabian Berns, Luca Rossetto, Klaus Schoeffmann, Christian Beecks, George Awad. 334-338 [doi]
- Increasingly Packing Multiple Facial-Informatics Modules in A Unified Deep-Learning Model via Lifelong LearningSteven C. Y. Hung, Jia-Hong Lee, Timmy S. T. Wan, Chien-Hung Chen, Yi-Ming Chan, Chu-Song Chen. 339-343 [doi]
- Cross-modal Collaborative Manifold Propagation for Image RecommendationMeng Jian, Ting Jia, Xun Yang, Lifang Wu, Lina Huo. 344-348 [doi]
- Progressive Image Enhancement under Aesthetic GuidanceXiaoyu Du, Xun Yang, Zhiguang Qin, Jinhui Tang. 349-353 [doi]
- Cross-Database Micro-Expression Recognition: A BenchmarkYuan Zong, Wenming Zheng, Xiaopeng Hong, Chuangao Tang, Zhen Cui, Guoying Zhao. 354-363 [doi]
- Naturalness Preserved Image Aesthetic Enhancement with Perceptual Encoder ConstraintLeida Li, Yuzhe Yang, Hancheng Zhu. 364-372 [doi]
- Hierarchical Attention based Neural Network for Explainable RecommendationDawei Cong, Yanyan Zhao, Bing Qin 0001, Yu Han, Murray Zhang, Alden Liu, Nat Chen. 373-381 [doi]
- Image Emotion Distribution Learning with Graph Convolutional NetworksTao He, Xiaoming Jin. 382-390 [doi]
- Multimodal Multimedia Retrieval with vitrivrRalph Gasser, Luca Rossetto, Heiko Schuldt. 391-394 [doi]
- Recognizing User-Defined Subsequences in Human Motion DataJan Sedmidubský, Pavel Zezula. 395-398 [doi]
- DietLens-Eout: Large Scale Restaurant Food Photo RecognitionZhipeng Wei, Jingjing Chen, Zhaoyan Ming, Chong-Wah Ngo, Tat-Seng Chua, Fengfeng Zhou. 399-403 [doi]
- Blockchain and IoT-based Secure Multimedia Retrieval System for a Massive Crowd: Sharing Economy PerspectiveMd. Abdur Rahman, George Loukas, Syed Maruf Abdullah, Areej Abdu, Syed Sadiqur Rahman, Elham Hassanain, Yasmine Arafa. 404-407 [doi]
- EAGER: Edge-Aided imaGe undERstanding SystemJianzhong He, Xiaobin Liu, Shiliang Zhang. 408-412 [doi]