Journal: IEEE Transactions on Multimedia

Volume 22, Issue 9

2193 -- 2206Mohammad Kazemi, Mohammad Ghanbari 0001, Shervin Shirmohammadi. Intra Coding Strategy for Video Error Resiliency: Behavioral Analysis
2207 -- 2220Guyue Hu, Bo Cui, Shan Yu. Joint Learning in the Spatio-Temporal and Frequency Domains for Skeleton-Based Action Recognition
2221 -- 2233Bingshu Wang, Yong Zhao, C. L. Philip Chen. Moving Cast Shadows Segmentation Using Illumination Invariant Feature
2234 -- 2245Cheng Xu, Biao Leng, Bo Chen, Cheng Zhang, Xiaochen Zhou. Learning Discriminative and Generative Shape Embeddings for Three-Dimensional Shape Retrieval
2246 -- 2261Kun Sun 0002, Wenbing Tao, Yuhua Qian. Guide to Match: Multi-Layer Feature Matching With a Hybrid Gaussian Mixture Model
2262 -- 2277Ji-Hwan Park, Ievgeniia Gutenko, Arie E. Kaufman. Transfer Function-Guided Saliency-Aware Compression for Transmitting Volumetric Data
2278 -- 2292Shanfeng Hu, Hubert P. H. Shum, Nauman Aslam, Frederick W. B. Li, Xiaohui Liang. A Unified Deep Metric Representation for Mesh Saliency Detection and Non-Rigid Shape Matching
2293 -- 2306Hanbo Wu, Xin Ma 0001, Yibin Li. Convolutional Networks With Channel and STIPs Attention Model for Action Recognition in Videos
2307 -- 2320Wenbin Che, Xiaopeng Fan, Ruiqin Xiong, Debin Zhao. Visual Relationship Embedding Network for Image Paragraph Generation
2321 -- 2330Mohammad Nazmus Sadat, Rui Dai, Lingchao Kong, Jingyi Zhu. QoE-Aware Multi-Source Video Streaming in Content Centric Networks
2331 -- 2344Yucheng Zhu, Guangtao Zhai, Xiongkuo Min, Jiantao Zhou 0001. The Prediction of Saliency Map for Head and Eye Movements in 360 Degree Images
2345 -- 2353Devraj Mandal, Pramod Rao, Soma Biswas. Semi-Supervised Cross-Modal Retrieval With Label Prediction
2354 -- 2365Xin Fu, Yao Zhao 0001, Yunchao Wei, Yufeng Zhao, Shikui Wei. Rich Features Embedding for Cross-Modal Retrieval: A Simple Baseline
2366 -- 2381Jiarun Song, Fuzheng Yang, Wei Zhang 0072, Wenjie Zou, Yuqun Fan, Peiyun Di. A Fast FoV-Switching DASH System Based on Tiling Mechanism for Practical Omnidirectional Video Services
2382 -- 2395Pantelis Maniotis, Eirina Bourtsoulatze, Nikolaos Thomos. Tile-Based Joint Caching and Delivery of 360° Videos in Heterogeneous Networks
2396 -- 2408Rafael Asorey-Cacheda, Antonio-Javier García-Sánchez, Joan García-Haro. An Efficient NVoD Scheme Using Implicit Error Correction and Subchannels for Wireless Networks
2409 -- 2419Jinyu Zhang, Yifan Zhang, Mengru Shen. A Distance-Driven Alliance for a P2P Live Video System
2420 -- 2433Hongliang Yan, Zhetao Li, Qilong Wang, Peihua Li, Yong Xu 0001, Wangmeng Zuo. Weighted and Class-Specific Maximum Mean Discrepancy for Unsupervised Domain Adaptation
2434 -- 2443Ke Ning, Ming Cai, Di Xie, Fei Wu 0001. An Attentive Sequence to Sequence Translator for Localizing Video Clips by Natural Language
2444 -- 2453Fengxiang Yang, Zhun Zhong, Zhiming Luo, Sheng Lian, Shaozi Li. Leveraging Virtual and Real Person for Unsupervised Person Re-Identification
2454 -- 2466Yun Yi, Hanli Wang, Qinyu Li. Affective Video Content Analysis With Adaptive Fusion Recurrent Network
2467 -- 2478Luming Zhang, Xiaoming Ju, Yiyang Yao, Zhenguang Liu. Massive-Scale Genre Communities Learning Using a Noise-Tolerant Deep Architecture

Volume 22, Issue 8

1917 -- 1928Wenbin Yin, Yunhui Shi, Wangmeng Zuo, Xiaopeng Fan. A Co-Prediction-Based Compression Scheme for Correlated Images
1929 -- 1938Youqing Wu, Youzhi Xiang, Yutang Guo, Jin Tang, Zhaoxia Yin. An Improved Reversible Data Hiding in Encrypted Images Using Parametric Binary Tree Labeling
1939 -- 1954Saeed Mahmoudpour, Peter Schelkens. A Multi-Attribute Blind Quality Evaluator for Tone-Mapped Images
1955 -- 1968Mingkuan Yuan, Yuxin Peng. CKD: Cross-Task Knowledge Distillation for Text-to-Image Synthesis
1969 -- 1984Pengwen Dai, Hua Zhang 0008, Xiaochun Cao. Deep Multi-Scale Context Aware Feature Aggregation for Curved Scene Text Detection
1985 -- 1997Yongyong Chen, Xiaolin Xiao, Yicong Zhou. Jointly Learning Kernel Representation Tensor and Affinity Matrix for Multi-View Clustering
1998 -- 2011Shunli Zhang, Li Zhang, Alexander G. Hauptmann. Fuzzy Least Squares Support Vector Machine With Adaptive Membership for Object Tracking
2012 -- 2023Huaidong Zhang, Xuemiao Xu, Hai He, Shengfeng He, Guoqiang Han, Jing Qin 0001, Dapeng Wu. Fast User-Guided Single Image Reflection Removal via Edge-Aware Cascaded Networks
2024 -- 2037Zhaolin Wan, Ke Gu 0001, Debin Zhao. Reduced Reference Stereoscopic Image Quality Assessment Using Sparse Representation and Natural Scene Statistics
2038 -- 2047Lukas Krasula, Karel Fliegel, Patrick Le Callet. FFTMI: Features Fusion for Natural Tone-Mapped Images Quality Evaluation
2048 -- 2060Xu Lu, Lei Zhu 0002, Jingjing Li, Huaxiang Zhang, Heng Tao Shen. Efficient Supervised Discrete Multi-View Hashing for Large-Scale Multimedia Search
2061 -- 2073Yuxin Peng, Jian Zhang 0032, Zhaoda Ye. Deep Reinforcement Learning for Image Hashing
2074 -- 2085Bo Jiang 0002, Jin Tang, Bin Luo 0001. Feature Matching With Intra-Group Sparse Model
2086 -- 2097Shuhui Jiang, Zhaowen Wang, Aaron Hertzmann, Hailin Jin, Yun Fu 0001. Visual Font Pairing
2098 -- 2110Feng Xue, Richang Hong, Xiangnan He 0001, Jianwei Wang, Shengsheng Qian, Changsheng Xu. Knowledge-Based Topic Model for Multi-Modal Social Event Analysis
2111 -- 2125Lianxin Yang, Dan Wu 0001, Yueming Cai, Xin Shi, Yan Wu. Learning-Based User Clustering and Link Allocation for Content Recommendation Based on D2D Multicast Communications
2126 -- 2137Zhenzhen Wang, Weixiang Hong, Yap-Peng Tan, Junsong Yuan. Pruning 3D Filters For Accelerating 3D ConvNets
2138 -- 2148Hao Song, Che Sun, Xinxiao Wu, Mei Chen, Yunde Jia. Learning Normal Patterns via Adversarial Attention-Based Autoencoder for Abnormal Event Detection in Videos
2149 -- 2162Longteng Guo, Jing Liu 0001, Shichen Lu, Hanqing Lu. Show, Tell, and Polish: Ruminant Decoding for Image Captioning
2163 -- 2176Sheng Yang, Guosheng Lin, Qiuping Jiang, Weisi Lin. A Dilated Inception Network for Visual Saliency Prediction
2177 -- 2190Yongpeng Wu, Dehui Kong, Shaofan Wang, Jinghua Li, Baocai Yin. An Unsupervised Real-Time Framework of Human Pose Tracking From Range Image Sequences

Volume 22, Issue 7

1661 -- 1666Yonghong Tian 0001, Cees G. M. Snoek, Jingdong Wang, Zhu Liu 0001, Rainer Lienhart, Susanne Boll. Guest Editorial Multimedia Computing With Interpretable Machine Learning
1667 -- 1679Haichuan Ma, Dong Liu 0002, Ruiqin Xiong, Feng Wu 0001. iWave: CNN-Based Wavelet-Like Transform for Image Compression
1680 -- 1691Wenhui Xiao, Huiguo He, Tingting Wang 0004, Hongyang Chao. The Interpretable Fast Multi-Scale Deep Decoder for the Standard HEVC Bitstreams
1692 -- 1703Xingxing Zhang, Shupeng Gui, Zhenfeng Zhu, Yao Zhao 0001, Ji Liu 0002. Hierarchical Prototype Learning for Zero-Shot Recognition
1704 -- 1719Ryosuke Furuta, Naoto Inoue, Toshihiko Yamasaki. PixelRL: Fully Convolutional Network With Reinforcement Learning for Image Processing
1720 -- 1729Donghao Gu, Yaowei Li, Feng Jiang 0001, Zhaojing Wen, Shaohui Liu, Wuzhen Shi, Guangming Lu, Changsheng Zhou. VINet: A Visually Interpretable Image Diagnosis Network
1730 -- 1743Weimin Tan, Bo Yan 0001, Chuming Lin, Xuejing Niu. Cycle-IR: Deep Cyclic Image Retargeting
1744 -- 1755Min Zhang, Haoxuan You, Pranav Kadam, Shan Liu, C. C. Jay Kuo. PointHop: An Explainable Machine Learning Method for Point Cloud Classification
1756 -- 1768Shaohua Wan, Yu Xia, Lianyong Qi, Yee-Hong Yang, Mohammed Atiquzzaman. Automated Colorization of a Grayscale Image With Seed Points Propagation
1769 -- 1784Mohammad Soltanian, Sajjad Amini, Shahrokh Ghaemmaghami. Spatio-Temporal VLAD Encoding of Visual Events Using Temporal Ordering of the Mid-Level Deep Semantics
1785 -- 1795Chuanbin Liu, Hongtao Xie, Zhengjun Zha, Lingyun Yu, Zhineng Chen, Yongdong Zhang. Bidirectional Attention-Recognition Model for Fine-Grained Object Classification
1796 -- 1807Yulong Wang, Hang Su 0006, Bo Zhang 0010, Xiaolin Hu 0001. Learning Reliable Visual Saliency For Model Explanations
1808 -- 1822Hao Liu, Penghui Sun, Jiaqiang Zhang, Suping Wu, Zhenhua Yu, Xuehong Sun. Similarity-Aware and Variational Deep Adversarial Learning for Robust Facial Age Estimation
1823 -- 1835Wenwu Zhu 0001, Xin Wang 0019, Wen Gao 0001. Multimedia Intelligence: When Multimedia Meets Artificial Intelligence
1836 -- 1846Zheng-Jun Zha, Jiawei Liu, Di Chen, Feng Wu 0001. Adversarial Attribute-Text Embedding for Person Search With Natural Language Query
1847 -- 1861Xinrui Cui, Dan Wang 0011, Z. Jane Wang 0001. Feature-Flow Interpretation of Deep Convolutional Neural Networks
1862 -- 1873Ricardo Sanchez-Matilla, Chau Yi Li, Ali Shahin Shamsabadi, Riccardo Mazzon, Andrea Cavallaro. Exploiting Vulnerabilities of Deep Neural Networks for Privacy Protection
1874 -- 1888Yuhui Xu, Wenrui Dai, Yingyong Qi, Junni Zou, Hongkai Xiong. Iterative Deep Neural Network Quantization With Lipschitz Constraint
1889 -- 1903Sajjad Amini, Shahrokh Ghaemmaghami. Towards Improving Robustness of Deep Neural Networks to Adversarial Perturbations
1904 -- 1916Zona Kostic, Aleksandar Jevremovic. What Image Features Boost Housing Market Predictions?

Volume 22, Issue 6

1385 -- 1394Yanxiong Li, Mingle Liu, Wucheng Wang, Yuhan Zhang, Qianhua He. Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration
1395 -- 1406Miaohui Wang, Jian-xiong, Long Xu, Wuyuan Xie, King Ngi Ngan, Jing Qin 0001. Rate Constrained Multiple-QP Optimization for HEVC
1407 -- 1422Yunfeng Zhang, Ping Wang 0016, Fangxun Bao, Xunxiang Yao, Caiming Zhang, Hongwei Lin. A Single-Image Super-Resolution Method Based on Progressive-Iterative Approximation
1423 -- 1432Bingjie Xu, Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli. Interact as You Intend: Intention-Driven Human-Object Interaction Detection
1433 -- 1446Federico Angelini, Zeyu Fu, Yang Long 0001, Ling Shao 0001, Syed Mohsen Naqvi. 2D Pose-Based Real-Time Human Action Recognition With Occlusion-Handling
1447 -- 1457Vahid Khorasani Ghassab, Nizar Bouguila. Light Field Super-Resolution Using Edge-Preserved Graph-Based Regularization
1458 -- 1469Yuebin Wang, Liqiang Zhang, Feiping Nie, Xingang Li, Zhijun Chen, Faqiang Wang. WeGAN: Deep Image Hashing With Weighted Generative Adversarial Networks
1470 -- 1484Jin Wang, Wei Xu, Jian-Feng Cai, Qing Zhu, Yunhui Shi, Baocai Yin. Multi-Direction Dictionary Learning Based Depth Map Super-Resolution With Autoregressive Modeling
1485 -- 1495Hai-Miao Hu, Hongda Zhang, Zichen Zhao, Bo Li 0006, Jin Zheng. Adaptive Single Image Dehazing Using Joint Local-Global Illumination Adjustment
1496 -- 1506Heyu Zhou, An-An Liu, Weizhi Nie, Jie Nie. Multi-View Saliency Guided Deep Neural Network for 3-D Object Retrieval and Classification
1507 -- 1518Min Wang, Wengang Zhou, Qi Tian 0001, Houqiang Li. Neighborhood Pyramid Preserving Hashing
1519 -- 1530Haitao Zeng, Xinhang Song, Gongwei Chen, Shuqiang Jiang. Learning Scene Attribute for Scene Recognition
1531 -- 1541Miklas Strøm Kristoffersen, Sven Ewan Shepstone, Zheng-Hua Tan. The Importance of Context When Recommending TV Content: Dataset and Algorithms
1542 -- 1554Shuyan Li, Zhixiang Chen, Xiu Li, Jiwen Lu, Jie Zhou 0001. Unsupervised Variational Video Hashing With 1D-CNN-LSTM Networks
1555 -- 1566Peiguang Jing, Shu Ye, Liqiang Nie, Jing Liu 0002, Yuting Su. Low-Rank Regularized Multi-Representation Learning for Fashion Compatibility Prediction
1567 -- 1576Xuedou Xiao, Wei Wang 0050, Taobin Chen, Yang Cao 0002, Tao Jiang 0002, Qian Zhang 0001. Sensor-Augmented Neural Adaptive Bitrate Video Streaming on UAVs
1577 -- 1590Fuchen Long, Ting Yao, Zhaofan Qiu, Xinmei Tian, Tao Mei 0001, Jiebo Luo. Coarse-to-Fine Localization of Temporal Action Proposals
1591 -- 1604Xiongtao Chen, Wenmin Wang. Uni-and-Bi-Directional Video Prediction via Learning Object-Centric Transformation
1605 -- 1618Chaoqun Wan, Yue Wu, Xinmei Tian, Jianqiang Huang, Xian-Sheng Hua 0001. Concentrated Local Part Discovery With Fine-Grained Part Representation for Person Re-Identification
1619 -- 1633Xianjun Han, Hongyu Yang, Guanyu Xing, Yanli Liu 0002. Asymmetric Joint GANs for Normalizing Face Illumination From a Single Image
1634 -- 1646Chenchen Li, Jialin Wang, Hongwei Wang 0004, Miao Zhao, Wenjie Li 0002, Xiaotie Deng. Visual-Texual Emotion Analysis With Deep Coupled Video and Danmu Neural Networks
1647 -- 1659Xiaoyan Gao, Fuli Feng, Xiangnan He 0001, Heyan Huang, Xinyu Guan, Chong Feng, Zhaoyan Ming, Tat-Seng Chua. Hierarchical Attention Network for Visually-Aware Food Recommendation

Volume 22, Issue 5

1113 -- 1125Minxiang Ye, Cheng Yang, Vladimir Stankovic 0001, Lina Stankovic, Samuel Cheng. Distinct Feature Extraction for Video-Based Gait Phase Classification
1126 -- 1138Tilo Strutz, Phillip Möller. Screen Content Compression Based on Enhanced Soft Context Formation
1139 -- 1152Chieh-Chi Kao, Yuxiang Wang, Jonathan Waltman, Pradeep Sen. Patch-Based Image Hallucination for Super Resolution With Detail Reconstruction From Similar Sample Images
1153 -- 1167Yunxiao Li, Shuai Li 0001, Chenglizhao Chen, Aimin Hao, Hong Qin. Accurate and Robust Video Saliency Detection via Self-Paced Diffusion
1168 -- 1181YongQing Liang, Xin Li 0003. Reassembling Shredded Document Stripes Using Word-Path Metric and Greedy Composition Optimal Matching Solver
1182 -- 1192Lin Xie, Feifei Lee, Li Liu 0010, Zhong Yin, Qiu Chen. Hierarchical Coding of Convolutional Features for Scene Recognition
1193 -- 1207Ying Wang, Yifan Dong, Songtao Guo, Yuanyuan Yang 0001, Xiaofeng Liao. Latency-Aware Adaptive Video Summarization for Mobile Edge Clouds
1208 -- 1219Xiongli Chai, Feng Shao, Qiuping Jiang, Yo-Sung Ho. MSTGAR: Multioperator-Based Stereoscopic Thumbnail Generation With Arbitrary Resolution
1220 -- 1233Wenfeng Song, Shuai Li 0001, Ji Liu, Aimin Hao, Qinping Zhao, Hong Qin. Contextualized CNN for Scene-Aware Depth Estimation From Single RGB Image
1234 -- 1248Weipeng Hu, Haifeng Hu 0001. Disentangled Spectrum Variations Networks for NIR-VIS Face Recognition
1249 -- 1258Alexandra Covaci, Estêvão Bissoli Saleme, Gebremariam Mesfin, Nadia Hussain, Elahe Kani-Zabihi, Gheorghita Ghinea. How Do We Experience Crossmodal Correspondent Mulsemedia Content?
1259 -- 1272Tao Xiang, Ying Yang, Shangwei Guo. Blind Night-Time Image Quality Assessment: Subjective and Objective Approaches
1273 -- 1284Luming Zhang, Jianwei Yin, Ping Li 0006, Yongheng Shang, Roger Zimmermann, Ling Shao 0001. Flickr Image Community Analytics by Deep Noise-Refined Matrix Factorization
1285 -- 1297Yehao Li, Ting Yao, Yingwei Pan, Hongyang Chao, Tao Mei 0001. Deep Metric Learning With Density Adaptivity
1298 -- 1309Yujuan Ding, Wai-Keung Wong, Zhihui Lai, Yudong Chen 0002. Study on 2D Feature-Based Hash Learning
1310 -- 1322Yiling Wu, Shuhui Wang, Qingming Huang. Online Fast Adaptive Low-Rank Similarity Learning for Cross-Modal Retrieval
1323 -- 1332Zhiyang Xia, Ping Yi, Yunyu Liu, Bo Jiang, Wei Wang 0190, Ting Zhu. GENPass: A Multi-Source Deep Learning Model for Password Guessing
1333 -- 1344Shuang Qiu, Yao Zhao 0001, Jianbo Jiao, Yunchao Wei, Shikui Wei. Referring Image Segmentation by Generative Adversarial Learning
1345 -- 1357Yabin Zhang, Kui Jia, Zhixin Wang. Part-Aware Fine-Grained Object Categorization Using Weakly Supervised Part Detection Network
1358 -- 1371Dongyu She, Jufeng Yang, Ming-Ming Cheng, Yu-Kun Lai, Paul L. Rosin, Liang Wang 0001. WSCNet: Weakly Supervised Coupled Networks for Visual Sentiment Classification and Detection
1372 -- 1383Ning Xu 0003, Hanwang Zhang, An-An Liu, Weizhi Nie, Yuting Su, Jie Nie, Yongdong Zhang. Multi-Level Policy and Reward-Based Deep Reinforcement Learning Framework for Image Captioning

Volume 22, Issue 4

833 -- 845Dayong Wang, Yu Sun 0003, Ce Zhu, Weisheng Li 0001, Frédéric Dufaux. Fast Depth and Inter Mode Prediction for Quality Scalable High Efficiency Video Coding
846 -- 859Deyang Liu, Ping An, Ran Ma, Wenfa Zhan, Xinpeng Huang, Ali Abdullah Yahya. Content-Based Light Field Image Compression Method With Gaussian Process Regression
860 -- 873Zhengxue Cheng, Heming Sun, Masaru Takeuchi, Jiro Katto. Energy Compaction-Based Image Compression Using Convolutional AutoEncoder
874 -- 884Zhaoxia Yin, Youzhi Xiang, Xinpeng Zhang. Reversible Data Hiding in Encrypted Images Based on Multi-MSB Prediction and Huffman Coding
885 -- 896Cheng Deng, Xu Yang, Feiping Nie, Dapeng Tao. Saliency Detection via a Multiple Self-Weighted Graph-Based Manifold Ranking
897 -- 911Chandramani Chaudhary, Poonam Goyal, Dhanashree Nellayi Prasad, Yi-Ping Phoebe Chen. Enhancing the Quality of Image Tagging Using a Visio-Textual Knowledge Base
912 -- 920Badri Narayan Subudhi, Veerakumar Thangaraj, Esakkirajan Sankaralingam, Ashish Ghosh. Kernelized Fuzzy Modal Variation for Local Change Detection From Video Scenes
921 -- 933Xun Liu, Mischa Dohler, Yansha Deng. Vibrotactile Quality Assessment: Hybrid Metric Design Based on SNR and SSIM
934 -- 948Yang Liu, Volkan Kiliç, Jian Guan, Wenwu Wang. Audio-Visual Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking
949 -- 960Xiwen Liu, Xiaoming Tao, Mai Xu, Yafeng Zhan, Jianhua Lu. An EEG-Based Study on Perception of Video Distortion Under Various Content Motion Conditions
961 -- 969Lukas Krasula, Yoann Baveye, Patrick Le Callet. Training Objective Image and Video Quality Estimators Using Multiple Databases
970 -- 979Muwei Jian, Junyu Dong, Maoguo Gong, Hui Yu 0001, Liqiang Nie, Yilong Yin, Kin-Man Lam. Learning the Traditional Art of Chinese Calligraphy via Three-Dimensional Reconstruction and Assessment
980 -- 991Hyunmin Jung, Hyuk-Jae Lee, Chae-Eun Rhee. Flexibly Connectable Light Field System For Free View Exploration
992 -- 1004Thanh-Toan Do, Tuan Hoang, Dang-Khoa Le Tan, Anh-Dzung Doan, Ngai-Man Cheung. Compact Hash Code Learning With Binary Deep Neural Network
1005 -- 1015Riza Arda Kirmizioglu, A. Murat Tekalp. Multi-Party WebRTC Services Using Delay and Bandwidth Aware SDN-Assisted IP Multicasting of Scalable Video Over 5G Networks
1016 -- 1031Chung-Chi Tsai, Kuang-Jui Hsu, Yen-Yu Lin, Xiaoning Qian, Yung-Yu Chuang. Deep Co-Saliency Detection via Stacked Autoencoder-Enabled Fusion and Self-Trained CNNs
1032 -- 1041Wenqiao Zhang, Siliang Tang, Yanpeng Cao, Shiliang Pu, Fei Wu 0001, Yueting Zhuang. Frame Augmented Alternating Attention Network for Video Question Answering
1042 -- 1054Zewei He, Yanpeng Cao, Lei Du, Baobei Xu, Jiangxin Yang, Yanlong Cao, Siliang Tang, Yueting Zhuang. MRFN: Multi-Receptive-Field Network for Fast and Accurate Single Image Super-Resolution
1055 -- 1068Zhi Jin, Muhammad Zafar Iqbal, Dmytro Bobkov, Wenbin Zou, Xia Li 0006, Eckehard G. Steinbach. A Flexible Deep CNN Framework for Image Restoration
1069 -- 1083Zhe Zhang, Chung-Horng Lung, Marc St-Hilaire, Ioannis Lambadaris. An SDN-Based Caching Decision Policy for Video Caching in Information-Centric Networking
1084 -- 1097Shangfei Wang, Longfei Hao, Qiang Ji. Knowledge-Augmented Multimodal Deep Regression Bayesian Networks for Emotion Video Tagging
1098 -- 1110Tianliang Liu, Junwei Wan, Xiubin Dai, Feng Liu 0028, Quanzeng You, Jiebo Luo. Sentiment Recognition for Short Annotated GIFs Using Visual-Textual Fusion
1111 -- 0Zhaoqiang Xia, Xiaopeng Hong, Xingyu Gao, Xiaoyi Feng, Guoying Zhao. Corrections to "Spatiotemporal Recurrent Convolutional Networks for Recognizing Spontaneous Micro-Expressions"

Volume 22, Issue 3

569 -- 578Xianjun Xia, Roberto Togneri, Ferdous Sohel, Yuanjun Zhao, David Huang 0001. Multi-Task Learning for Acoustic Event Detection Using Event and Frame Position Information
579 -- 593Ruben Verhack, Thomas Sikora, Glenn Van Wallendael, Peter Lambert. Steered Mixture-of-Experts for Light Field Images and Video: Representation and Coding
594 -- 609Qianru Jiang, Sheng Li 0005, Zhihui Zhu, Huang Bai, Xiongxiong He, Rodrigo C. de Lamare. Design of Compressed Sensing System With Probability-Based Prior Information
610 -- 625Pan Gao, Manoranjan Paul. Rate-Distortion Optimal Joint Texture and Depth Map Coding for 3-D Video Streaming
626 -- 640Zhaoqiang Xia, Xiaopeng Hong, Xingyu Gao, Xiaoyi Feng, Guoying Zhao. Spatiotemporal Recurrent Convolutional Networks for Recognizing Spontaneous Micro-Expressions
641 -- 654Fan Tang, Weiming Dong, Yiping Meng, Chongyang Ma, Fuzhang Wu, Xinrui Li, Tong-Yee Lee. Image Retargetability
655 -- 665Xiaoting Fan, Jianjun Lei, Yuming Fang, Qingming Huang, Nam Ling, Chunping Hou. Stereoscopic Image Stitching via Disparity-Constrained Warping and Blending
666 -- 675Qiao Liu, Zhenyu He, Xin Li 0034, Yuan Zheng. PTB-TIR: A Thermal Infrared Pedestrian Tracking Benchmark
676 -- 687Bo Yan 0001, Xuejing Niu, Bahetiyaer Bare, Weimin Tan. Semantic Segmentation Guided Pixel Fusion for Image Retargeting
688 -- 703Konstantina Fotiadou, Grigorios Tsagkatakis, Panagiotis Tsakalides. Snapshot High Dynamic Range Imaging via Sparse Representations and Feature Learning
704 -- 716Chongyi Li, Chunle Guo, Jichang Guo, Ping Han, Huazhu Fu, Runmin Cong. PDR-Net: Perception-Inspired Single Image Dehazing Network With Refinement
717 -- 729Wooyoung Jang. MLC STT-MRAM-Aware Memory Subsystem for Smart Image Applications
730 -- 743Jianwen Lou, Yiming Wang, Charles Nduka, Mahyar Hamedi, Ifigeneia Mavridou, Fei-Yue Wang 0001, Hui Yu 0001. Realistic Facial Expression Reconstruction for VR HMD Users
744 -- 759Ching-Ling Fan, Shou-Cheng Yen, Chun-Ying Huang, Cheng-Hsin Hsu. Optimizing Fixation Prediction Using Recurrent Neural Networks for 360$^{\circ }$ Video Streaming in Head-Mounted Virtual Reality
760 -- 774Chao Ma 0005, Chen Gong 0002, Xiang Li, Xiaolin Huang, Wei Liu 0005, Jie Yang 0002. Toward Making Unsupervised Graph Hashing Discriminative
775 -- 785Lingling Zhang, Minnan Luo, Jun Liu 0002, Xiaojun Chang, Yi Yang 0001, Alexander G. Hauptmann. Deep Top-$k$ Ranking for Image-Sentence Matching
786 -- 794Liping Zhao 0005, Tao Lin, Dongyu Zhang, Kailun Zhou, Shuhui Wang. An Ultra-Low Complexity and High Efficiency Approach for Lossless Alpha Channel Coding
795 -- 807Cheng Zhan, Han Hu 0003, Zhi Wang 0001, Rongfei Fan, Dusit Niyato. Unmanned Aircraft System Aided Adaptive Video Streaming: A Joint Optimization Approach
808 -- 818Lingxiang Wu, Min Xu 0001, Jinqiao Wang, Stuart W. Perry. Recall What You See Continually Using GridLSTM in Image Captioning
819 -- 829Sebastian Agethen, Winston H. Hsu. Deep Multi-Kernel Convolutional LSTM Networks and an Attention-Based Mechanism for Videos
830 -- 0Chenggang Yan, Yunbin Tu, Xingzheng Wang, Yongbing Zhang, Xinhong Hao, Yongdong Zhang, Qionghai Dai. Corrections to "STAT: Spatial-Temporal Attention Mechanism for Video Captioning"

Volume 22, Issue 2

285 -- 297Bin Xiao 0002, Ge Ou, Han Tang, Xiu-Li Bi, Weisheng Li 0001. Multi-Focus Image Fusion by Hessian Matrix Based Decomposition
298 -- 310Bo-kyeong Kim, Geon-min Kim, Soo-Young Lee. Style-Controlled Synthesis of Clothing Segments for Fashion Image Manipulation
311 -- 323Ke Gu 0001, Zhifang Xia, Junfei Qiao, Weisi Lin. Deep Dual-Channel Neural Network for Image-Based Smoke Detection
324 -- 336Guangxiao Ma, Chenglizhao Chen, Shuai Li 0001, Chong Peng, Aimin Hao, Hong Qin. Salient Object Detection via Multiple Instance Joint Re-Learning
337 -- 348Haijun Liu, ShiGuang Wang, Wen Wang, Jian Cheng 0003. Multi-Scale Based Context-Aware Net for Action Detection
349 -- 364Congxuan Zhang, Liyue Ge, Zhen Chen 0004, Ming Li, Wen Liu, Hao Chen. 1 Optical Flow Estimation Using Joint Filtering
365 -- 379Youtian Du, Xue Wang, Yunbo Cui, Hang Wang, Chang Su. Kernel-Based Mixture Mapping for Image and Text Association
380 -- 393Shifeng Zhang, Yiliang Xie, Jun Wan 0001, Hansheng Xia, Stan Z. Li, Guodong Guo. WiderPerson: A Diverse Dataset for Dense Pedestrian Detection in the Wild
394 -- 406Ke Xu 0003, Tanfeng Sun, Xinghao Jiang. Video Anomaly Detection and Localization Based on an Adaptive Intra-Frame Classification Network
407 -- 420Ming Cheung, James She. Detecting Social Signals in User-Shared Images for Connection Discovery Using Deep Learning
421 -- 431Yeqiang Qian, Ming Yang 0002, Xu Zhao, Chunxiang Wang, Bing Wang 0006. Oriented Spatial Transformer Network for Pedestrian Detection Using Fish-Eye Camera
432 -- 444Lixing Chen, Linqi Song, Jacob Chakareski, Jie Xu 0001. Collaborative Content Placement Among Wireless Edge Caching Stations With Time-to-Live Cache
445 -- 458Zeyu Xu, Yang Cao 0002, Wei Wang 0050, Tao Jiang 0002, Qian Zhang 0001. Incentive Mechanism for Cooperative Scalable Video Coding (SVC) Multicast Based on Contract Theory
459 -- 473Hao Chen 0036, Xu Zhang 0006, Yiling Xu, Zhan Ma, Wenjun Zhang 0001. Efficient Mobile Video Streaming via Context-Aware RaptorQ-Based Unequal Error Protection
474 -- 486Kefan Xiao, Shiwen Mao, Jitendra K. Tugnait. Robust QoE-Driven DASH Over OFDMA Networks
487 -- 501Cheng Shi, Chi-Man Pun. Multiscale Superpixel-Based Hyperspectral Image Classification Using Recurrent Neural Networks With Stacked Autoencoders
502 -- 514Pau Rodríguez, Diego Velazquez Dorta, Guillem Cucurull, Josep M. Gonfaus, F. Xavier Roca, Jordi Gonzàlez 0001. Pay Attention to the Activations: A Modular Attention Mechanism for Fine-Grained Image Recognition
515 -- 523Wei Zhang 0066, Xuanyu He, Weizhi Lu. Exploring Discriminative Representations for Image Emotion Recognition With CNNs
524 -- 539Lihua Lu, Yao Lu 0001, Ruizhe Yu, Huijun Di, Lin Zhang, Shunzhou Wang. GAIM: Graph Attention Interaction Model for Collective Activity Recognition
540 -- 553Zheng Zhang, Qin Zou 0001, Yuewei Lin, Long Chen 0005, Song Wang 0002. Improved Deep Hashing With Soft Pairwise Similarity for Multi-Label Image Retrieval
554 -- 565Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli. Video Storytelling: Textual Summaries for Events

Volume 22, Issue 12

3025 -- 3038Shijie Hao, Xu Han, Yanrong Guo, Xin Xu 0007, Meng Wang 0001. Low-Light Image Enhancement With Semi-Decoupled Decomposition
3039 -- 3050Heqian Qiu, Hongliang Li, Qingbo Wu 0001, Fanman Meng, Linfeng Xu, King Ngi Ngan, Hengcan Shi. Hierarchical Context Features Embedding for Object Detection
3051 -- 3063Hongkai Yu, Kang Zheng, Jianwu Fang, Hao Guo 0002, Song Wang 0002. A New Method and Benchmark for Detecting Co-Saliency Within a Single Image
3064 -- 3074Zelong Zeng, Zhixiang Wang, Zheng Wang 0007, Yinqiang Zheng, Yung-Yu Chuang, Shin'ichi Satoh. Illumination-Adaptive Person Re-Identification
3075 -- 3087Ruifan Li, Ning Wang, Fangxiang Feng, Guangwei Zhang, Xiaojie Wang. Exploring Global and Local Linguistic Representations for Text-to-Image Synthesis
3088 -- 3100Junyu Gao, Changsheng Xu. CI-GNN: Building a Category-Instance Graph for Zero-Shot Video Classification
3101 -- 3114Xinhong Ma, Tianzhu Zhang, Changsheng Xu. Multi-Level Correlation Adversarial Hashing for Cross-Modal Retrieval
3115 -- 3127T. J. Tsai 0001, Daniel Yang, Mengyi Shan, Thitaree Tanprasert, Teerapat Jenrungrot. Using Cell Phone Pictures of Sheet Music To Retrieve MIDI Passages
3128 -- 3138Weiqing Min, Shuhuan Mei, Zhuo Li, Shuqiang Jiang. A Two-Stage Triplet Network Training Framework for Image Retrieval
3139 -- 3152Omar Eltobgy, Omar Arafa, Mohamed Hefeeda. Mobile Streaming of Live 360-Degree Videos
3153 -- 3165Hantang Liu, Yinghao Xu, Jialiang Zhang, Jianke Zhu, Yang Li 0041, Steven C. H. Hoi. DeepFacade: A Deep Learning Approach to Facade Parsing With Symmetric Loss
3166 -- 3179Yixiong Zou, Yemin Shi 0001, Daochen Shi, Yaowei Wang, Yongsheng Liang, Yonghong Tian 0001. Adaptation-Oriented Feature Projection for One-Shot Action Recognition
3180 -- 3195Cairong Zhao, Xinbi Lv, Zhang Zhang 0001, Wangmeng Zuo, Jun Wu 0006, Duoqian Miao. Deep Fusion Feature Representation Learning With Hard Mining Center-Triplet Loss for Person Re-Identification
3196 -- 3209Jing Yu, Weifeng Zhang, Yuhang Lu, Zengchang Qin, Yue Hu 0002, Jianlong Tan, Qi Wu 0001. Reasoning on the Relation: Enhancing Visual Representation for Visual Question Answering and Cross-Modal Retrieval
3210 -- 3223Huaizheng Zhang, Linsen Dong, Guanyu Gao, Han Hu 0003, Yonggang Wen 0001, Kyle Guan. DeepQoE: A Multimodal Learning Framework for Video Quality of Experience (QoE) Prediction
3224 -- 3235Linwei Ye, Zhi Liu 0003, Yang Wang 0003. Dual Convolutional LSTM Network for Referring Image Segmentation
3236 -- 3248Yubao Sun, Ying Yang, Qingshan Liu 0001, Jiwei Chen, Xiao-Tong Yuan, Guodong Guo. Learning Non-Locally Regularized Compressed Sensing Network With Half-Quadratic Splitting

Volume 22, Issue 11

2749 -- 2763Weiyao Lin, Xiaoyi He, Xintong Han, Dong Liu 0002, John See, Junni Zou, Hongkai Xiong, Feng Wu 0001. Partition-Aware Adaptive Switching Neural Networks for Post-Processing in HEVC
2764 -- 2779Heming Sun, Zhengxue Cheng, Masaru Takeuchi, Jiro Katto. Enhanced Intra Prediction for Video Coding by Using Multiple Neural Networks
2780 -- 2791Arnaud Delmotte, Kenichiro Tanaka, Hiroyuki Kubo, Takuya Funatomi, Yasuhiro Mukaigawa. Blind Watermarking for 3-D Printed Objects by Locally Modifying Layer Thickness
2792 -- 2807Yan Yan 0001, Ying Huang, Si Chen 0002, Chunhua Shen, Hanzi Wang. Joint Deep Learning of Facial Expression Synthesis and Recognition
2808 -- 2819Wei Wang 0108, Xavier Alameda-Pineda, Dan Xu 0002, Elisa Ricci 0001, Nicu Sebe. Learning How to Smile: Expression Video Generation With Conditional Adversarial Recurrent Nets
2820 -- 2832Bo Huang 0012, Tingfa Xu, Shenwang Jiang, Yiwen Chen, Yu Bai 0009. Robust Visual Tracking via Constrained Multi-Kernel Correlation Filters
2833 -- 2843Jingna Sun, Wenming Yang, Jing-Hao Xue, Qingmin Liao. An Equalized Margin Loss for Face Recognition
2844 -- 2857Yongshan Zhang, Jia Wu 0001, Zhihua Cai, Philip S. Yu. Multi-View Multi-Label Learning With Sparse Feature Selection for Image Annotation
2858 -- 2872Tu Bui, Daniel Cooper, John P. Collomosse, Mark Bell, Alex Green 0002, John Sheridan, Jez Higgins, Arindra Das, Jared Robert Keller, Olivier Thereaux. Tamper-Proofing Video With Hierarchical Attention Autoencoder Hashing on Blockchain
2873 -- 2888Dongmei Mo, Zhihui Lai, Xizhao Wang, Wai-Keung Wong. Jointly Sparse Locality Regression for Image Feature Extraction
2889 -- 2904Xin Yuan 0002, Raziel Haimi-Cohen. Image Compression Based on Compressive Sensing: End-to-End Comparison With JPEG
2905 -- 2913Hao Luo 0004, Wei Jiang 0009, Xing Fan, Chi Zhang. STNReID: Deep Convolutional Networks With Pairwise Spatial Transformer Networks for Partial Person Re-Identification
2914 -- 2925Guihua Wen, Tian-Yuan Chang, Huihui Li, Lijun Jiang. Dynamic Objectives Learning for Facial Expression Recognition
2926 -- 2937Tong Zhang 0021, Wenming Zheng, Zhen Cui 0001, Yuan Zong, Chaolong Li, Xiaoyan Zhou, Jian Yang 0003. Deep Manifold-to-Manifold Transforming Network for Skeleton-Based Action Recognition
2938 -- 2949Guangming Sun, Bufan Shi, Xiaodong Chen, Andrey S. Krylov, Yong Ding 0003. Learning Local Quality-Aware Structures of Salient Regions for Stereoscopic Images via Deep Neural Networks
2950 -- 2962Yongzhe Xu, Jiangchuan Hu, Kanoksak Wattanachote, Kun Zeng, Yongyi Gong. Sketch-Based Shape Retrieval via Best View Selection and a Cross-Domain Similarity Measure
2963 -- 2976Tongtong Feng, Haifeng Sun, Qi Qi 0001, Jingyu Wang 0001, Jianxin Liao. Vabis: Video Adaptation Bitrate System for Time-Critical Live Streaming
2977 -- 2989Kaijun Zhu, Ruxin Wang, Qingsong Zhao, Jun Cheng 0002, Dapeng Tao. A Cuboid CNN Model With an Attention Mechanism for Skeleton-Based Action Recognition
2990 -- 3001Jun Li 0072, Xianglong Liu, Wenxuan Zhang, Mingyuan Zhang, Jingkuan Song, Nicu Sebe. Spatio-Temporal Attention Networks for Action Recognition and Detection
3002 -- 3013Yihang Lou, Ling-Yu Duan, Yong Luo 0002, Ziqian Chen, Tongliang Liu, Shiqi Wang 0001, Wen Gao 0001. Towards Efficient Front-End Visual Sensing for Digital Retina: A Model-Centric Paradigm
3014 -- 3024Chenggang Yan, Biyao Shao, Hao Zhao, Ruixin Ning, Yongdong Zhang, Feng Xu 0005. 3D Room Layout Estimation From a Single RGB Image

Volume 22, Issue 10

2481 -- 2496Danilo Avola, Marco Cascio, Luigi Cinque, Gian Luca Foresti, Cristiano Massaroni, Emanuele Rodolà. 2-D Skeleton-Based Action Recognition via Two-Branch Stacked LSTM-RNNs
2497 -- 2510Jiaying Liu 0001, Sifeng Xia, Wenhan Yang. Deep Reference Generation With Multi-Domain Hierarchical Constraints for Inter Prediction
2511 -- 2525Fei Peng, Li-Ping Yin, Le-Bing Zhang, Min Long. CGR-GAN: CG Facial Image Regeneration for Antiforensics Based on Generative Adversarial Network
2526 -- 2536Li Li, Weiming Zhang, Kejiang Chen, Nenghai Yu. Steganographic Security Analysis From Side Channel Steganalysis and Its Complementary Attacks
2537 -- 2550Faming Fang, Tingting Wang, Yang Wang, Tieyong Zeng, Guixu Zhang. Variational Single Image Dehazing for Enhanced Visualization
2551 -- 2563Runpeng Cui, Zhong Cao, Weishen Pan, Changshui Zhang, Jianqiang Wang 0003. Deep Gesture Video Generation With Learning on Regions of Interest
2564 -- 2578Qiqin Dai, Henry Chopp, Emeline Pouyet, Oliver Cossairt, Marc Walton, Aggelos K. Katsaggelos. Adaptive Image Sampling Using Deep Learning and Its Application on X-Ray Fluorescence Image Reconstruction
2579 -- 2596Pingping Tang, Yuning Dong, Jiong Jin, Shiwen Mao. Fine-Grained Classification of Internet Video Traffic From QoS Perspective Using Fractal Spectrum
2597 -- 2609Hao Luo 0004, Wei Jiang 0009, Youzhi Gu, Fuxu Liu, Xingyu Liao, Shenqi Lai, Jianyang Gu. A Strong Baseline and Batch Normalization Neck for Deep Person Re-Identification
2610 -- 2622Shiwei Zhang, Lin Song, Changxin Gao, Nong Sang. GLNet: Global Local Network for Weakly Supervised Action Localization
2623 -- 2634Qi Kuang, Xin Jin, Qinping Zhao, Bin Zhou. Deep Multimodality Learning for UAV Video Aesthetic Quality Assessment
2635 -- 2644Jiachen Yang, Yang Zhao, Bin Jiang 0003, Wen Lu, Xinbo Gao. No-Reference Quality Evaluation of Stereoscopic Video Based on Spatio-Temporal Texture
2645 -- 2658Florian Thalmann, Geraint A. Wiggins, Mark B. Sandler. Representing Modifiable and Reusable Musical Content on the Web With Constrained Multi-Hierarchical Structures
2659 -- 2671Weiqing Min, Shuqiang Jiang, Ramesh C. Jain. Food Recommendation: Framework, Existing Solutions, and Challenges
2672 -- 2683Tongyu Dai, Xinggong Zhang, Yihang Zhang, Zongming Guo. Statistical Learning Based Congestion Control for Real-Time Video Communication
2684 -- 2697Peilun Zhou, Tong Xu, Zhizhuo Yin, Dong Liu 0002, Enhong Chen, Guangyi Lv, Changliang Li. Character-Oriented Video Summarization With Visual and Textual Cues
2698 -- 2710Yamin Han, Peng Zhang 0005, Tao Zhuo, Wei Huang 0013, Yufei Zha, Yanning Zhang. Ensemble Tracking Based on Diverse Collaborative Framework With Multi-Cue Dynamic Fusion
2711 -- 2722Li Yuan, Francis Eng Hock Tay, Ping Li 0006, Jiashi Feng. Unsupervised Video Summarization With Cycle-Consistent Adversarial LSTM Networks
2723 -- 2733Peihao Chen, Chuang Gan, Guangyao Shen, Wenbing Huang, Runhao Zeng, Mingkui Tan. Relation Attention for Temporal Action Localization
2734 -- 2747Kui Jiang, Zhongyuan Wang, Peng Yi, Guangcheng Wang, Ke Gu 0001, Junjun Jiang. ATMFN: Adaptive-Threshold-Based Multi-Model Fusion Network for Compressed Face Hallucination

Volume 22, Issue 1

1 -- 0Wenwu Zhu 0001. Message From the Outgoing Editor-in-Chief
2 -- 0J. Luo. Editorial
3 -- 14S. Chandrakala, S. L. Jayalakshmi. Generative Model Driven Representation Learning in a Hybrid Framework for Environmental Audio Scene and Sound Event Recognition
15 -- 29Kuo-Wei Chen, Ying-Sheng Luo, Yu-chi Lai, Yan-Lin Chen, Chih-Yuan Yao, Hung-Kuo Chu, Tong-Yee Lee. Image Vectorization With Real-Time Thin-Plate Spline
30 -- 44Joongchol Shin, Minseo Kim, Joonki Paik, SangKeun Lee. 0-Norm for Single Image Dehazing
45 -- 58Linwei Zhu, Sam Kwong, Yun Zhang 0002, Shiqi Wang, Xu Wang 0006. Generative Adversarial Network-Based Intra Prediction for Video Coding
59 -- 68Wei Xiao, Xiaolin Huang, Fan He, Jorge Silva, Saba Emrani, Arin Chaudhuri. Online Robust Principal Component Analysis With Change Point Detection
69 -- 81Javier Cubelos, Pablo Carballeira, Jesús Gutiérrez, Narciso García. QoE Analysis of Dense Multiview Video With Head-Mounted Devices
82 -- 95Lixiang Li, Guoqian Wen, Zeming Wang, Yixian Yang. Efficient and Secure Image Communication System Based on Compressed Sensing for IoT Monitoring Applications
96 -- 107Yufei Zha, Tao Ku, Yunqiang Li, Peng Zhang. Deep Position-Sensitive Tracking
108 -- 121Sarala Ghimire, Jae Young Choi, Bumshik Lee. Using Blockchain for Improved Video Integrity Verification
122 -- 137Sijie Mai, Songlong Xing, Haifeng Hu 0001. Locally Confined Modality Fusion Network With a Global Perspective for Multimodal Human Affective Computing
138 -- 147Laura Cabrera Quiros, David M. J. Tax, Hayley Hung. Gestures In-The-Wild: Detecting Conversational Hand Gestures in Crowded Scenes Using a Multimodal Fusion of Bags of Video Trajectories and Body Worn Acceleration
148 -- 159Guoyun Tu, Yanwei Fu, Boyang Li, Jiarui Gao, Yu-Gang Jiang, Xiangyang Xue. A Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization
160 -- 173Zhengzheng Tu, Tian Xia, Chenglong Li 0002, Xiaoxiao Wang, Yan Ma, Jin Tang. RGB-T Image Saliency Detection via Collaborative Graph Learning
174 -- 187Jian Zhang 0032, Yuxin Peng. Multi-Pathway Generative Adversarial Hashing for Unsupervised Cross-Modal Retrieval
188 -- 200Yanbin Hao, Chong-Wah Ngo, Benoit Huet. Neighbourhood Structure Preserving Cross-Modal Embedding for Video Hyperlinking
201 -- 214Xin-Lin Huang, Xiaowei Tang, Fei Hu 0001. Dynamic Spectrum Access for Multimedia Transmission Over Multi-User, Multi-Channel Cognitive Radio Networks
215 -- 228Jiale Bai, Zefan Li, Bingbing Ni, Minsi Wang, Xiaokang Yang, Chuanping Hu, Wen Gao 0001. Loopy Residual Hashing: Filling the Quantization Gap for Image Retrieval
229 -- 241Chenggang Yan, Yunbin Tu, Xingzheng Wang, Yongbing Zhang, Xinhong Hao, Yongdong Zhang, Qionghai Dai. STAT: Spatial-Temporal Attention Mechanism for Video Captioning
242 -- 255Shafin Rahman, Salman H. Khan 0001, Nick Barnes. Deep0Tag: Deep Multiple Instance Learning for Zero-Shot Image Tagging
256 -- 270Songtao Wu, Sheng-hua Zhong, Yan Liu. A Novel Convolutional Neural Network for Image Steganalysis With Shared Normalization
271 -- 283Silvia Cascianelli, Gabriele Costante, Alessandro Devo, Thomas A. Ciarfuglia, Paolo Valigi, Mario Luca Fravolini. The Role of the Input in Natural Language Video Description