| 1 | -- | 4 | Tao Mei 0001, Jason J. Corso, Gunhee Kim, Jiebo Luo, Chunhua Shen, Hanwang Zhang. Guest Editorial Introduction to the Special Section on Video and Language |
| 5 | -- | 16 | Yuan Liu, Jingyuan Chen, Xinpeng Chen, Bing Deng, Jianqiang Huang, Xiansheng Hua 0001. Centerness-Aware Network for Temporal Action Proposal |
| 17 | -- | 30 | Linghui Li, Yongdong Zhang 0001, Sheng Tang, Lingxi Xie, Xiaoyong Li, Qi Tian 0001. Adaptive Spatial Location With Balanced Loss for Video Captioning |
| 31 | -- | 42 | Yi Zheng, Yuejie Zhang, Rui Feng, Tao Zhang 0022, Weiguo Fan. Stacked Multimodal Attention Network for Context-Aware Video Captioning |
| 43 | -- | 51 | Chenggang Yan, Yiming Hao, Liang Li 0003, Jian Yin 0003, Anan Liu, Zhendong Mao, Zhenyu Chen 0003, Xingyu Gao. Task-Adaptive Attention for Image Captioning |
| 52 | -- | 62 | Yi Bin, Yujuan Ding, Bo Peng, Liang Peng, Yang Yang 0002, Tat-Seng Chua. Entity Slot Filling for Visual Captioning |
| 63 | -- | 74 | Jipeng Zhang, Jie Shao, Rui Cao, Lianli Gao, Xing Xu 0001, Heng Tao Shen. Action-Centric Relation Transformer Network for Video Question Answering |
| 75 | -- | 91 | Lizhi Xiong, Xiao Han, Ching-Nung Yang, Yun Qing Shi 0001. Robust Reversible Watermarking in Encrypted Image With Secure Multi-Party Based on Lightweight Cryptography |
| 92 | -- | 104 | Yongyong Chen, Xiaolin Xiao, Chong Peng, Guangming Lu, Yicong Zhou. Low-Rank Tensor Graph Learning for Multi-View Subspace Clustering |
| 105 | -- | 119 | Jinyuan Liu, Xin Fan 0001, Ji-jiang, Risheng Liu, Zhongxuan Luo. Learning a Deep Multi-Scale Feature Ensemble and an Edge-Attention Guidance for Image Fusion |
| 120 | -- | 134 | Yu Meng, Zhan Ma. Viewport-Based Omnidirectional Video Quality Assessment: Database, Modeling and Inference |
| 135 | -- | 146 | Zichi Wang, Guorui Feng, Xinpeng Zhang 0001. Repeatable Data Hiding: Towards the Reusability of Digital Images |
| 147 | -- | 159 | Hamid Nodehi, Asadollah Shahbahrami. Multi-Metric Re-Identification for Online Multi-Person Tracking |
| 160 | -- | 171 | Hongchen Tan, Xiuping Liu, Yuhao Bian, Huasheng Wang, Baocai Yin. Incomplete Descriptor Mining With Elastic Loss for Person Re-Identification |
| 172 | -- | 182 | Youngeun Kim, Sungeun Hong. Adaptive Graph Adversarial Networks for Partial Domain Adaptation |
| 183 | -- | 197 | Wenjun Shi, Jingwei Xu, Dongchen Zhu, Guanghui Zhang, Xianshun Wang, Jiamao Li, Xiaolin Zhang. RGB-D Semantic Segmentation and Label-Oriented Voxelgrid Fusion for Accurate 3D Semantic Mapping |
| 198 | -- | 209 | Tianlang Chen, Chen Fang, Xiaohui Shen, Yiheng Zhu, Zhili Chen, Jiebo Luo. Anatomy-Aware 3D Human Pose Estimation With Bone-Based Pose Decomposition |
| 210 | -- | 223 | Ting Wang, Wing W. Y. Ng, Jinde Li, Qiuxia Wu, Shuai Zhang 0001, Chris D. Nugent, Colin Shewell. A Deep Clustering via Automatic Feature Embedded Learning for Human Activity Recognition |
| 224 | -- | 239 | Xinlin Wang, Shuiping Gou, Jichao Li, Yinghai Zhao, Zhen Liu, Changzhe Jiao, Shasha Mao. Self-Paced Feature Attention Fusion Network for Concealed Object Detection in Millimeter-Wave Image |
| 240 | -- | 252 | Cen Chen, Kenli Li 0001, Wei Wei 0006, Joey Tianyi Zhou, Zeng Zeng. Hierarchical Graph Neural Networks for Few-Shot Learning |
| 253 | -- | 261 | Wanli Xue, Weilun Xie, Yao Zhang, Shengyong Chen. Stable Linear Structures and Seam Measurements for Parallax Image Stitching |
| 262 | -- | 274 | Wenjie Yang, Houjing Huang, Xiaotang Chen, Kaiqi Huang. Bottom-Up Foreground-Aware Feature Fusion for Practical Person Search |
| 275 | -- | 285 | Hehe Fan, Tao Zhuo, Xin Yu 0002, Yi Yang, Mohan S. Kankanhalli. Understanding Atomic Hand-Object Interaction With Human Intention |
| 286 | -- | 301 | Biswarup Ganguly, Anwesa Bhattacharya, Ananya Srivastava, Debangshu Dey, Sugata Munshi. Single Image Haze Removal With Haze Map Optimization for Various Haze Concentrations |
| 302 | -- | 314 | Baoyu Chen, Yi Zhang, Hongchen Tan, Baocai Yin, Xiuping Liu. PMAN: Progressive Multi-Attention Network for Human Pose Transfer |
| 315 | -- | 329 | Tianshan Liu, Kin-Man Lam 0001, Rui Zhao, Guoping Qiu. Deep Cross-Modal Representation Learning and Distillation for Illumination-Invariant Pedestrian Detection |
| 330 | -- | 344 | Wencheng Zhu, Jiahao Li, Jiwen Lu, Jie Zhou 0001. Separable Structure Modeling for Semi-Supervised Video Object Segmentation |
| 345 | -- | 358 | Zhaoqing Pan, Weijie Yu, Jianjun Lei, Nam Ling, Sam Kwong. TSAN: Synthesized View Quality Enhancement via Two-Stream Attention Network for 3D-HEVC |
| 359 | -- | 373 | Kun Yang, Dong Liu 0002, Zhibo Chen 0001, Feng Wu 0001, Weiping Li. Spatiotemporal Generative Adversarial Network-Based Dynamic Texture Synthesis for Surveillance Video Coding |
| 374 | -- | 387 | Zijie Zhuang, Longhui Wei, Lingxi Xie, Haizhou Ai, Qi Tian 0001. Camera-Based Batch Normalization: An Effective Distribution Alignment Method for Person Re-Identification |
| 388 | -- | 397 | Jie Wu, Chunlei Wu, Jing Lu, Leiquan Wang, Xue-rong Cui. Region Reinforcement Network With Topic Constraint for Image-Text Matching |
| 398 | -- | 410 | Guilherme Paim, Hussam Amrouch, Eduardo Antônio César da Costa, Sergio Bampi, Jörg Henkel. Bridging the Gap Between Voltage Over-Scaling and Joint Hardware Accelerator-Algorithm Closed-Loop |
| 411 | -- | 422 | Yeongmin Lee, Hyeji Kim. A High-Throughput Depth Estimation Processor for Accurate Semiglobal Stereo Matching Using Pipelined Inter-Pixel Aggregation |
| 423 | -- | 436 | Chenqi Kong, Baoliang Chen, Wenhan Yang, Haoliang Li, Peilin Chen, Shiqi Wang 0001. Appearance Matters, So Does Audio: Revealing the Hidden Face via Cross-Modality Transfer |
| 437 | -- | 450 | Ning Xie 0007, Junjie Chen, Yicong Chen, Ji Hu, Qiqi Zhang, Changsheng Chen, Lei Huang 0001. Detection of Information Hiding at Anti-Copying 2D Barcodes |