Journal: IEEE Trans. Circuits Syst. Video Techn.

Volume 34, Issue 9

7803 -- 7819 Kai Niu 0002, Yanyi Liu, Yuzhou Long, Yan Huang 0008, Liang Wang 0001, Yanning Zhang. An Overview of Text-Based Person Search: Recent Advances and Future Directions
7820 -- 7829 Guoyu Yang, Jie Lei 0002, Hao Tian, Zunlei Feng, Ronghua Liang. Asymptotic Feature Pyramid Network for Labeling Pixels and Regions
7830 -- 7843 Ke Song, Guoqiang Liang, Zhaojie Chen, Yanning Zhang. Non-Exemplar Class-Incremental Learning by Random Auxiliary Classes Augmentation and Mixed Features
7844 -- 7855 Wujie Zhou, Bitao Jian, Meixin Fang, Xiena Dong, Yuanyuan Liu 0004, Qiuping Jiang. DGPINet-KD: Deep Guided and Progressive Integration Network With Knowledge Distillation for RGB-D Indoor Scene Analysis
7856 -- 7869 Qiming Li, Jinghang Cheng, Yin Gao, Jun Li 0043. Learning Geometric Information via Transformer Network for Key-Points Based Motion Segmentation
7870 -- 7881 Yandong Bi, Huajie Jiang, Yongli Hu, Yanfeng Sun, Baocai Yin. Fair Attention Network for Robust Visual Question Answering
7882 -- 7895 Jian Sun, Hao Sun, Lin Lei, Kefeng Ji, Gangyao Kuang. TirSA: A Three Stage Approach for UAV-Satellite Cross-View Geo-Localization Based on Self-Supervised Feature Enhancement
7896 -- 7911 Yu Xue, Lai-Man Po, Wing Yin Yu, Haoxuan Wu, Xuyuan Xu, Kun Li, Yuyang Liu. Self-Calibration Flow Guided Denoising Diffusion Model for Human Pose Transfer
7912 -- 7921 Yi Zhang, Xiaotian Zhu. Attention-Based Layer Fusion and Token Masking for Weakly Supervised Semantic Segmentation
7922 -- 7934 Dongyue Li, Songlin Du. ContextMatcher: Detector-Free Feature Matching With Cross-Modality Context
7935 -- 7946 Daosong Hu, Kai Huang 0001. Semi-Supervised Multitask Learning Using Gaze Focus for Gaze Estimation
7947 -- 7961 Hao Feng, Keyi Zhou, Wengang Zhou, Yufei Yin, Jiajun Deng, Qi Sun, Houqiang Li. Recurrent Generic Contour-Based Instance Segmentation With Progressive Learning
7962 -- 7974 Yisheng Zhao, Huaiyu Zhu 0004, Ruohong Huan, Yaoqi Bao, Yun Pan. Heterogeneous Graph Network for Action Detection
7975 -- 7985 Sungjune Park, Hyunjun Kim, Yong Man Ro. Integrating Language-Derived Appearance Elements With Visual Cues in Pedestrian Detection
7986 -- 7997 Enhao Zhang, Chuanxing Geng, Chaohua Li, Songcan Chen. Dynamic Learnable Logit Adjustment for Long-Tailed Visual Recognition
7998 -- 8012 Fupeng Chu, Yang Cong, Ronghan Chen. OPEN: Occlusion-Invariant Perception Network for Single Image-Based 3D Shape Retrieval
8013 -- 8025 Xiao He, Mingrui Zhu, Nannan Wang 0001, Xinbo Gao 0001. Few-Shot Font Generation by Learning Style Difference and Similarity
8026 -- 8040 Tianhuan Huang, Xianye Ben, Chen Gong 0002, Wenzheng Xu, Qiang Wu 0001, Hongchao Zhou. GaitDAN: Cross-View Gait Recognition via Adversarial Domain Adaptation
8041 -- 8052 Yinan Wu 0001, Licheng Jiao, Xu Liu 0006, Fang Liu 0001, Shuyuan Yang, Lingling Li 0002. Domain Adaptation-Aware Transformer for Hyperspectral Object Tracking
8053 -- 8066 Shiyao Li, Zhenhua Zhu, Hanbo Sun, Xuefei Ning, Guohao Dai, Yiming Hu, Huazhong Yang, Yu Wang 0002. Toward High-Accuracy and Real-Time Two-Stage Small Object Detection on FPGA
8067 -- 8079 Fan Wan, Xingyu Miao, Haoran Duan, Jingjing Deng 0001, Rui Gao, Yang Long 0001. Sentinel-Guided Zero-Shot Learning: A Collaborative Paradigm Without Real Data Exposure
8080 -- 8092 Chenhao Wu, Qingbo Wu 0001, Rui Ma, King Ngi Ngan, Hongliang Li 0001, Fanman Meng, Heqian Qiu. Continual Cross-Domain Image Compression via Entropy Prior Guided Knowledge Distillation and Scalable Decoding
8093 -- 8106 Shuai Guo 0002, Qiuwen Wang, Yijie Gao, Rong Xie, Lin Li 0062, Fang Zhu, Li Song 0001. Depth-Guided Robust Point Cloud Fusion NeRF for Sparse Input Views
8107 -- 8121 Yufan Wang, Le Huang, Qunfei Zhao, Zeyang Xia, Ning Zhao. Hybrid Shape Deformation for Face Reconstruction in Aesthetic Orthodontics
8122 -- 8134 Shaojie Zhang, Jianqin Yin, Yonghao Dang, Jiajun Fu. SiT-MLP: A Simple MLP With Point-Wise Topology Feature Learning for Skeleton-Based Action Recognition
8135 -- 8147 Jiahe Zhu, Jinji Zheng, Xinyi Xia, Yifan Li, Zhiru Li, Xicai Li. IGM-MELv2: Infrared Guiding Modal Multiuser Eye Localization System on ARM CPU for Autostereoscopic Displays
8148 -- 8160 Anlei Zhu, YingHui Wang, Jinlong Yang 0002, Tao Yan, Haomiao Ma, Wei Li 0121. YOWOv3: A Lightweight Spatio-Temporal Joint Network for Video Action Detection
8161 -- 8171 Chang Liu 0071, Jie Zhao 0014, Chunjuan Bo, Shengming Li, Dong Wang 0004, Huchuan Lu. LGTrack: Exploiting Local and Global Properties for Robust Visual Tracking
8172 -- 8187 Zhaoqilin Yang, GaoYun An, ZhenXing Zheng, Shan Cao, Qiuqi Ruan. GBC: Guided Alignment and Adaptive Boosting CLIP Bridging Vision and Language for Robust Action Recognition
8188 -- 8200 Jie Wu, Leyuan Fang, Jun Yue. TAKD: Target-Aware Knowledge Distillation for Remote Sensing Scene Classification
8201 -- 8214 Wenlve Zhou, Zhiheng Zhou. Unsupervised Domain Adaption Harnessing Vision-Language Pre-Training
8215 -- 8229 Wenmin Huang, Weiqi Luo 0001, Xiaochun Cao, Jiwu Huang. Interactive Generative Adversarial Networks With High-Frequency Compensation for Facial Attribute Editing
8230 -- 8241 Yifei Qian, Xiaopeng Hong, Zhongliang Guo 0001, Ognjen Arandjelovic, Carl R. Donovan. Semi-Supervised Crowd Counting With Contextual Modeling: Facilitating Holistic Understanding of Crowd Scenes
8242 -- 8252 Zhimin Wei, Zhipeng Zhang, Peng Wu, Ji Wang, Peng Wang 0015, Yanning Zhang. Fine-Granularity Alignment for Text-Based Person Retrieval Via Semantics-Centric Visual Division
8253 -- 8265 Haoyuan Jin, Xuesong Nie, Yunfeng Yan, Xi Chen, Zhihang Zhu, Donglian Qi. AHOR: Online Multi-Object Tracking With Authenticity Hierarchizing and Occlusion Recovery
8266 -- 8280 Zhilin Zhang, Chengxiu Liu, Xiaoxu Wang, Ziyu Han, Guantai Yang, Cheng Wang, Panfeng Huang, Qianbo Lu. DLP-Fusion: Depth of Field, Light Source, and Polarization Fusion Toward Intelligent Optical Imaging for Complex Scenes
8281 -- 8291 Ye Huang, Di Kang, Shenghua Gao, Wen Li 0001, Lixin Duan. High-Level Feature Guided Decoding for Semantic Segmentation
8292 -- 8309 Tung Minh Tran, Doanh C. Bui, Tam V. Nguyen 0002, Khang Nguyen 0001. Transformer-Based Spatio-Temporal Unsupervised Traffic Anomaly Detection in Aerial Videos
8310 -- 8326 Xiaoqiang Zhu, Jiayu Zhou, Lihua You, Xiaosong Yang, Jian Chang, Jian-Jun Zhang 0001, Dan Zeng 0001. DFIE3D: 3D-Aware Disentangled Face Inversion and Editing via Facial-Contrastive Learning
8327 -- 8342 Zidong Liu, Jiasong Wu, Zeyu Shen, Xin Chen, Qianyu Wu, Zhiguo Gui, Lotfi Senhadji, Huazhong Shu. Improving End-to-End Sign Language Translation With Adaptive Video Representation Enhanced Transformer
8343 -- 8354 Yongzhe Yuan, Yue Wu 0004, Mingyu Yue, Maoguo Gong, Xiaolong Fan, Wenping Ma 0001, Qiguang Miao. Learning Discriminative Features via Multi-Hierarchical Mutual Information for Unsupervised Point Cloud Registration
8355 -- 8367 ATing Yin, Yaonan Wang 0001, Jianxu Mao, Hui Zhang 0023, Xiuyi Chen. Category-Contextual Relation Encoding Network for Few-Shot Object Detection
8368 -- 8381 Wen Wen, Mu Li 0005, Yiru Yao, Xiangjie Sui, Yabin Zhang 0002, Long Lan, Yuming Fang, Kede Ma. Perceptual Quality Assessment of Virtual Reality Videos in the Wild
8382 -- 8397 Guangning Xu, Michael K. Ng 0001, Yunming Ye, Xutao Li, Ge Song, Bowen Zhang 0005, Zhichao Huang. TLS-MWP: A Tensor-Based Long- and Short-Range Convolution for Multiple Weather Prediction
8398 -- 8411 Hao Liu, Lijun He, Miao Zhang, Fan Li 0003. VADiffusion: Compressed Domain Information Guided Conditional Diffusion for Video Anomaly Detection
8412 -- 8426 Guanyi Li, Junjie Zhang 0002, Enquan Yang, Haoran Jiang, Dan Zeng 0001. Multi-Level Information Fusion Network With Edge Information Injection for Single-Band Cloud Detection
8427 -- 8441 Yiwen Shan, Dong Hu, Zhi Wang 0015. A Novel Truncated Norm Regularization Method for Multi-Channel Color Image Denoising
8442 -- 8455 Tong Qiao, Hang Shao, Shichuang Xie, Ran Shi. Unsupervised Generative Fake Image Detector
8456 -- 8468 Sicheng Pan, Yingming Li. EBDNet: Integrating Optical Flow With Kernel Prediction for Burst Denoising
8469 -- 8480 Yong Wang, Pengbo Zhou, Guohua Geng, Li An, Kang Li, Ruoxue Li. Neighborhood Multi-Compound Transformer for Point Cloud Registration
8481 -- 8493 Yuting Yang 0008, Licheng Jiao, Xu Liu 0006, Lingling Li 0002, Fang Liu 0001, Shuyuan Yang, Xiangrong Zhang. Efficient LWPooling: Rethinking the Wavelet Pooling for Scene Parsing
8494 -- 8508 Xiaoxu Chen, Jingfan Tan, Tao Wang 0052, Kaihao Zhang, Wenhan Luo, Xiaochun Cao. Toward Real-World Blind Face Restoration With Generative Diffusion Prior
8509 -- 8521 Yu Wang 0073, Liquan Chen, Kunliang Yu, Tong Fu. A Secure Spatio-Temporal Chaotic Pseudorandom Generator for Image Encryption
8522 -- 8535 Xiao Wang, Yang Lu 0009, Wanchuan Yu, Yanwei Pang, Hanzi Wang. Few-Shot Action Recognition via Multi-View Representation Learning
8536 -- 8546 Zhidan Ran, Xuan Wei, Wei Liu, Xiaobo Lu. Multiscale Aligned Spatial-Temporal Interaction for Video-Based Person Re-Identification
8547 -- 8561 Lan Li 0005, Meiping Song, Qiang Zhang 0011, Yushuai Dong, Yulei Wang 0002, Qiangqiang Yuan. Local Extremum Constrained Total Variation Model for Natural and Hyperspectral Image Non-Blind Deblurring
8562 -- 8575 Yuzhen Niu, Rui Xu, Zhihua Lin, Wenxi Liu. STD-Net: Spatio-Temporal Decomposition Network for Video Demoiréing With Sparse Transformers
8576 -- 8588 Eunpil Park, Jaejun Yoo, Jae-Young Sim. Universal Dehazing via Haze Style Transfer
8589 -- 8601 Liangliang Song, Zhixi Feng, Shuyuan Yang, Xinyu Zhang, Licheng Jiao. Interactive Spectral-Spatial Transformer for Hyperspectral Image Classification
8602 -- 8613 Meiqi Wu, Kaiqi Huang, Yuanqiang Cai, Shiyu Hu, YuZhong Zhao, Weiqiang Wang. Finger in Camera Speaks Everything: Unconstrained Air-Writing for Real-World
8614 -- 8628 Haoran Wei, Qingbo Wu 0001, Chenhao Wu, King Ngi Ngan, Hongliang Li 0001, Fanman Meng, Heqian Qiu. Robust Unpaired Image Dehazing via Adversarial Deformation Constraint
8629 -- 8643 Yurong Chen 0003, Yaonan Wang 0001, Hui Zhang 0023. Prior Images Guided Generative Autoencoder Model for Dual-Camera Compressive Spectral Imaging
8644 -- 8656 Kaijie He, Jun Xie 0003, Xinguang Dai, Kenglun Chang, Feng Chen 0044, Zhepeng Wang 0002. STADet: Streaming Timing-Aware Video Lane Detection
8657 -- 8671 Sanaz Nami, Farhad Pakdaman, Mahmoud Reza Hashemi, Shervin Shirmohammadi, Moncef Gabbouj. Lightweight Multitask Learning for Robust JND Prediction Using Latent Space and Reconstructed Frames
8672 -- 8683 Wenyan Pan, Wentao Ma, Shan Zhao 0002, Lichuan Gu, Guolong Shi, Zhihua Xia, Meng Wang 0001. Image Manipulation Detection With Cascade Hierarchical Graph Representation
8684 -- 8697 Rongqin Liang, Yuanman Li, Jiantao Zhou 0001, Xia Li 0006. Text-Driven Traffic Anomaly Detection With Temporal High-Frequency Modeling in Driving Videos
8698 -- 8709 Aihua Zheng, Fan Yuan, Haichuan Zhang, Jiaxiang Wang 0001, Chao Tang, Chenglong Li 0002. Public-Private Attributes-Based Variational Adversarial Network for Audio-Visual Cross-Modal Matching
8710 -- 8721 Jiayuan Xie, Jiali Chen, Zhenghao Liu, Yi Cai 0001, Qingbao Huang, Qing Li 0001. Video Question Generation for Dynamic Changes
8722 -- 8735 Yabin Zhu, Chenglong Li 0002, Xiao Wang 0014, Jin Tang 0001, Zhixiang Huang. RGBT Tracking via Progressive Fusion Transformer With Dynamically Guided Learning
8736 -- 8752 Yihui Fan, Xin Jin 0002, Siyao Zhou 0003, Shun Zou. Light Fields Stitching for Windowed-6DoF VR Content
8753 -- 8765 Yingxue Xu, Guihua Wen, Yang Hu, Pei Yang 0001. Modeling Hierarchical Structural Distance for Unsupervised Domain Adaptation
8766 -- 8778 Huimin Ma, Siwei Wang 0001, Junpu Zhang, Shengju Yu, Suyuan Liu, Xinwang Liu 0002, Kunlun He. Symmetric Multi-View Subspace Clustering With Automatic Neighbor Discovery
8779 -- 8793 Wei Huang, Zhiliang Peng, Li Dong 0004, Furu Wei, Qixiang Ye, Jianbin Jiao. Generic-to-Specific Distillation of Masked Autoencoders
8794 -- 8807 Xinju Wu, Pingping Zhang, Meng Wang 0017, Peilin Chen, Shiqi Wang 0001, Sam Kwong. Geometric Prior Based Deep Human Point Cloud Geometry Compression
8808 -- 8820 Xiaohan Fang, Peilin Chen, Meng Wang 0017, Xi Xie, Shiqi Wang 0001, Shanshe Wang, Siwei Ma. Exploiting Bidirectional Quality Impulse for Reference Picture Resampled Gaming Video Coding
8821 -- 8835 Chen Zhu, Guo Lu, Huanbang Chen, Donghui Feng 0003, Shen Wang, Yan Zhao, Rong Xie, Li Song 0001. A Character Position-Aware Compression Framework for Screen Text Image
8836 -- 8847 Hadi Amirpour, Klaus Schoeffmann, Mohammad Ghanbari 0001, Christian Timmerer. DeepVCA: Deep Video Complexity Analyzer
8848 -- 8861 Chunhui Yang, Jiayu Yang, Yongqi Zhai, Ronggang Wang. FICNet: An End to End Network for Free-View Image Coding
8862 -- 8880 Liying Gao, Bingliang Jiao, Yuzhou Long, Kai Niu 0002, He Huang, Peng Wang 0015, Yanning Zhang. Contrastive Pedestrian Attentive and Correlation Learning Network for Occluded Person Re-Identification
8881 -- 8895 Kaixiang Chen, Pengfei Fang, Zi Ye, Liyan Zhang 0001. Multi-Scale Explicit Matching and Mutual Subject Teacher Learning for Generalizable Person Re-Identification
8896 -- 8911 Ruomei Wang 0001, Jiawei Feng, Fuwei Zhang, Xiaonan Luo, Yuanmao Luo. Modality-Aware Heterogeneous Graph for Joint Video Moment Retrieval and Highlight Detection
8912 -- 8923 Yuzhe Fu, Changchun Zhou, Tianling Huang, Eryi Han, Yifan He, Hailong Jiao. SoftAct: A High-Precision Softmax Architecture for Transformers Supporting Nonlinear Functions
8924 -- 8938 Mingyue Niu, Ya Li, Jianhua Tao 0001, Xiuzhuang Zhou, Björn W. Schuller. DepressionMLP: A Multi-Layer Perceptron Architecture for Automatic Depression Level Prediction via Facial Keypoints and Action Units
8939 -- 8952 Xianglong Wang, Eric Rigall, Xifeng An, Zhihao Li, Qing Cai, Shu Zhang 0002, Junyu Dong. A New Benchmark and Low Computational Cost Localization Method for Cephalometric Analysis
8953 -- 8965 Zhaojie Chu, Kailing Guo, Xiaofen Xing, Yilin Lan, Bolun Cai, Xiangmin Xu. CorrTalk: Correlation Between Hierarchical Speech and Facial Activity Variances for 3D Animation
8966 -- 8971 Qinghai Zheng. Flexible and Parameter-Free Graph Learning for Multi-View Spectral Clustering
8972 -- 8977 Dengyong Zhang, Jiahao Chen, Xin Liao, Feng Li, Jiaxin Chen, Gaobo Yang. Face Forgery Detection via Multi-Feature Fusion and Local Enhancement
8978 -- 8982 Binzhe Li, Bolin Chen, Zhao Wang 0004, Shiqi Wang 0001, Yan Ye. Semantic Face Compression for Metaverse: A Compact 3D Descriptor Based Approach