Journal: IEEE Trans. Circuits Syst. Video Techn.

Volume 34, Issue 10

8984 -- 8996Keyang Cheng, Honggang Cui, Humaira abdul Ghafoor, Hao Wan, Qirong Mao, Yongzhao Zhan 0001. Tiny Object Detection via Regional Cross Self-Attention Network
8997 -- 9009Baojie Fan, Kexin Zhang, Jiandong Tian. HCPVF: Hierarchical Cascaded Point-Voxel Fusion for 3D Object Detection
9010 -- 9023Xingyue Zhao, Zhongyu Li 0002, Xiangde Luo, Peiqi Li, Peng Huang, Jianwei Zhu, Yang Liu, Jihua Zhu, Meng Yang, Shi Chang, Jun Dong. Ultrasound Nodule Segmentation Using Asymmetric Learning With Simple Clinical Annotation
9024 -- 9038Ye Huang, Di Kang, Liang Chen 0026, Wenjing Jia, Xiangjian He, Lixin Duan, Xuefei Zhe, Linchao Bao. CARD: Semantic Segmentation With Efficient Class-Aware Regularized Decoder
9039 -- 9052Lei Qi 0001, Ziang Liu 0013, Yinghuan Shi, Xin Geng 0001. Generalizable Metric Network for Cross-Domain Person Re-Identification
9053 -- 9063Guangtong Zhang, Bineng Zhong, Qihua Liang, Zhiyi Mo, Ning Li, Shuxiang Song. One-Stream Stepwise Decreasing for Vision-Language Tracking
9064 -- 9077Zixu Wang, Congxuan Zhang, Zhen Chen 0004, Weiming Hu, Ke Lu 0002, Liyue Ge, Zige Wang. ACR-Net: Learning High-Accuracy Optical Flow via Adaptive-Aware Correlation Recurrent Network
9078 -- 9089Lin Zhang, Bo Zhang 0069, Botian Shi, Jiayuan Fan 0001, Tao Chen 0003. Few-Shot Cross-Domain Object Detection With Instance-Level Prototype-Based Meta-Learning
9090 -- 9101Xingyu Chen, Jiaxu Liu, Zeyang Liu, Lipeng Wan 0003, Xuguang Lan, Nanning Zheng 0001. Knowledge Graph Enhancement for Fine-Grained Zero-Shot Learning on ImageNet21K
9102 -- 9111Xiantao Hu, Bineng Zhong, Qihua Liang, Shengping Zhang, Ning Li, Xianxian Li. Toward Modalities Correlation for RGB-T Tracking
9112 -- 9124Yuan-Ming Li, Ling-An Zeng, Jingke Meng, Wei-Shi Zheng 0001. Continual Action Assessment via Task-Consistent Score-Discriminative Feature Distribution Modeling
9125 -- 9138Xiao Lin, Minghao Zhu, Ronghao Dang, Guangliang Zhou, Shaolong Shu, Feng Lin 0001, Chengju Liu, Qijun Chen. CLIPose: Category-Level Object Pose Estimation With Pre-Trained Vision-Language Knowledge
9139 -- 9152Guolong Sun, Zhitong Xiong, Yuan Yuan 0001. Detail-Preserving and Diverse Image Translation for Adverse Visual Object Detection
9153 -- 9165Menghao Tan, Weifeng Gao, Hong Li 0007, Jin Xie 0003, Maoguo Gong. Universal Binary Neural Networks Design by Improved Differentiable Neural Architecture Search
9166 -- 9180Yuming Yan, Huimin Yu, Yubin Wang, Shuyi Song, Weihu Huang, Juncan Jin. Unified Stability and Plasticity for Lifelong Person Re-Identification in Cloth-Changing and Cloth-Consistent Scenarios
9181 -- 9194Hongbin Xu, Weitao Chen, Baigui Sun, Xuansong Xie, Wenxiong Kang. RobustMVS: Single Domain Generalized Deep Multi-View Stereo
9195 -- 9208Huilong Xie, Wenwei Song, Wenxiong Kang. Learning an Augmented RGB Representation for Dynamic Hand Gesture Authentication
9209 -- 9222Xiaoqin Zhang 0002, Yuewang Xu, Tao Wang 0052, Tangfei Liao. Multi-Prior Driven Network for RGB-D Salient Object Detection
9223 -- 9236Qibo Chen, Baozhen Ge, Jianing Quan. Unambiguous Pyramid Cost Volumes Fusion for Stereo Matching
9237 -- 9249Zhen Zhou, Qing Zhu, Mingtao Feng, Yaonan Wang 0001, Jianqiao Luo, Zhiqiang Miao, Lin Chen, Yang Mo. Unsupervised Homography Estimation With Pixel-Level SVDD
9250 -- 9263Zezong Zhang, Jianeng Tang, Feng Zhang, Tingting Huang, Mingsheng Lu. Medical Image Encryption Based on Josephus Scrambling and Dynamic Cross-Diffusion for Patient Privacy Security
9264 -- 9280Chenxi Song, Shigang Wang, Jian Wei, Yan Zhao 0012. FewarNet: An Efficient Few-Shot View Synthesis Network Based on Trend Regularization
9281 -- 9297Yaofo Chen, Yong Guo, Daihai Liao, Fanbing Lv, Hengjie Song, James Tin-Yau Kwok, Mingkui Tan. Automated Dominative Subspace Mining for Efficient Neural Architecture Search
9298 -- 9310Jiwei Shen, Shujing Lyu, Yue Lu 0001. LithoPW: Leveraging Visual Memory Encoding and Defect-Aware Optimization for Precise Determination of the Lithography Process Windows
9311 -- 9325Liuhao Zhu, Yixiang Fang, Yi Zhao, Yi Peng, Junxiang Wang, Jiangqun Ni. Lite Localization Network and DUE-Based Watermarking for Color Image Copyright Protection
9326 -- 9340Zhenlei Dai, Liangchen Hu, HuaiJiang Sun. Block Diagonal Graph Embedded Discriminative Regression for Image Representation
9341 -- 9355Weiqi Li, Bin Chen, Shuai Liu, Shijie Zhao, Bowen Du 0002, Yongbing Zhang, Jian Zhang. D³C²-Net: Dual-Domain Deep Convolutional Coding Network for Compressive Sensing
9356 -- 9370Yule Duan, Chuang Chen, Maixia Fu, Yinsheng Li, Xiuwen Gong, Fulin Luo. Dimensionality Reduction via Multiple Neighborhood-Aware Nonlinear Collaborative Analysis for Hyperspectral Image Classification
9371 -- 9385Renzhong Qiao, Hongbing Ji, Zhigang Zhu 0002, Wenbo Zhang 0007. Local-to-Global Semantic Learning for Multi-View 3D Object Detection From Point Cloud
9386 -- 9399Dingyi Li, Yu Liu 0023, Zengfu Wang, Jian Yang 0003. Video Rescaling With Recurrent Diffusion
9400 -- 9413Jiaqi Cui, Yan Wang 0015, Luping Zhou, Yuchen Fei, Jiliu Zhou, Dinggang Shen. 3D Point-Based Multi-Modal Context Clusters GAN for Low-Dose PET Image Denoising
9414 -- 9427Shaoqian Wang, Bo Li 0090, Yuchao Dai. Efficient Multi-View Stereo by Dynamic Cost Volume and Cross-Scale Propagation
9428 -- 9444Fan Wang, Xiang Zhang 0023, Zhangjie Fu. An Iterative Two-Stage Probability Adjustment Strategy With Progressive Incremental Searching for Image Steganography
9445 -- 9457Yanan Liu, Yanqiu Li, Hao Zhang 0110, Xuejie Zhang 0002, Dan Xu 0001. Decoupled Knowledge Embedded Graph Convolutional Network for Skeleton-Based Human Action Recognition
9458 -- 9471Hu Gao, Jing Yang, Ying Zhang, Ning Wang, Jingfan Yang, Depeng Dang. Prompt-Based Ingredient-Oriented All-in-One Image Restoration
9472 -- 9483Dong Huang 0001, Xiaozhi Deng, Ding-Hua Chen, Zihao Wen, Weijun Sun, Chang-Dong Wang, Jian-Huang Lai. Deep Clustering With Hybrid-Grained Contrastive and Discriminative Learning
9484 -- 9498Hui Luo, Shuhai Zhang, Zhuangwei Zhuang, Jiajie Mai, Mingkui Tan, Jianlin Zhang 0001. Learning to Generate Diverse Data From a Temporal Perspective for Data-Free Quantization
9499 -- 9514Shalayiding Sirejiding, Bayram Bayramli, Yuxiang Lu, Suizhi Huang, Hongtao Lu, Yue Ding 0001. Adaptive Task-Wise Message Passing for Multi-Task Learning: A Spatial Interaction Perspective
9515 -- 9527Qiuping Jiang, Feiyang Liu, Zhihua Wang, Shiqi Wang 0001, Weisi Lin. Rethinking and Conceptualizing Just Noticeable Difference Estimation by Residual Learning
9528 -- 9539Pan Liu, Yongqiang Zhao 0001, Kai Feng, Seong G. Kong. Physics-Driven Multispectral Filter Array Pattern Optimization and Hyperspectral Image Reconstruction
9540 -- 9549Siyang Dai, Jun Liu 0036, Ngai-Man Cheung. Uncertainty-Aware Pedestrian Crossing Prediction via Reinforcement Learning
9550 -- 9561Junteng Zhang, Junzhe Zhang, Wenxi Ma, Dandan Ding, Zhan Ma. Content-Aware Rate Control for Geometry-Based Point Cloud Compression
9562 -- 9577Yangke Ying, Jin Wang, Yunhui Shi, Nam Ling, Baocai Yin. Dual-Domain Feature Fusion and Multi-Level Memory-Enhanced Network for Spectral Compressive Imaging
9578 -- 9590Kai Xu 0012, Lichun Wang 0002, Shuang Li, Jianjia Xin, Baocai Yin. Self-Distillation With Augmentation in Feature Space
9591 -- 9605Hengyu Man, Xiaopeng Fan, Riyu Lu, Chang Yu, Debin Zhao. MetaIP: Meta-Network-Based Intra Prediction With Customized Parameters for Video Coding
9606 -- 9619Daxin Li, Yuanchao Bai, Kai Wang, Junjun Jiang, Xianming Liu, Wen Gao 0001. GroupedMixer: An Entropy Model With Group-Wise Token-Mixers for Learned Image Compression
9620 -- 9632Hanyue Tu, Li Li 0040, Wengang Zhou, Houqiang Li. Toward On-Demand Transmission: Joint Feature and Image Coding With Reversible Neural Networks
9633 -- 9646Yiting Shao, Xiaodong Yang, Wei Gao 0003, Shan Liu 0001, Ge Li 0002. 3D Point Cloud Attribute Compression Using Diffusion-Based Texture-Aware Intra Prediction
9647 -- 9663Ziqing Ge, Siwei Ma, Wen Gao 0001, Jingshan Pan, Chuanmin Jia. NLIC: Non-Uniform Quantization-Based Learned Image Compression
9664 -- 9677Fengling Li, Bowen Wang, Lei Zhu 0002, Jingjing Li 0001, Zheng Zhang 0006, Xiaojun Chang. Cross-Domain Transfer Hashing for Efficient Cross-Modal Retrieval
9678 -- 9691Guoxin Xiong, Meng Meng, Tianzhu Zhang, Dongming Zhang, Yongdong Zhang 0001. Reference-Aware Adaptive Network for Image-Text Matching
9692 -- 9705Sheng Fang, Tiantian Dang, Shuhui Wang, Qingming Huang. Linguistic Hallucination for Text-Based Video Retrieval
9706 -- 9717Wenrui Li, Ruiqin Xiong, Xiaopeng Fan. Multi-Layer Probabilistic Association Reasoning Network for Image-Text Retrieval
9718 -- 9731Sheng Liu, Annan Li, Yuwei Zhao, Jiahao Wang, Yunhong Wang. EvCap: Element-Aware Video Captioning
9732 -- 9744Linhao Qu, Yingfan Ma, Xiaoyuan Luo, Qinhao Guo, Manning Wang, Zhijian Song. Rethinking Multiple Instance Learning for Whole Slide Image Classification: A Good Instance Classifier Is All You Need
9745 -- 9756Zhaohuan Zhan, Jinghui Qin, Wei Zhuo 0006, Guang Tan. Enhancing Vision and Language Navigation With Prompt-Based Scene Knowledge
9760 -- 9773Weihong Ren, Jinguo Luo, Weibo Jiang, Liangqiong Qu, Zhi Han, Jiandong Tian, Honghai Liu 0001. Learning Self- and Cross-Triplet Context Clues for Human-Object Interaction Detection
9774 -- 9785Ning Li, Bineng Zhong, Yaozong Zheng, Qihua Liang, Zhiyi Mo, Shuxiang Song. Robust Tracking via Combing Top-Down and Bottom-Up Attention
9786 -- 9797Xun Gong 0002, Xuan Tan, Yang Xiang. Contrastive Mean Teacher for Intra-Camera Supervised Person Re-Identification
9798 -- 9807MingQi Lu, Siyuan Yang, Xiaobo Lu, Jun Liu 0036. Cross-Modal Contrastive Pre-Training for Few-Shot Skeleton Action Recognition
9808 -- 9821Hu Huang, Shuiping Gou, Ruimin Li, Xinbo Gao 0001. Joint-Wise Temporal Self-Similarity Periodic Selection Network for Repetitive Fitness Action Counting
9822 -- 9835KyuJin Shim, Junyoung Byun, Kangwook Ko, Jubi Hwang, Changick Kim. Enhancing Robustness of Multi-Object Trackers With Temporal Feature Mix
9836 -- 9851Zihao Dong, Zizhen Liu, Runmin Cong, Tiyu Fang, Xiuli Shao, Sam Kwong. UAFer: A Unified Model for Class-Agnostic Binary Segmentation With Uncertainty-Aware Feature Reassembly
9852 -- 9865Lorenzo Papa, Paolo Russo 0001, Irene Amerini. D4D: An RGBD Diffusion Model to Boost Monocular Depth Estimation
9866 -- 9881Jiaping Lin, Gang Liang, Rongchuan Zhang. LTTrack: Rethinking the Tracking Framework for Long-Term Multi-Object Tracking
9882 -- 9897Chengxing Lin, Wenju Xu, Jian Zhu 0001, Yongwei Nie, Ruichu Cai, Xuemiao Xu. PatchMixing Masked Autoencoders for 3D Point Cloud Self-Supervised Learning
9898 -- 9909Yi He, Lei Yang, Shilin Wang, Alan Wee-Chung Liew. Lip Feature Disentanglement for Visual Speaker Authentication in Natural Scenes
9910 -- 9924Yijing Dai, Yingjian Li, Dongpeng Chen, Jinxing Li, Guangming Lu. Multimodal Decoupled Distillation Graph Neural Network for Emotion Recognition in Conversation
9925 -- 9938Rui Ding, Meng Yang 0002, Nanning Zheng 0001. Selective Transfer Learning of Cross-Modality Distillation for Monocular 3D Object Detection
9939 -- 9953Shengyu Hou, Mengyin Fu, Rongchuan Wang, Yi Yang 0009, Wenjie Song 0001. Self-Supervised Monocular Depth Estimation for All-Day Images Based on Dual-Axis Transformer
9954 -- 9966Hang Yao, Qiguang Miao, Peipei Zhao, Chaoneng Li, Xin Li, Guanwen Feng, Ruyi Liu. Exploration of Class Center for Fine-Grained Visual Classification
9967 -- 9978Haoran Wang, Qinghua Cheng, Baosheng Yu, Yibing Zhan, Dapeng Tao, Liang Ding 0006, Haibin Ling. Free-Form Composition Networks for Egocentric Action Recognition
9979 -- 9996Qin Yang, Wenxuan Gao, Chenglin Li, Hao Wang 0183, Wenrui Dai, Junni Zou, Hongkai Xiong, Pascal Frossard. 360Spred: Saliency Prediction for 360-Degree Videos Based on 3D Separable Graph Convolutional Networks
9997 -- 10010Zhuo Chen, Xudong Xu, Yichao Yan, Ye Pan, Wenhan Zhu, Wayne Wu, Bo Dai 0002, Xiaokang Yang. HyperStyle3D: Text-Guided 3D Portrait Stylization via Hypernetworks
10011 -- 10022Zhiqin Zhu, Renzhong Zheng, Guanqiu Qi, Shuang Li, Yuanyuan Li, Xinbo Gao 0001. Small Object Detection Method Based on Global Multi-Level Perception and Dynamic Region Aggregation
10023 -- 10035JianXiong Zhou, Ying Wu 0001. Outlier-Probability-Based Feature Adaptation for Robust Unsupervised Anomaly Detection on Contaminated Training Data
10036 -- 10049Xuelin Zhu, Jianshu Li, Jiuxin Cao, Dongqi Tang, Jian Liu, Bo Liu 0004. Semantic-Guided Representation Enhancement for Multi-Label Image Classification
10050 -- 10062Chunlei Peng, Bo Wang, Decheng Liu, Nannan Wang 0001, Ruimin Hu, Xinbo Gao 0001. MRLReID: Unconstrained Cross-Resolution Person Re-Identification With Multi-Task Resolution Learning
10063 -- 10076Wanying Zhang, Mengyuan Liu, Xinshun Wang, Shen Zhao, Can Wang 0006. CHAMP: A Large-Scale Dataset for Skeleton-Based Composite HumAn Motion Prediction
10077 -- 10091Xiaoqin Zhang 0002, Hongqi Yu, Yong Qin, Xiaolong Zhou 0001, Sixian Chan. Video-Based Multi-Camera Vehicle Tracking via Appearance-Parsing Spatio-Temporal Trajectory Matching Network
10092 -- 10106Zhi-Long Han, Ting-Zhu Huang, Xi-Le Zhao, Hao Zhang, Wei-Hao Wu. Nested Fully-Connected Tensor Network Decomposition for Multi-Dimensional Visual Data Recovery
10107 -- 10120Linwei Fan, Jin Cui, Huiyu Li, Xiaoyu Yan, Hui Liu 0016, Caiming Zhang 0001. Complementary Blind-Spot Network for Self-Supervised Real Image Denoising
10121 -- 10134Mingkai Qiu, Yuhuan Lu, Xiying Li, Qiang Lu. Camera-Aware Differentiated Clustering With Focal Contrastive Learning for Unsupervised Vehicle Re-Identification
10135 -- 10151Mohsen Jenadeleh, Raouf Hamzaoui, Ulf-Dietrich Reips, Dietmar Saupe. Crowdsourced Estimation of Collective Just Noticeable Difference for Compressed Video With the Flicker Test and QUEST+
10152 -- 10165Zetao Shi, Yuenan Li 0001, Feiyang Zhang. Reflection Removal via Recurrent Learning Guided by Physics Prior and Focal Perceptual Loss
10166 -- 10181Ling Li, Yan Zhang, Lin Yuan, Xinbo Gao 0001. PLGNet: Prior-Guided Local and Global Interactive Hybrid Network for Face Super-Resolution
10182 -- 10193Can Xu, Le Hui, Yuehui Han, Haobo Jiang, Jiaxin Chen, Jin Xie 0001, Jian Yang 0003. Learning Local Semantic Region Activations for Weakly Supervised Object Localization
10194 -- 10207Zijian Liu, Xiaoheng Deng, Ping Jiang, Conghao Lv, Geyong Min, Xin Wang. Edge Perception Camouflaged Object Detection Under Frequency Domain Reconstruction
10208 -- 10222Wanyun Li, Jack Fan, Pinxue Guo, Lingyi Hong, Wei Zhang. HFVOS: History-Future Integrated Dynamic Memory for Video Object Segmentation
10223 -- 10236Cong Zhang, Honggang Qi, Shuhui Wang, Yuezun Li, Siwei Lyu. COMICS: End-to-End Bi-Grained Contrastive Learning for Multi-Face Forgery Detection
10237 -- 10249Zhiqiang Kou, Jing Wang 0113, Yuheng Jia, Xin Geng 0001. Inaccurate Label Distribution Learning
10250 -- 10265Jiaxi Liu, Jinghao Niu, Weifeng Li, Xin Li, Binbin He, Hao Zhou, Yanjuan Liu, Ding Li, Bo Wang, Wensheng Zhang 0002. XFMP: A Benchmark for Explainable Fine-Grained Abnormal Behavior Recognition on Medical Personal Protective Equipment
10266 -- 10280Wentao Zou, Xiao Lu 0002, Zhilv Yi, Ling Zhang, Gang Fu, Ping Li 0016, Chunxia Xiao. Eyeglass Reflection Removal With Joint Learning of Reflection Elimination and Content Inpainting
10281 -- 10298Deyang Wu, Xinpeng Zhang 0001, Jiayan Wang, Li Li, Guorui Feng. Novel Robust Video Watermarking Scheme Based on Concentric Ring Subband and Visual Cryptography With Piecewise Linear Chaotic Mapping
10299 -- 10309Huasheng Wang, Jiang Liu, Hongchen Tan, Jianxun Lou, Xiaochang Liu, Wei Zhou 0021, Hantao Liu. Blind Image Quality Assessment via Adaptive Graph Attention
10310 -- 10325Xinyi Wu, Santiago López-Tapia, Xijun Wang 0003, Rafael Molina 0001, Aggelos K. Katsaggelos. Real-Time Lightweight Video Super-Resolution With RRED-Based Perceptual Constraint
10326 -- 10338Ziyang Hong, C. Patrick Yue. Real-Time 3D Visual Perception by Cross-Dimensional Refined Learning
10339 -- 10352Wang Liu, Wei Gao 0003, Ge Li 0002, Siwei Ma, Tiesong Zhao, Hui Yuan 0001. Enlarged Motion-Aware and Frequency-Aware Network for Compressed Video Artifact Reduction
10353 -- 10367Daixun Li, Weiying Xie, Zixuan Wang, Yibing Lu, Yunsong Li, Leyuan Fang. FedDiff: Diffusion Model Driven Federated Learning for Multi-Modal and Multi-Clients
10368 -- 10384Jiaxuan Zhao, Licheng Jiao, Chao Wang, Xu Liu 0006, Fang Liu 0001, Lingling Li 0002, Mengru Ma, Shuyuan Yang. Knowledge Guided Evolutionary Transformer for Remote Sensing Scene Classification
10385 -- 10398Chunyan She, Fujun Han, Lidan Wang 0001, Shukai Duan, Tingwen Huang. MPC-Net: Multi-Prior Collaborative Network for Low-Light Image Enhancement
10399 -- 10410Wansen Wu, Meng Cao, Yue Hu, Yong Peng 0006, Long Qin, Quanjun Yin. Visual Grounding With Dual Knowledge Distillation
10411 -- 10423Zhiqi Pang, Lingling Zhao, Yang Liu, Gaurav Sharma 0001, Chunyu Wang. Inter-Modality Similarity Learning for Unsupervised Multi-Modality Person Re-Identification
10424 -- 10436Yue Que, Li Xiong 0018, Weiguo Wan, Xue Xia, Zhiwei Liu. Denoising Diffusion Probabilistic Model for Face Sketch-to-Photo Synthesis
10437 -- 10448Shenghao Li, Zezeng Li, Zhanpeng Wang, Zebin Xu, Na Lei, Zhongxuan Luo. Measure-Driven Neural Solver for Optimal Transport Mapping
10449 -- 10463Xianyao You, Caiyun Liu, Jun Li, Yan Sun, Ximeng Liu. FedMDO: Privacy-Preserving Federated Learning via Mixup Differential Objective
10464 -- 10478Hui Liu 0016, Gongguan Chen, Meng Liu 0006, Liqiang Nie. Pre-Trained Transformer-Based Parallel Multi-Channel Adaptive Image Sequence Interpolation Network
10479 -- 10493Wu Chen, Qiuping Jiang, Wei Zhou 0021, Long Xu, Weisi Lin. Dynamic Hypergraph Convolutional Network for No-Reference Point Cloud Quality Assessment
10494 -- 10506Yahui Xu, Jiwei Wei, Yi Bin, Yang Yang 0002, Zeyu Ma, Heng Tao Shen. Set of Diverse Queries With Uncertainty Regularization for Composed Image Retrieval
10507 -- 10520Yang Liu, Fang Liu 0001, Licheng Jiao, Qianyue Bao, Long Sun, Shuo Li 0010, Lingling Li 0002, Xu Liu 0006. Multi-Grained Gradual Inference Model for Multimedia Event Extraction