Journal: IEEE Trans. Circuits Syst. Video Techn.

Volume 35, Issue 1

3 -- 18 Xijie Cheng, Xiaohui He 0001, Mengjia Qiao, Panle Li, Peng Chang, Tianhao Zhang, Xiaoyu Guo, Jinyong Wang, Zhihui Tian, Guangsheng Zhou. Multi-View Graph Convolutional Network With Spectral Component Decompose for Remote Sensing Images Classification
19 -- 32 Junbin Zhuang 0001, Yan Zheng, Baolong Guo 0001, Yunyi Yan. Globally Deformable Information Selection Transformer for Underwater Image Enhancement
33 -- 44 Jianan Li 0001, Xiaoying Yuan, Haolin Qin, Ying Wang 0064, Xincong Liu, Tingfa Xu. CVT-Track: Concentrating on Valid Tokens for One-Stream Tracking
45 -- 61 Minglei Li 0002, Wushuang Gong, Pengfei Yan, Xiang Li 0084, Yuchen Jiang, Hao Luo 0003, Hang Zhou, Shen Yin. Joint Lesion Detection and Classification of Breast Ultrasound Video via a Clinical Knowledge-Aware Framework
62 -- 74 Yihui Liang, Qian Fu, Kun Zou, Guisong Liu, Han Huang 0002. Enhancing Transparent Object Matting Using Predicted Definite Foreground and Background
75 -- 90 Chunlei Wang, Wenquan Feng, Shuchang Lyu, Guangliang Cheng, Xiangtai Li, Binghao Liu, Qi Zhao 0001. A Masked Reference Token Supervision-Based Iterative Visual-Language Framework for Robust Visual Grounding
91 -- 102 Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Jianlin Feng, Hongyang Chao, Tao Mei 0001. Exploring Vision-Language Foundation Model for Novel Object Captioning
103 -- 117 Jiabing Xiong, Qiang Ling 0001. Mask-Guided Siamese Tracking With a Frequency-Spatial Hybrid Network
118 -- 133 Qinglei Li, Qi Wang, Yongbin Qin, Xinyu Dong, Xingcai Wu, Shiming Chen 0002, Wu Liu, Yong-Jin Liu 0001, Jiebo Luo 0001. DRC: Discrete Representation Classifier With Salient Features via Fixed-Prototype
134 -- 147 Pengxiang Li 0002, Chengtang Yao, Yunde Jia, Yuwei Wu 0001. Inter-Scale Similarity Guided Cost Aggregation for Stereo Matching
148 -- 161 Xiao Wang 0014, Jiandong Jin, Chenglong Li 0002, Jin Tang 0001, Cheng Zhang 0010, Wei Wang 0115. Pedestrian Attribute Recognition via CLIP-Based Prompt Vision-Language Fusion
162 -- 177 Zongyang Zhao, Jiehu Kang, Luyuan Feng, Jian Liang, Yuqi Ren, Bin Wu. LFA-Net: Enhanced PointNet and Assignable Weights Transformer Network for Partial-to-Partial Point Cloud Registration
178 -- 194 Jiehua Zhang, Liang Li 0003, Chenggang Yan 0001, Wei Ke 0003, Yihong Gong. Monocular Depth Estimation on Adverse Weathers With Curriculum Domain Distribution Alignment
195 -- 206 Junrui Xiao, Zhikai Li, Jianquan Li, Lianwei Yang, Qingyi Gu. BinaryViT: Toward Efficient and Accurate Binary Vision Transformers
207 -- 218 Minjun Shen, Guobao Xiao, Changcai Yang, Junwen Guo, Lei Zhu 0002. CLG-Net: Rethinking Local and Global Perception in Lightweight Two-View Correspondence Learning
219 -- 231 Zhuomin Liang, Liang Bai, Jinyu Fan, Xian Yang 0001, Jiye Liang. Contrastive Learning With Enhancing Detailed Information for Pre-Training Vision Transformer
232 -- 244 Bo Liu, Chengrong Yang, Jing Guo, Yun Yang 0003. A Novel Semi-Supervised Object Detection Approach via Scale Rebalancing and Global Proposal Contrast Consistency
245 -- 258 Yijin Yang, Xiaodong Gu 0001. Attention-Based Gating Network for Robust Segmentation Tracking
259 -- 272 Zhuang Luo, Yang Xiao 0007, Feng Yang 0012, Joey Tianyi Zhou, Zhiwen Fang. Rhythmer: Ranking-Based Skill Assessment With Rhythm-Aware Transformer
273 -- 286 Anjun Chen, Xiangyu Wang, Kun Shi 0003, Yuchi Huo, Jiming Chen 0001, Qi Ye. Toward Weather-Robust 3D Human Body Reconstruction: Millimeter-Wave Radar-Based Dataset, Benchmark, and Multi-Modal Fusion
287 -- 299 Fei Wu 0001, Jun Yin, Xiaochuan Li, Jianfeng Wu, Da Jin, Jiamin Yang. CoNet: A Consistency-Oriented Network for Camouflaged Object Segmentation
300 -- 314 Tongtong Yuan, Xuange Zhang, Bo Liu 0011, Kun Liu, Jian Jin, Zhenzhen Jiao. Surveillance Video-and-Language Understanding: From Small to Large Multimodal Models
315 -- 328 Hao Liu, Yong Zhou 0003, Bing Liu 0016, Ming Yan 0007, Joey Tianyi Zhou. L2A: Learning Affinity From Attention for Weakly Supervised Continual Semantic Segmentation
329 -- 342 Min Xie, Jieyu Zhao, Kedi Shen. A Novel SO(3) Rotational Equivariant Masked Autoencoder for 3D Mesh Object Analysis
343 -- 356 Yuwen Pan, Rui Sun 0006, Yuan Wang, Wenfei Yang, Tianzhu Zhang, Yongdong Zhang 0001. Purify Then Guide: A Bi-Directional Bridge Network for Open-Vocabulary Semantic Segmentation
357 -- 366 Shanaka Ramesh Gunasekara, Wanqing Li 0001, Jack Yang 0003, Philip O. Ogunbona. Asynchronous Joint-Based Temporal Pooling for Skeleton-Based Action Recognition
367 -- 379 Peiyu Guan, Zhiqiang Cao, Shengxuan Fan, Yuequan Yang, Junzhi Yu, Shuo Wang 0001. Hardness-Aware Metric Learning With Cluster-Guided Attention for Visual Place Recognition
380 -- 393 Jinfan Liu, Yichao Yan, Junjie Li, Weiming Zhao, Pengzhi Chu, Xingdong Sheng, Yunhui Liu 0006, Xiaokang Yang. IPAD: Industrial Process Anomaly Detection Dataset
394 -- 404 Tianyu Sun, Dingchang Hu, Yixiang Dai, Guijin Wang. Diffusion-Based Depth Inpainting for Transparent and Reflective Objects
405 -- 417 Xiaomin Li, Qinghe Wang, Dezhuang Li, Mengmeng Ge, Xu Jia 0012, You He, Huchuan Lu. MoBox: Enhancing Video Object Segmentation With Motion-Augmented Box Supervision
418 -- 430 Kexing Ding, Ting Lu 0002, Wei Fu 0003, Leyuan Fang. Cross-Scene Hyperspectral Image Classification With Consistency-Aware Customized Learning
431 -- 444 Pan Liu, Yuanyang Bu, Yong-Qiang Zhao 0001, Seong G. Kong. Enhancing Visual Data Completion With Pseudo Side Information Regularization
445 -- 460 Fan Yang 0032, Sosuke Yamao, Ikuo Kusajima, Atsunori Moteki, Shoichi Masui, Shan Jiang 0006. YOWO: You Only Walk Once to Jointly Map an Indoor Scene and Register Ceiling-Mounted Cameras
461 -- 476 HuaQing Hao, Weibin Liu, Weiwei Xing. Prior-Structure Driven Weakly-Supervised Learning for Fine-Grained Human Parsing
477 -- 491 Ke Gu 0001, Hongyan Liu 0004, Yuchen Liu, Junfei Qiao 0001, Guangtao Zhai, Wenjun Zhang 0001. Perceptual Information Fidelity for Quality Estimation of Industrial Images
492 -- 507 Linfeng Ma, Han Fang, Zehua Ma, Zhaoyang Jia, Weiming Zhang 0001, Nenghai Yu. C³shartMark: A Chart Watermarking Scheme With Consecutive-Encoding and Concurrent-Decoding
508 -- 519 Jiawei Mao, Guangyi Zhao, Xuesong Yin, Yuanqi Chang. SwinStyleformer is a Favorable Choice for Image Inversion
520 -- 533 Wenhao Xu, Changwei Wang 0001, Xuxiang Feng, Rongtao Xu, Longzhao Huang, Zherui Zhang, Li Guo 0004, Shibiao Xu. Generalization Boosted Adapter for Open-Vocabulary Segmentation
534 -- 546 Weijia Wu 0001, Zhuang Li, Yuanqiang Cai, Hong Zhou, Mike Zheng Shou. A Bilingual, Open World Video Text Dataset and Real-Time Video Text Spotting With Contrastive Learning
547 -- 560 Fan Yang, Binbin Liang, Wei Li 0075, Jianwei Zhang 0013. Multidimensional Fusion Network for Multispectral Object Detection
561 -- 576 Yuxuan Gu, Yi Jin 0002, Ben Wang 0005, Zhixiang Wei, Xiaoxiao Ma 0006, Haoxuan Wang, Pengyang Ling, Huaian Chen, Enhong Chen. Seed Optimization With Frozen Generator for Superior Zero-Shot Low-Light Image Enhancement
577 -- 588 Jiacheng Hou, Zhong Ji, Jinyu Yang, Feng Zheng. Bidirectional Error-Aware Fusion Network for Video Inpainting
589 -- 600 Yongkang Zhang 0001, Han Zhang, Jun Li 0072, Zhiping Shi 0002, Jian Yang 0030, Kaixin Yang, Shuo Yin, Qiuyan Liang, Xianglong Liu 0001. Bullet-Screen-Emoji Attack With Temporal Difference Noise for Video Action Recognition
601 -- 616 Yuanyuan Li, Zetian Mi, Yulin Wang, Shuaiyong Jiang, XianPing Fu. TAFormer: A Transmission-Aware Transformer for Underwater Image Enhancement
617 -- 631 Junfei Shi, Shanshan Ji, Haiyan Jin, Junhuai Li, Maoguo Gong, Weisi Lin. Content-Adaptive Multi-Region Deep Network for Polarimetric SAR Image Classification
632 -- 642 Kaihui Cheng, Chule Yang, Xiao Liu, Naiyang Guan, Zhiyuan Wang. LPN: Language-Guided Prototypical Network for Few-Shot Classification
643 -- 656 Yu Xie, Lianhang Luo, Tianpei Cao, Bin Yu 0011, A. Kai Qin. Contrastive Learning Network for Unsupervised Graph Matching
657 -- 669 Chen Yang 0020, Junxiao Wang, Huixiao Meng, Shuyuan Yang, Zhixi Feng. Negative Class Guided Spatial Consistency Network for Sparsely Supervised Semantic Segmentation of Remote Sensing Images
670 -- 683 Chang Wan, Ming-Hsuan Yang 0001, Minglu Li 0001, Yunliang Jiang, Zhonglong Zheng. Nested Annealed Training Scheme for Generative Adversarial Networks
684 -- 697 Xiao Jiang, Yiyuan Xie, Yushu Zhang 0001, Yichen Ye, Fang Xu, Lili Li, Ye Su, Zhuang Chen. Reversible Data Hiding in Encrypted Images Using Reservoir Computing-Based Data Fusion Strategy
698 -- 712 Xueli Geng, Lingling Li 0002, Licheng Jiao, Xu Liu 0006, Fang Liu 0001, Shuyuan Yang. Knowledge-Aware Geometric Contourlet Semantic Learning for Hyperspectral Image Classification
713 -- 727 Yaowu Fan, Jia Wan 0001, Andy J. Ma. Learning Crowd Scale and Distribution for Weakly Supervised Crowd Counting and Localization
728 -- 740 Zhishe Wang, Zhuoqun Zhang, Wuqiang Qi, Fengbao Yang, Jiawei Xu 0004. FreqGAN: Infrared and Visible Image Fusion via Unified Frequency Adversarial Learning
741 -- 753 Shi Chen, Lefei Zhang, Liangpei Zhang 0001. Cyclic Cross-Modality Interaction for Hyperspectral and Multispectral Image Fusion
754 -- 768 Heqian Qiu, Lanxiao Wang, Taijin Zhao, Fanman Meng, Qingbo Wu 0001, Hongliang Li 0001. MCCE-REC: MLLM-Driven Cross-Modal Contrastive Entropy Model for Zero-Shot Referring Expression Comprehension
769 -- 782 Bin Fan 0002, Ying Guo, Yuchao Dai, Chao Xu 0006, Boxin Shi. Self-Supervised Learning for Rolling Shutter Temporal Super-Resolution
783 -- 799 Guanbo Wang, Haiyan Li, Victor S. Sheng, Yujun Ma, Hongwei Ding, Hongzhi Zhao. DPMNet: A Remote Sensing Forest Fire Real-Time Detection Network Driven by Dual Pathways and Multidimensional Interactions of Features
800 -- 810 Lei Qi 0001, Dongjia Zhao, Yinghuan Shi, Xin Geng 0001. Patch-Aware Batch Normalization for Improving Cross-Domain Robustness
811 -- 822 Yusong Hu, Zichen Liang, Xialei Liu, Qibin Hou, Ming-Ming Cheng. Reformulating Classification as Image-Class Matching for Class Incremental Learning
823 -- 837 Wanyu Wu, Wei Wang 0170, Zheng Wang 0007, Kui Jiang, Zhengguo Li. For Overall Nighttime Visibility: Integrate Irregular Glow Removal With Glow-Aware Enhancement
838 -- 856 Mingye Ju, Chunming He, Can Ding, Wenqi Ren, Lin Zhang 0014, Kai-Kuang Ma. All-Inclusive Image Enhancement for Degraded Images Exhibiting Low-Frequency Corruption
857 -- 873 Yuxin Kong, Peng Yang 0004, Yan Cheng. Adaptive On-Device Model Update for Responsive Video Analytics in Adverse Environments
874 -- 887 Bobiao Guo, Ping Ping, Junyuan Huo. CRDH: Compatible Reversible Data Hiding With High Capacity and Generalization
888 -- 899 Zhiyuan Li, Yanhui Zhou, Hao Wei 0005, Chenyang Ge, Jingwen Jiang. Toward Extreme Image Compression With Latent Feature Guidance and Diffusion Prior
900 -- 910 Yili Jin 0001, Xize Duan, Kaiyuan Hu, Fangxin Wang 0001, Xue Liu 0001. 3D Video Conferencing via On-Hand Devices
911 -- 921 Wenhui Li 0001, Chao Pang, Weizhi Nie, Hongshuo Tian, An-An Liu. Bidirectional Mask Selection for Zero-Shot Referring Image Segmentation
922 -- 937 Laijin Meng, Fan Li, Xinghao Jiang, Qiang Xu 0007. A Universal Framework for Improving the Robustness of Coverless Image Steganography Based on Image Restoration
938 -- 952 Jia-Run Du, Jia-Chang Feng, Kun-Yu Lin, Fa-Ting Hong, Zhongang Qi, Ying Shan, Jian-Fang Hu, Wei-Shi Zheng 0001. Weakly-Supervised Temporal Action Localization by Progressive Complementary Learning
953 -- 966 Kuiyuan Zhang, Zeming Hou, Zhongyun Hua, Yifeng Zheng, Leo Yu Zhang. Boosting Deepfake Detection Generalizability via Expansive Learning and Confidence Judgement
967 -- 980 Jing Lian, Zhenghao Wang, Dongfang Yang, Wen Zheng, Linhui Li, Yibin Zhang. Pedestrian Facial Attention Detection Using Deep Fusion and Multi-Modal Fusion Classifier
981 -- 985 Yaning Zhang, Yingqian Wang 0002, Tianhao Wu, Jungang Yang 0001, Wei An. Fixed Relative Pose Prior for Camera Array Self-Calibration