Journal: The Visual Computer

Volume 41, Issue 9

6331 -- 6333Nadia Magnenat-Thalmann. Editorial issue July 2025
6335 -- 6348Wonjun Lee. Multilevel Monte Carlo for asymptotically efficient path tracing
6349 -- 6361Teng Zhang, Bo Yang, Jianlin Zhu, Xincheng Hu. Scene-Enhanced Social Interpretable Movement Behavior for Multimodal Pedestrian Trajectory Prediction
6363 -- 6374Naoki Kita. StencilQR: connectivity-enhanced fabricable QR codes for stencil
6375 -- 6386Yi Jiang, Yiqian Wu, Hao Xu, Xiwen Shi, Xiaogang Jin 0001. Geometry guidance diffusion image morphing with large shape difference
6387 -- 6399Yanping Fu, Yuting Zhang, Dengdi Sun, Shaojie Zhang, Haifeng Zhao 0001. Single image shadow removal using 2D signed distance field
6401 -- 6412Xiaonan Fang, Muhan Chang. Video sketching using multi-domain guidance and implicit encoding
6413 -- 6424Wenguang Chen, Dong Xiao, Renjie Chen. Bijective spherical parameterization via stereographic projection
6425 -- 6437Haipeng Wang. Submodular-based view selection for low-quality points rendering with multi-feature point-based NeRF
6439 -- 6452Shihao Zheng, Huisi Wu, Zhijian Gao, Ping Li. Few-shot medical image segmentation via query transformation learning
6453 -- 6464Yuan-Hao Jiang, Kezong Tang, Zi-Wei Chen, Yuang Wei, Tian-Yi Liu, Jiayi Wu. MAS-KCL: knowledge component graph structure learning with large language model-based agentic workflow
6465 -- 6477Xiaojiao Guo, Shenghong Luo, Yihang Dong, Zexiao Liang, Zimeng Li, Xiujun Zhang, Xuhang Chen. An asymmetric calibrated transformer network for underwater image restoration
6479 -- 6491Renjie Zhang, Xin Wang, George Baciu, Ping Li. Distilling complementary information from temporal context for enhancing human appearance in human-specific NeRF
6493 -- 6505Feiwei Qin, Liangzhe Zhu, Zijian Xu, Meie Fang, Ping Li. CADGCL: unsupervised retrieval of CAD models via boundary representations
6507 -- 6519Jie Zhao, Ju Dai, Feng Zhou 0007, JunJun Pan, HongWen Xu. Dual-path spatio-temporal Mamba for skeleton-based action recognition
6521 -- 6532Shu Liu, Yilin Huang, Hongyun Yu, Yan Xu. AMNet: an attention-enhanced multi-branch network for micro-expression recognition
6533 -- 6546Yun Pei, Lingbo Liu, Runqing Jiang, Ye Zhang, Pengpeng Yu, Liang Lin, Yulan Guo. Energy-guided test-time adaptation for data shifts in multi-modal perception
6547 -- 6560Cheng Fang, Siyan Zhu, JunJun Pan. Enhanced material point method with affine projection stabilizer for efficient hyperelastic simulations
6561 -- 6569Pengpei Hong, Chuhua Xian, Hongmin Cai, Jiazhou Chen, Guiqing Li. Batch Specular Manifold Sampling for caustics rendering
6571 -- 6585Yuhang Yi, Yan Gui, Zhuo Liu. Boosting memory network for video object segmentation in complex scenes
6587 -- 6600Yuval Onn, Haggai Maron, Ayellet Tal. Attention-guided self-supervised distinctive region detection in point clouds
6601 -- 6615Qingzheng Wang, Ning Li, Jiazhi Xie, Wenhui Liu, Xingqin Wang, Zengwei Mai. Unified cross-domain refinement network for camouflaged object detection
6617 -- 6629Runqiao Li, Qiujie Dong, Shuangmin Chen. RevolRecon: Neural Representation for Reconstructing Surface of Revolution
6631 -- 6644Kai Yang, Wenhao Zhang, Ping Li, Jinxing Liang, Tao Peng 0006, Jia Chen, Li Li, Xinrong Hu, Junping Liu. ViT-BF: vision transformer with border-aware features for visual tracking
6645 -- 6656Xijun Wang, Xin Zhou, Yi Wang, Songto Zeng, Xinyu Liu, Haobo Shen, Xianying Wang, Ping Li, Lei Zhu. RainRWKV: a deep RWKV model for video deraining
6657 -- 6670Sen Peng, Yihang Fu, Runjie Miu, Tianyi Lv, Baorong Yang, Xiao Dong. GenericAvatar: generic human modeling from monocular video based on mesh-guided Gaussians
6671 -- 6685Shengjun Liu, Ting Zhang, Ruoxi Deng, Xinru Liu, Hanchao Liu. Physics-guided deep learning framework with attention for image denoising
6687 -- 6700Qiuyue Zhang, Zhiwang Zhang, Shiting Wen, Chaoyi Pang, Fangyu Wu 0001. Boosting remote semantic segmentation using vision-and-language foundation model
6701 -- 6714Yixiao Feng, Weihua Tong, Zhangjin Huang. High-quality neural surface reconstruction from unoriented point clouds via multilevel tensor product B-spline hash encoding and viscosity regularization
6715 -- 6727Jian Lin, Chengze Li, Xueting Liu, Zhongping Ge. Instance-guided anime editing with a curated large-scale dataset
6729 -- 6743Baofeng Zhou, Xianyong Fang, Linbo Wang, Zhengyi Liu. SemanticAvatar: human surface reconstruction based on semantically consistent biplane features
6745 -- 6755Muyang Zhang, Weiliang Meng, Mingda Jia, Jiaming Gu, Yihua Shao, Changwei Wang 0001, Rongtao Xu, Zhihao Ma, Xiaopeng Zhang 0001. PDFT: parameter-diminish fine-tuning for transformer-based models
6757 -- 6768Taoqi Bao, Jiangnan Ye 0002, Zhankong Bao, Chee Siang Leow, Haoji Hu, Jianfeng Lu, Issei Fujishiro, Jiayi Xu. L2H-NeRF: low- to high-frequency-guided NeRF for 3D reconstruction with a few input scenes
6769 -- 6781Taishi Ito, Yuki Endo, Yoshihiro Kanamori. Selfage: personalized facial age transformation using self-reference images
6783 -- 6794Jianning Chi, Mingyang Sun, Zelan Li, Geng Lin, Ying Huang. Adaptive box-level supervision with superpixel shape guidance for ultrasound image segmentation
6795 -- 6807J. Antony, M. Reghunath, Safeer Babu Thayyil, M. Ramanathan 0001. ConDT: A 2D curve reconstruction algorithm based on a constrained neighbor proximity graph
6809 -- 6821Yiyi Wang, Jia Su, Song Zhang, Eisei Nakahara. RaEUNet: a retentive and efficient UNet for medical image segmentation
6823 -- 6835Zizhao Peng, Zihan Wang, Mengying Sun, Zheng Lv, Yan Wang, Ping Li, Fengwei An. Graph convolutional networks for 3D skeleton-based scoliosis screening using gait sequences
6837 -- 6849Min Shi 0005, Guo-Liang Zhao, Shi-sheng Guo, Bi-lian Sun, Dengming Zhu, Xiu-juan Chai, Zhao-Xin Li, Xinru Zhuo. Generating 3D fish motion skeleton via iterative optimization method and FishSkeletonNet
6851 -- 6864Peng Yu, Zhiyang Ji, Aimin Hao, Yang Gao 0032. Real-time immersive haptic sculpting with elastoplastic virtual clay
6865 -- 6878Enxu Zhao, Jianchi Sun, Fei Luo 0004, Chunxia Xiao. EE-Head: emotion estimation for precise facial expression in NeRF head avatars
6879 -- 6890Linling Jiang, Xin Wang, Fan Zhang 0045, Caiming Zhang. Transforming time and space: efficient video super-resolution with hybrid attention and deformable transformers
6891 -- 6904Huibiao Wen, Lei Wang, Shuang-Min Chen, Shiqing Xin, Chongyang Deng, Ying He 0001, Wenping Wang, Changhe Tu. ImS: implicit shell for the sandwich-walled space surrounding polygonal meshes
6905 -- 6915Tsukasa Fukusato, Akinobu Maejima, Takeo Igarashi. Locality-Preserving Free-Form Deformation
6917 -- 6929Jiawei Xu, Qiangqiang Zhou, Jiacong Yu, Chen Liao, Dandan Zhu. Semantic-Orthogonal Multi-modal Attention Network for RGB-D Salient Object Detection
6931 -- 6941Yunlong Liao, Yiting Lin, Zheng Xing, Xiaochen Yuan. Privacy Image Secrecy Scheme Based on Chaos-Driven Fractal Sorting Matrix and Fibonacci Q-Matrix
6943 -- 6954Ruiling Li, Ming Gao, Xiaogang Jin. Recognize Me If You Can: Two-stream Adversarial Transfer for Facial Privacy Protection using Fine-grained Makeup
6955 -- 6967MinJae Seo, Inhyung Jung, Jinhoon Choi, Kyoungju Park. PhysAvatar: physically plausible avatar generation from sparse tracking
6969 -- 6982Ruhao Wang, Yu Jiang, Huizhi Zhu, Fei Luo 0004, Chunxia Xiao. HumanIR-MGI: human inverse rendering via jointly optimizing geometry, material, and illumination
6983 -- 6997Bingchen Yang, Haiyong Jiang, Zhengda Lu, Jun Xiao 0005. Exploring Structural Lines for Interior Floorplan Segmentation
6999 -- 7012Haibo Wang, Qinsong Li, Ling Hu, Haojun Xu, Jing Meng, Xinru Liu, Yu-Kun Lai, Shengjun Liu. TriAlign: revisiting deep functional map from map representation alignment perspectives

Volume 41, Issue 8

5223 -- 5233Shiyun Zhang, Xing Deng, Haijian Shao, Yingtao Jiang. ImpRes: implicit residual diffusion models for image super-resolution
5235 -- 5250Imen Labiadh, Larbi Boubchir, Hassene Seddik. Optimization of 2D and 3D facial recognition through the fusion of CBAM AlexNet and ResNeXt models
5251 -- 5266He Yu, Kang Yan, Jiexi Chen, Xuan Li, Jinming Guo, Xiaoxue Xing, Tao Huang 0008. Study on the methods of hyperspectral image saliency detection based on MBCNN
5267 -- 5282Yanxiang Li, Wenzhe Meng, Dehua Ma, Siping Xu, Xiaoliang Zhu. MCGFF-Net: a multi-scale context-aware and global feature fusion network for enhanced polyp and skin lesion segmentation
5283 -- 5298Yusong Li, Bin Xie, Yuling Li, Jiahao Zhang. Multi-scale local regional attention fusion using visual transformers for fine-grained image classification
5299 -- 5309Yongpeng Zhao, Guangyuan Zhang, Kefeng Li, Zhenfang Zhu, Xiaotong Li, Yongshuo Zhang, Zhiming Fan. MFADU-Net: an enhanced DoubleU-Net with multi-level feature fusion and atrous decoder for medical image segmentation
5311 -- 5322Meichen Lu, Yi Chai, Kaixiong Xu, Weiqing Chen, Fei Ao, Wen Ji. Multimodal fusion and knowledge distillation for improved anomaly detection
5323 -- 5345Jihua Peng, Yanghong Zhou, P. Y. Mok 0001. EHFusion: an efficient heterogeneous fusion model for group-based 3D human pose estimation
5347 -- 5359Xizhuo Yu, Chaojie Fan, Jiandong Pan, Guoliang Xiang, Chunyang Chen, Tianjian Yu, Yong Peng 0002, Hanwen Deng. X-ray security inspection for real-world rail transit hubs: a wide-ranging dataset and detection model with incremental learning block
5361 -- 5371Junli Shen, Yuman Hai, Chongyu Lin. CT-UFormer: an improved hybrid decoder for image segmentation
5373 -- 5389Yufang Yang, Yining Xie, Jun Cao, Kaihua Yang. Attention-guided dual feature extraction approach for small target detection in infrared images
5391 -- 5404Honglin Wu, Xinyu Yu, Zhaobin Zeng. SSBFNet: a spectral-spatial fusion with BiFormer network for hyperspectral image classification
5405 -- 5419Fangfang Liang, Zilong Huang, Wenjian Wang, Zhenxue He, Qing En. Dynamic text prompt joint multimodal features for accurate plant disease image captioning
5421 -- 5433Wei Cao, Xin Chen, Jianping Lv, Liang Shao, Weixin Si. Semi-supervised intracranial aneurysm segmentation via reliable weight selection
5435 -- 5445Wei-jong Yang, Li-Yang Ho. CSA-Lanenet: a contiguous spatial attention lane detection network with vision transformer modules
5447 -- 5459Simin Yan, Shuchang Xu, Aiping Lei, Sanyuan Zhang. Advancing neural aesthetic assessment of artistic images based on bundle features integration
5461 -- 5476Donghui Wang, Jinhua Wang, Ning He, Jingzun Zhang, Sen Zhang, Shuai Liu. Enhancing unsupervised shadow removal via multi-intensity shadow generation and diffusion modeling
5477 -- 5494Yunfei Lu, Chenxia Chang, Song Gao, Shaowen Yao 0001, Ahmed Zahir. Boosting adversarial example detection via local histogram equalization and spectral feature analysis
5495 -- 5515Canlin Li, Haowen Su, Xin Tan, Lihua Bi, Xiangfei Zhang, Lizhuang Ma. Innovative collaborative multi-lookup table for real-time enhancement of low-light images
5517 -- 5537Zhao Liangjun, Yinqing Wang, Yueming Hu, Hui Dai, Xi Yubin, Feng Ning, He Zhongliang, Gang Liang, Yuanyang Zhang. An image fusion algorithm based on image clustering theory
5539 -- 5562Jie Yin, Tao Sun, Guorong Zhang, Yuhao Wu, Xiao Zhang. Deformation-aware image restoration from atmospheric turbulence based on quasiconformal geometry and pulse-coupled neural network
5563 -- 5582Hongwei Wei, Qi Li, Jie Pan, Junmei Chen, Yizhuo Zhang, Lizhuang Qi, Ying Zhou. SPSNet: semantic-guided perspective shift network for robust person re-identification in drone imagery
5583 -- 5596Shuai Su, Chengju Liu, Qijun Chen. Universally describing keypoints from a semi-global to local perspective, without any specific training
5597 -- 5608Yan Liu, Wenting Qi, Jingwen Wang, Yanqiu Xiao, Guangzhen Cui, Li Han. An efficient defogging network for RAW image sequences with high viewpoint
5609 -- 5624Yiyuan Ge, Mingxin Yu, Zhihao Chen, Wenshuai Lu, Yuxiang Dai, Huiyu Shi. Attention-enhanced controllable disentanglement for cloth-changing person re-identification
5625 -- 5641Maocheng Bai, Xiaosheng Yu, Ying Wang, Jubo Chen, Xiaofeng Zhang, Pengfei Lyu. Enhancing pixel-level analysis in medical imaging through visual instruction tuning: introducing PLAMi
5643 -- 5660Wei Liu, Cong Wang, Yongkang Zhang. Industrial surface defect detection by multi-scale Inpainting-GAN
5661 -- 5674Yanzheng He, Pengjun Wang, Xiaochun Guan, Han Li. Enhancing 3D Human Motion Prediction with MSIGCN: A Novel Approach to Addressing Sensor Noise and State Accuracy
5675 -- 5688Saba Ghazanfar Ali, Xiangning Wang, Lei Bi 0001, Younhyun Jung, Tingli Chen, Haifang Zhang. Deep learning-based binocular system for automated diabetic retinopathy grading with prior clinical knowledge integration
5689 -- 5700Xuefeng Zhang, Bin Yan, Zhaohu Xing, Feng Gao, Yuandong Tao, Zhenyan Han, Weiming Wang, Lei Zhu 0003. HADiff: hierarchy aggregated diffusion model for pathology image segmentation
5701 -- 5718Zhaobin Chang, Xiong Gao, Dongyi Kong, Na Li, Yonggang Lu. Multi-prototype collaborative perception enhancement network for few-shot semantic segmentation
5719 -- 5731Kunyu Yan, Wenbin Zheng, Yujie Yang. Lightweight weed detection using re-parameterized partial convolution and collection-distribution feature fusion
5733 -- 5749Xin Zhang, Degang Yang, Tingting Song, Yichen Ye, Yingze Song, Jie Zhou, Jie Chen. A lightweight object detector based on changeable-size lightweight convolution and context augmentation module for images captured by UAVs
5751 -- 5767Cuiyun Lin, Chengxue Lao, Tianrun Jing, Wenxiao Wang 0004. Predicting game ownership dynamics: a novel POAFD-trend analysis approach
5769 -- 5780Jiaze He, Jian Xiao, Yuanjie Cao, Jing He, Siyu Li, Jin Huang, Ruhan He, Jianlin Zhu. Region-assisted line drawing colorization through diffusion model
5781 -- 5798Jinsong Zhang, Yu-Kun Lai, Jingyu Yang, Kun Li. PISE-V: person image and video synthesis with decoupled GAN
5799 -- 5814Zheyuan Wang, Ziyao Meng, Yiming Qin. MSPAN: lightweight image super-resolution with multi-semantic guidance
5815 -- 5833Zehao Cao, Zongji Wang, Yuanben Zhang, Cheng Jin, Weinan Cai, Zhihong Zeng, Junyi Liu. Enhancing 3D Gaussian splatting for low-quality images: semantically guided training and unsupervised quality assessment
5835 -- 5854Liangjun Zhao, Xi Yubin, Yinqing Wang, Feng Ning, He Zhongliang, Gang Liang, Yuanyang Zhang. MADNet: cropland change detection network for the complex terrain and dense vegetation hilly region in the Southwestern China
5855 -- 5872Qiaohong Chen, ZhenYang Xu, Xian Fang. CaVMamba: convolution-augmented VMamba for medical image segmentation
5873 -- 5889Runlong Cao, Jianqi Zhang, Yun Shen, Huanhuan Zhou, Peiying Zhou, Guowei Shen, Zhengwen Xia, Ying Zang, Qingshan Liu, Wenjun Hu. Dual-flow feature enhancement network for robust anomaly detection in stainless steel pipe welding
5891 -- 5903Yiming Chen, Yihang Liu, Gizem Kayar-Ceylan. CSG-based ML-supported 3D translation of sketches into game assets for game designers
5905 -- 5917Yuanchuan Duan, Peng Wang, Yan Huang, Yuxin Hang, Qi Sun, Haibo Shao, Jinzhu Yang. Optimizing semi-supervised medical image segmentation with imbalanced filtering and nnU-Net enhancement
5919 -- 5933Pengfei Zhao, Jianhua Ji, Yang Wen, Wuzhen Shi, Wenming Cao 0001. Dual prior guided depth image super-resolution with multi-scale transformer fusion network
5935 -- 5947Yaguang Lu, Yong Hu, Huiyan Feng, Pengshuai Duan, Xukun Shen. Generating reconstructable collaborative virtual environments via graph matching for mixed reality remote collaboration
5949 -- 5960Yingjie Fan, Bin Wen, Hongfei Deng. MRA-Net: an instance segmentation method based on multi-scale feature fusion for ethnic costumes images
5961 -- 5977Zhangmeng Chen, Ju Dai, JunJun Pan, Feng Zhou 0007. Diffusion model with temporal constraint for 3D human pose estimation
5979 -- 5993Zhenmin Yao, QianQian Hu. Accelerated local progressive-iterative approximation methods for curve and surface fitting
5995 -- 6009Ahmet Agaoglu, Nezih Topaloglu. Dynamic region of interest generation for maritime horizon line detection using time series analysis
6011 -- 6025Hu Wang, Hong-mei Sun, Wen-Long Zhang, Yu-Xiang Chen, Rui-sheng Jia. FANN: a novel frame attention neural network for student engagement recognition in facial video
6027 -- 6039Tongtong Liu, Chen Yang, Guoqiang Chen, Wenhui Li. Open-vocabulary multi-label classification with visual and textual features fusion
6041 -- 6054Shang Ma, Xiaoying Nie, Gang Yang, Chunqing Zhou. A robust and efficient model for the interaction of fluids with deformable solids
6055 -- 6065Guoyou Zhang, Zhixiang Hao, Lihu Pan, Wei Guo, Jiaxin Zuo, Xuenan Zhang. MeshBLS: mesh-based broad learning 3D object classification network
6067 -- 6085YaJuan Zhang, Yongquan Liang, Junjie Wang, Houying Zhu, Zhihui Wang 0003. Enhanced multi-object tracking via embedded graph matching and differentiable Sinkhorn assignment: addressing challenges in occlusion and varying object appearances
6087 -- 6102Xiao Li, Kai Wu, Haoran Chen, Wenjun Song, Hongwei Tao, Zuhe Li, Yanan Du. Deep residual PLSR model with manifold optimization and Gaussian filter for enhanced image classification
6103 -- 6120Hongzhi Li, Zhanghao Ren, Guoqing Zhu, Yaoju Liang, Han Cui, Chaozeyu Wang, Jiaxi Wang. Enhancing medical image segmentation with MA-UNet: a multi-scale attention framework
6121 -- 6132Jianbing Xu, Jiangxin Zhou, Dongxu Xu, Yu Chen. Local dual-branch attention feature learning framework from UAVs for visual defect detection
6133 -- 6148Zhanqiang Huo, Xiyan Zhan, Yingxu Qiao, Shan Zhao. D3-Dehaze: a divide-and-conquer framework for enhanced single image dehazing
6149 -- 6167Jingya Shi, Dezhi Han, Chongqing Chen, Xiang Shen. SAFFNet: self-attention based on Fourier frequency domain filter network for visual question answering
6169 -- 6185Xiaodong Wang, Jiangtao Fan, Fei Yan, Hongmin Hu, Zhiqiang Zeng, Haiyan Huang. Unsupervised fur anomaly detection with B-spline noise-guided Multi-directional Feature Aggregation
6187 -- 6199Tang Xu, Wenbin Wang 0001, Alin Zhong. HOIEdit: Human-object interaction editing with text-to-image diffusion model
6201 -- 6217Xiangyang Wang 0003, Kun Yang, Qiang Ding, Rui Wang 0034, Jinhua Sun. Tic action recognition for children tic disorder with end-to-end video semi-supervised learning
6219 -- 6235Elmira Bagheri, Amir Hossein Barshooi. Nighttime driver behavior prediction using taillight signal recognition via CNN-SVM classifier
6237 -- 6249Yanmei Li, Tao Yu, Jian Luo, Xiaoshuang Li, Jingshi Deng, Qibin Yang. JLEDNet: a nighttime UAV tracking method through joint low-light image enhancement using hybrid attention transformer and denoising
6251 -- 6269V. Karthikeyan 0004, S. Praveen, S. Sudeep Nandan. Lightweight deep hybrid CNN with attention mechanism for enhanced underwater image restoration
6271 -- 6297Qian Ye, Qingwu Li, Guanying Huo, Yan Liu, Yan Zhou. Boundary-guided multi-scale refinement network for camouflaged object detection
6299 -- 6312Qiuquan Zhao, Jianyuan Li. SPS-UNet: a super-pixel sampling UNet for extracting buildings from high-resolution satellite images
6313 -- 6326Enze Yang, Yuxin Liu, Shitao Zhao, Yiran Liu, Shuoyan Liu. Learn from restoration: exploiting task-oriented knowledge distillation in self-supervised person re-identification
6327Daniel Jiménez Navarro, Ana Serrano, Sandra Malpica. Correction to: Minimally disruptive auditory cues: their impact on visual performance in virtual reality
6329Satoshi Nishimura. Correction: Grid-induced bounding volume hierarchy for ray tracing dynamic scenes

Volume 41, Issue 7

4395 -- 4403Long Zhang, Qinghua Zhou, Shuai Tang, Yunxiang Chen. High-definition multi-scale voice-driven facial animation: enhancing lip-sync clarity and image detail
4405 -- 4418Qiaohong Chen, Shufan Xie, Xian Fang, Qi Sun. CTHFNet: contrastive translation and hierarchical fusion network for text-video-audio sentiment analysis
4419 -- 4430Xuanpeng Li, Hengshuo Cao, Jinming Li, Guangyu Li, Lin Zhao. A shoreline extraction method based on dual-loop network framework
4431 -- 4448Viktor Leonhardt, Alexander Wiebel, Christoph Garth. A framework for visual comparison of scalar fields with uncertainty
4449 -- 4461Ye Liu, Lei Zhu, Liang Wan, Xing Wang. Masked frequency-color fusion network for video instance-level hazy lane detection
4463 -- 4480Jibing Peng, Yaohua Yi, Ying Zhou. DPDTRN: a dynamic pixel-level difficulty-aware texture reconstruction network for document super-resolution
4481 -- 4495Huangyuan Wu, Bin Li, Lianfang Tian, Chao Dong. DDFA: a displacement and diffusion-based feature augmentation method for imbalanced image recognition
4497 -- 4515Yunfei Qiu, Shuai Jiao, Qingtang Su. Enhancing color image watermarking via fast quaternion Schur decomposition: a high-quality blind approach
4517 -- 4532Rui Sun, Xiaolu Yu, Huidong Feng, Fei Wang, Xudong Zhang. Motion-robust mask face presentation attack detection via dual-stream texture-rPPG network
4533 -- 4546Zhiwen Shao, Yifan Cheng, Yong Zhou 0003, Xiang Xiang 0001, Jian Li 0054, Bing Liu 0016, Dit-Yan Yeung. High-level LoRA and hierarchical fusion for enhanced micro-expression recognition
4547 -- 4565Kesai Wang, Xifan Yao, Nanfeng Ma, Guangjun Ran. PLMOT-SLAM: a point-line features fusion SLAM system with moving object tracking
4567 -- 4580Ping Lu, Youcheng Cai, Jiale Yang, Dong Wang, Tingting Wu. Uanet: uncertainty-aware cost volume aggregation-based multi-view stereo for 3D reconstruction
4581 -- 4601Zhengyan Liu, Huiwen Wang, Lihong Wang, Shanshan Wang. Locality-constrained double-layer structure scaled simplex multi-view subspace clustering
4603 -- 4621Tianxiang Huo, Zhenqi Liu, Shichao Zhang, Jiening Wu, Rui Yuan, Shukai Duan 0001, Lidan Wang 0001. CDNet: object detection based on cross-level aggregation and deformable attention for UAV aerial images
4623 -- 4637Krishnendu Maity, Susanta Mukhopadhyay. LPSIS: a lossless secret image sharing scheme based on Legendre polynomials with low-cost reconstruction
4639 -- 4660Yuesong Tian, Li Shen 0008, Xiang Tian 0002, Dacheng Tao, Zhifeng Li 0001, Wei Liu 0005, Yaowu Chen. DGL-GAN: discriminator-guided GAN compression
4661 -- 4672Javed Aymat Husen Shaikh, Shailendrakumar M. Mukane, Santosh Nagnath Randive. Lightweight progressive recurrent network for video de-hazing in adverse weather conditions
4673 -- 4686Jinchang Zhu, Dayang Sun, Yu Cheng, Hailong Wang, Yujing Chen, Yaowei Chen. GaitHF: enhancing appearance-based gait recognition through height fused images
4687 -- 4702Wanjun Zhong, Haohao Hu, Yuerong Wang, Li Li, Tianyu Han, Chunyong Li, Peng Zan. Hierarchical evidence aggregation in two dimensions for active water surface object detection
4703 -- 4722Julien Thomas, Boyu Kuang, Yizhong Wang, Stuart Barnes, Karl Jenkins. Advanced semantic segmentation of aircraft main components based on transfer learning and data-driven approach
4723 -- 4739Hongfei Li, Xueyang Li. Dim and small objects detection in aerial images with stacked attention mechanism and improved loss function
4741 -- 4758Yanliang Ge, Junchao Ren, Cong Zhang, Min He, Hongbo Bi, Qiao Zhang. Feature-aware and iterative refinement network for camouflaged object detection
4759 -- 4778Mohamad Haniff Junos, Anis Salwa Mohd Khairuddin. YOLO-MMS for aerial object detection model based on hybrid feature extractor and improved multi-scale prediction
4779 -- 4798Sardor Mamarasulov, Lianggangxu Chen, Changgu Chen, Yang Li, Changbo Wang. Data augmentation with attention framework for robust deepfake detection
4799 -- 4813Jian Ni, Zheng Wang, Yixiao Wang, Wenjian Tao, Ao Shen. DRCL: rethinking jigsaw puzzles for unsupervised medical image segmentation
4815 -- 4838Huanshuo Zhang, Guobiao Ren. Intelligent leaf disease diagnosis: image algorithms using Swin Transformer and federated learning
4839 -- 4850Václav Skala. A new fully projective O(log N) point-in-convex polygon algorithm: a new strategy
4851 -- 4864Jianuo Wang, Huawei Li, Yumin Chen. Seg-invRender: fusing semantic segmentation based on NeRF for inverse rendering considering shadows
4865 -- 4877Wuzhen Shi, Aixue Yin, Yingxiang Li, Bo Qian. Cross-view Transformer for enhanced multi-view 3D reconstruction
4879 -- 4892Jiaxing Yu, Zheng Chen 0014, Jingkai Wang, Linghe Kong, Jiajie Yan, Wei Gu. Enhancing Image Super-Resolution with Dual Compression Transformer
4893 -- 4914Saleha Masood, Mousa Ahmad Al Bashrawi, Muhammad Attique Khan, Anam Nazir. Exploring ChatGPT applications in healthcare: a comprehensive overview
4915 -- 4930Yaqi Sun, Xiaolan Xie, Zhi Li, Huihuang Zhao. Image style transfer with saliency constrained and SIFT feature fusion
4931 -- 4955Zean Jin, Yulong Bai, Wei Song, Qinghe Yu, Xiaoxin Yue. EduCodeVR: VR for programming teaching through simulated farm and traffic
4957 -- 4974Zeyu Cai, Ziyu Zhang, Chengqian Jin, Feipeng Da. DMDC: a cross-attention network for dynamic mask-based dual-camera snapshot hyperspectral Photography
4975 -- 4990Baokai Zu, Tong Cao, Yafang Li, Jianqiang Li 0002, Hongyuan Wang, Quanzeng Wang. RESwinT: enhanced pollen image classification with parallel window transformer and coordinate attention
4991 -- 5003Yaqian Li, Xin Zhan, Haibin Li, Wenming Zhang. Selection and guidance: high-dimensional identity consistency preservation for face inpainting
5005 -- 5017Yang Yang, Changming Zhu. Deep multi-view clustering based on global hybrid alignment with cross-contrastive learning
5019 -- 5028Tiago Madeira, Miguel Oliveira 0001, Paulo Dias. Reflection-aware 3D mirror segmentation and pose estimation
5029 -- 5041Tao Shi, Yao Ding, Kui-feng Zhu, Yan-jie Su. DFP-YOLO: a lightweight machine tool workpiece defect detection algorithm based on computer vision
5043 -- 5057Congying An, Jingjing Wu, Huanlong Zhang. Occlusion-aware segmentation via RCF-Pix2Pix generative network
5059 -- 5073Daniel Jiménez Navarro, Ana Serrano, Sandra Malpica. Minimally disruptive auditory cues: their impact on visual performance in virtual reality
5075 -- 5086Zidi Cao, Jiayi Han, Sipeng Yang, Xiaogang Jin 0001. Fast best viewpoint selection with geometry-enhanced multiple views and cross-modal distillation
5087 -- 5104Hongru Wang, Hu Cheng, Jingtao Zhang. Faster-PGYOLO: an efficient framework for floating debris detection in inland waters
5105 -- 5121Yanchen Liu, Changming Zhu. DMVMLC-VT: Deep incomplete multi-view multi-label image classification with view translation and pseudo-label enhancement
5123 -- 5134Miao Yang, Meng Yang 0011, Weiliang Meng, Ping Li 0016, Zhen Li. Msc-Net: multi-stage colorization network for real-world images with specular highlights
5135 -- 5151Kexuan Wang, Chenhua Liu, Rongfu Zhang. CMA-SOD: cross-modal attention fusion network for RGB-D salient object detection
5153 -- 5169Yanliang Ge, Taichuan Liang, Junchao Ren, Jiaxue Chen, Hongbo Bi. Enhanced salient object detection in remote sensing images via dual-stream semantic interactive network
5171 -- 5187Jianguo Ning, Lei Zhang, Xiangzhao Xu. Virtual simulation for the dynamic response of concrete blocks under blast loading
5189 -- 5203Shue Liu, Siwei Zhao, Yiying Wang, Jiaming Xin, Dashe Li. An enhanced underwater fish segmentation method in complex scenes using Swin transformer with cross-scale feature fusion
5205 -- 5221Zewei Zhao, Xiaotie Ma, Yingjie Shi, Xiaotong Yang. Multi-scale defect detection for plaid fabrics using scale sequence feature fusion and triple encoding

Volume 41, Issue 6

3679 -- 3693Zhaijuan Ding, Yanyu Liu, Sen Liu, Kangjian He, Dongming Zhou. KD³mt: knowledge distillation-driven dynamic mixer transformer for medical image fusion
3695 -- 3717Lin Wang, Jie Li, Chun Qi, Fengping Wang, Pan Wang 0004. Progressive Crowd Enhancement De-Background Network for crowd counting
3719 -- 3734Baoan Li, Long Zhang, Shangzhi Teng, Xueqiang Lyu. Attribute correlation mask fusion network for pedestrian attribute recognition
3735 -- 3783Yasmin M. Alsakar, Nehal A. Sakr, Shaker H. Ali El-Sappagh, Tamer AbuHmed, Mohammed Elmogy. Underwater image restoration and enhancement: a comprehensive review of recent trends, challenges, and applications
3785 -- 3800Xiaopan Li, Shiqian Wu, Xin Yuan, Shoulie Xie, Sos S. Agaian. Hierarchical wavelet-guided diffusion model for single image deblurring
3801 -- 3827Yawen Xiang, Heng Zhou 0006, Chengyang Li, Fangwei Sun, Zhongbo Li, Yongqiang Xie. Deep learning in motion deblurring: current status, benchmarks and future prospects
3829 -- 3842Yunxi Chen, Yuanjie Cao, Fei Fang, Jin Huang, Xinrong Hu, Ruhan He, Junjie Zhang. SACANet: end-to-end self-attention-based network for 3D clothing animation
3843 -- 3852Yuanjie Dang, Jiangyun Chen, Peng Chen 0008, Nan Gao, Ruohong Huan, Dongdong Zhao. Generate anomalies from normal: a partial pseudo-anomaly augmented approach for video anomaly detection
3853 -- 3865Qian Wan, Bin Zhou, Yanjiang Wang. BSCGAN: structured minority class image generation under class-balanced pretraining
3867 -- 3882Shize Wang, Gang Wu, Jin Wang, Qing Zhu, Yunhui Shi, Baocai Yin. SBC-Net: semantic-guided brightness curve estimation network for low-light image enhancement
3883 -- 3906Xinzhe Xie, Buyu Guo, Peiliang Li 0003, Shuangyan He, Sangjun Zhou. SwinMFF: toward high-fidelity end-to-end multi-focus image fusion via swin transformer-based network
3907 -- 3923Zitao Gao, Xiangjian Liu, Anna K. Wang, Liyu Lin. A simulated two-stream network via multilevel distillation of reviewed features and decoupled logits for video action recognition
3925 -- 3955Ronghui Feng, Yuefei Wang, Jiajing Xue, Yuquan Xu, Yutong Zhang, Xi Yu. CLAC-Net: a composite medical image segmentation framework using self-attention and cross-layer asymmetric connections
3957 -- 3972Guowen Yue, Ge Jiao, Chen Li, Jiahao Xiang. When CNN meet with ViT: decision-level feature fusion for camouflaged object detection
3973 -- 4000Shuo Yang, Xiaoling Gu, Zhenzhong Kuang, Feiwei Qin, Zizhao Wu. Innovative AI techniques for photorealistic 3D clothed human reconstruction from monocular images or videos: a survey
4001 -- 4016Chen Li, Weiqi Yan, Hongwei Zhao, Shihua Zhou, Yueping Wang. TFFD-Net: an effective two-stage mixed feature fusion and detail recovery dehazing network
4017 -- 4031Kailin Liu, Yonghong Hou, Zihui Guo, Wenjie Yin, Yi Ren. Visual context learning based on cross-modal knowledge for continuous sign language recognition
4033 -- 4045Qiang Cen, Qiguang Zhu, Yuxin Wang, Weidong Chen 0001, Shuo Liu. YOLOv9-YX: lightweight algorithm for underwater target detection
4047 -- 4066Le-Anh Tran, Dong-Chul Park 0002. Lightweight image dehazing networks based on soft knowledge distillation
4067 -- 4079Haiyuan Cao, Deng Chen, Yanduo Zhang, Huabing Zhou, Dawei Wen, Congcong Cao. MFINet: a multi-scale feature interaction network for point cloud registration
4081 -- 4095Libo Sun, Jiahui Yan, Yongchun Qiu, Wenhu Qin. The crowd cooperation approach for formation maintenance and collision avoidance using multi-agent deep reinforcement learning
4097 -- 4110Guowei An, Yaonan Wang 0001, Kai Zeng 0010, Qing Zhu, Xiaofang Yuan. Deep spatial and discriminative feature enhancement network for stereo matching
4111 -- 4127Qiyang Liu, Yun Ge, Sijia Wang, Ting Wang, Jinlong Xu. Dynamic manifold-based sample selection in contrastive learning for remote sensing image retrieval
4129 -- 4141Ziwei Zeng, Lihong Li, Zoufei Zhao, Qingqing Liu. Improved fine-grained image classification in few-shot learning based on channel-spatial attention and grouped bilinear convolution
4143 -- 4156Yiqian Huang 0004, Shuqi Liu, Fei Dong, Xu Li, Xin Yang 0021, Ya Zhou, Jinxiang Huang, Yong Song. PL-MCT: pseudo-labeling and multi-frame consistency training for semi-supervised visual tracking
4157 -- 4169Yong Zhang, Qingguo Shan, Wenyun Chen, Wenzhe Liu. EEG emotion recognition approach using multi-scale convolution and feature fusion
4171 -- 4181Guowei Zhang, Weidong Zhang, Wuzhi Li, Li Wang, Huankang Cui. A dynamic attention mechanism for object detection in road or strip environments
4183 -- 4198Youjie Zhou, Runyu Jiao, Zhonghan Tao, Xichang Liang, Yi Wan 0002. Spatial-frequency attention-based optical and scene flow with cross-modal knowledge distillation
4199 -- 4220Pham Thanh Huu, Nguyen Thai An, Nguyen Ngoc Trung, Huynh Ngoc Thien, Nguyen Sy Duc, Nguyen Thi Ty. Judicial decision prediction using an integrated attention based bidirectional long-short term memory and dilated skip residual convolution neural network
4221 -- 4238Xinbiao Lu, Gaofan Zhan, Wen Wu, Wentao Zhang, Xiaolong Wu, Changjiang Han. Van-DETR: enhanced real-time object detection with vanillanet and advanced feature fusion
4239 -- 4252Chenchen Xu, Kaixin Han, Weiwei Xu. Image-aware layout generation with user constraints for poster design
4253 -- 4267Zhen Huang, Yongjian Zhu, Qiao Zhang, Hongyan Zang, Tengfei Lei. Exploration, fusion, and refinement: a multivariate features interaction network for visual camouflaged detection
4269 -- 4285Yongbo Yu, Weidong Li, Linyan Bai, Jinlong Duan, Xuehai Zhang. UTDM: a universal transformer-based diffusion model for multi-weather-degraded images restoration
4287 -- 4300Liping Zhu, Haibo Zhou, Silin Wu, Tianrong Cheng, Hongjun Sun. Polynomial for real-time rendering of neural radiance fields
4301 -- 4320Yong Zhang, Da Liu, Li Jiang, Huibing Wang, Wenzhe Liu. Feature decomposition and structural learning for multi-diverse and multi-view data clustering
4321 -- 4346Pengjie Liu, Yanzhan Chen, Fan Yu, Qian Zhang. Mastering adverse weather: a two-stage approach for robust semantic segmentation in autonomous driving
4347 -- 4361Yuqi Xiao, Yongjun Wu. A dual-channel correlation filtering tracker for real-time tracking based on deep features of improved CaffeNet and integrated manual features
4363 -- 4376Dejin Zhao, Yunjie Ma, Xiaolong Yuan, Tong Tong, Dechao Wang, Rui Sun, Lili Cheng, Jianhai Zhang. SME: Spatial multi-scale enhanced attention for automated detection of micro-defect on automobile complex paint surfaces
4377 -- 4392Yuanhong Zhong, Ting Chen, Daidi Zhong, Xiaoming Liu. Wavelet-guided network with fine-grained feature extraction for vessel segmentation
4393 -- 4394Ling-Xiao Qin, Hong-mei Sun, Xiao-Meng Duan, Cheng-Yue Che, Rui-sheng Jia. Correction: Adaptive learning-enhanced lightweight network for real-time vehicle density estimation

Volume 41, Issue 5

3003 -- 3015Liang Zhang, Shifeng Li, Xi Luo, Xiaoru Liu, Ruixuan Zhang. Video anomaly detection with both normal and anomaly memory modules
3017 -- 3035Hong Zhao, Wengai Li, Dailin Huang, Jinhai Huang, Lijun Zhang. M-GAN: multiattribute learning and multimodal feature fusion-based generative adversarial network for text-to-image synthesis
3037 -- 3058Xunan Tan, Xiang Suo, Wenjun Li, Lei Bi, Fangshu Yao. Data visualization in healthcare and medicine: a survey
3059 -- 3076Junding Sun, Chenxu Wang, Haifeng Sima, Xiaosheng Wu, Shuihua Wang, Yudong Zhang. Mfpenet: multistage foreground-perception enhancement network for remote-sensing scene classification
3077 -- 3093R. Varun Prakash, V. Karthikeyan 0005, S. Vishali, M. Karthika. Multi-level LSTM framework with hybrid sonic features for human-animal conflict evasion
3095 -- 3107Xintao Liu, Yan Gao, Changqing Zhan, Qiao Wangr, Yu Zhang, Yi He, Hongyan Quan. Directional latent space representation for medical image segmentation
3109 -- 3128Yan Zhou 0003, Haibin Zhou, Yin Yang, Jianxun Li, Richard Irampaye, Dongli Wang, Zhengpeng Zhang. Lunet: an enhanced upsampling fusion network with efficient self-attention for semantic segmentation
3129 -- 3142Fengling Li, Zheng Yang, Yan Gui. SES-yolov5: small object graphics detection and visualization applications
3143 -- 3154Xiaoying Chen, Weijie Ye. Dual representations network for few-shot learning based on local descriptor importance: integrating global and local features
3155 -- 3171Zezheng Tang, Yihua Wu, Xinming Xu. The study of recognizing ripe strawberries based on the improved YOLOv7-Tiny model
3173 -- 3188Daipeng Yang, Bo Peng 0006, Xi Wu. A bio-inspired edge and segment detection method by modeling multiple visual regions
3189 -- 3204Jianjun Zhu, Huihuang Zhao, Yudong Zhang. Filter-deform attention GAN: constructing human motion videos from few images
3205 -- 3219Mingjian Li, Younhyun Jung, Shaoli Song, Jinman Kim. Attention-driven visual emphasis for medical volumetric image visualization
3221 -- 3238Jun Wang, Honghui Cao, Chenhao Sun, Ziqing Huang, Yonghua Zhang. Motion perception-driven multimodal self-supervised video object segmentation
3239 -- 3261Gang Chen, Wenju Wang, Haoran Zhou, Xiaolin Wang. EGCT: enhanced graph convolutional transformer for 3D point cloud representation learning
3263 -- 3281Haojie Gao, Peishun Liu, Xiaolong Ma, Zikang Yan, Ningning Ma, Wenqiang Liu, Xuefang Wang, Ruichun Tang. TP-LSM: visual temporal pyramidal time modeling network to multi-label action detection in image-based AI
3283 -- 3295Guowei Zhang, Wuzhi Li, Yutong Tang, Shuixuan Chen, Li Wang. Lightweight CNN-ViT with cross-module representational constraint for express parcel detection
3297 -- 3308Jianglei Ye, Yigang Wang, Fengmao Xie, Qin Wang, Xiaoling Gu, Zizhao Wu. Slot-VTON: subject-driven diffusion-based virtual try-on with slot attention
3309 -- 3327Xingquan Cai, Haoyu Zhang, Lizhe Chen, Yijie Wu, Haiyan Sun. 3D human pose estimation using spatiotemporal hypergraphs and its public benchmark on opera videos
3329 -- 3344Zhiyuan Li, Xin Jin 0005, Qian Jiang, Puming Wang, Shin-Jye Lee, Shaowen Yao 0001, Wei Zhou 0011. Crafting imperceptible and transferable adversarial examples: leveraging conditional residual generator and wavelet transforms to deceive deepfake detection
3345 -- 3357Wan-He Kai, Kai-Xin Xing. Video-driven musical composition using large language model with memory-augmented state space
3359 -- 3370Wenzhe Shi, Ziqi Hu, Hao Chen, Hengjia Zhang, Jiale Yang, Li Li. Orhlr-net: one-stage residual learning network for joint single-image specular highlight detection and removal
3371 -- 3412Xu Liu, Tong Zhou, Chong Wang, Yuping Wang, Yuanxin Wang 0001, Qinjingwen Cao, Weizhi Du, Yonghuan Yang, Junjun He, Yu Qiao, Yiqing Shen 0003. Toward the unification of generative and discriminative visual foundation model: a survey
3413 -- 3422Yaping Deng, Yingjiang Li, Zibo Wei, Keying Li. GLDC: combining global and local consistency of multibranch depth completion
3423 -- 3435Weifeng Cao, Xiaoyan Lei, Jun Shi, Wanyong Liang, Jie Liu, Zongfei Bai. HASN: hybrid attention separable network for efficient image super-resolution
3437 -- 3455Sunhan Xu, Jinhua Wang, Ning He, Guangmei Xu, Geng Zhang. Optimizing underwater image enhancement: integrating semi-supervised learning and multi-scale aggregated attention
3457 -- 3472Yazhuo Fan, Jianhua Song, Lei Yuan, Yunlin Jia. HCT-Unet: multi-target medical image segmentation via a hybrid CNN-transformer Unet incorporating multi-axis gated multi-layer perceptron
3473 -- 3486Muhammad Fahad 0013, Tao Zhang 0025, Yasir Iqbal, Azaz Ikram, Fazeela Siddiqui, Bin Younas Abdullah, Malik Muhammad Nauman, Xin Zhao 0006, Yanzhang Geng. Advanced deepfake detection with enhanced Resnet-18 and multilayer CNN max pooling
3487 -- 3501Jiajun Yang, Xuesong Zhang, Cunli Song. Research on a small target object detection method for aerial photography based on improved YOLOv7
3503 -- 3518Pengbo Bo, Qingxiang Liu, Caiming Zhang. Topological structure extraction for computing surface-surface intersection curves
3519 -- 3535Wenji Yang, Hang An, Wenchao Hu, Xinxin Ma, Liping Xie. Text-guided floral image generation based on lightweight deep attention feature fusion GAN
3537 -- 3551Ali Salar, Ali Ahmadi. Enhancing high-vocabulary image annotation with a novel attention-based pooling
3553 -- 3564Yiting Wu, Pinqi Fang, Xiangning Wang, Jie Shen. Predicting pancreatic diseases from fundus images using deep learning
3565 -- 3580Shunzhou Wang, Yao Lu, Wang Xia, Peiqi Xia, Ziqi Wang, Wei Gao. Light field angular super-resolution by view-specific queries
3581 -- 3593Xiaohu Wang, Xin Yang, Hengrui Li, Tao Li. FDDCC-VSR: a lightweight video super-resolution network based on deformable 3D convolution and cheap convolution
3595 -- 3610Minsoo Choi, Christos Mousas, Nicoletta Adamo, Sanjeevani Patankar, Klay Hauser, Fangzheng Zhao, Richard E. Mayer. ASAP: animation system for agent-based presentations
3611 -- 3626Dinghao Guo, Dali Chen, Xin Lin, Zheng Xue, Wei Zheng, Xianling Li. Semi-supervised image semantic segmentation method with semantic regions patching and uncertainty-guided loss
3627 -- 3644Yating Liu, ChengDong Lan, Wanjian Feng. DLKN: enhanced lightweight image super-resolution with dynamic large kernel network
3645 -- 3662Andrea Bodonyi, István Csoba, Roland Kunkli. Real-time ray transfer for lens flare rendering using sparse polynomials
3663 -- 3678Shijie Li, Shanhua Yao, Zhonggen Wang, Juan Wu. FFCANet: a frequency channel fusion coordinate attention mechanism network for lane detection

Volume 41, Issue 4

2065 -- 2077Hanqin Wang, Alexei Sourin. Visual signatures for music mood and timbre
2079 -- 2089Khawla Ben Salah, Mohamed Othmani, Jihen Fourati, Monji Kherallah. Advancing spatial mapping for satellite image road segmentation with multi-head attention
2091 -- 2105Mikolaj Maik, Jakub Flotynski, Krzysztof Walczak 0001. Knowledge-based approach to adaptive XR interface design for non-programmers
2107 -- 2122Max Reimann, Martin Büßemeyer, Benito Buchheim, Amir Semmo, Jürgen Döllner, Matthias Trapp 0001. Artistic style decomposition for texture and shape editing
2123 -- 2142Hiba Mzoughi, Ines Njeh, Mohamed Ben Slima, Nouha Farhat, Chokri Mhiri. Vision transformers (ViT) and deep convolutional neural network (D-CNN)-based models for MRI brain primary tumors images multi-classification supported by explainable artificial intelligence (XAI)
2143 -- 2156Dingning Long, Rongrong Chen. Cognitive capacity and aesthetics: the influence of visual working memory on landscape ink painting preference
2157 -- 2169Liangwei Wang, Zhan Wang, Xi Zhao 0003, Fugee Tsung, Wei Zeng 0004. Antarctica storytelling: creating interactive story maps for polar regions with graphic-based approach
2171 -- 2185Chuang Wu, Tingqin He. Efficient minor defects detection on steel surface via res-attention and position encoding
2187 -- 2202Junjie Zhang 0002, Yi Lin, Xin Zhou, Pangrong Shi, Xiaoqiang Zhu, Dan Zeng 0001. Precision in pursuit: a multi-consistency joint approach for infrared anti-UAV tracking
2203 -- 2217Jiayi Xu 0002, Xuan Tan, Yixuan Ju, Xiaoyang Mao, Shanqing Zhang. High similarity controllable face anonymization based on dynamic identity perception
2219 -- 2232Mohamed ElSayed, Mohamed Reda, Ahmed S. Mashaly, Ahmed Saleh 0004. LERFNet: an enlarged effective receptive field backbone network for enhancing visual drone detection
2233 -- 2249Jialin Zhu, He Wang 0002, David Hogg 0001, Tom Kelly. Learning to sculpt neural cityscapes
2251 -- 2270Suresh Cheekaty, G. Muneeswari. Advancing autism prediction through visual-based AI approaches: integrating advanced eye movement analysis and shape recognition with Kalman filtering
2271 -- 2283Huaping Zhou, Bin Deng, Kelei Sun, Shunxiang Zhang, Yongqi Zhang. UTE-CrackNet: transformer-guided and edge feature extraction U-shaped road crack image segmentation
2285 -- 2297Xiaoyang Zhao, Zhuo Wang 0008, Zhongchao Deng, Hongde Qin, Zhongben Zhu. Transmission-guided multi-feature fusion Dehaze network
2299 -- 2322Randa I. Elanwar, Margrit Betke. Generative adversarial networks for handwriting image generation: a review
2323 -- 2337Yixi Li, Yanzhe Liu, Rong Chen 0003, Hui Li, Na Zhao. Point cloud upsampling via a coarse-to-fine network with transformer-encoder
2339 -- 2376Neil Patrick Del Gallego, Joel Ilao, Macario O. Cordel II, Conrado R. Ruiz Jr. Training a shadow removal network using only 3D primitive occluders
2377 -- 2390Qunpo Liu, Qi Tang, Bo Su, Xuhui Bu, Naohiko Hanajima, Manli Wang. Wire rope damage detection based on a uniform-complementary binary pattern with exponentially weighted guide image filtering
2391 -- 2408Jianjian Jiang, Ziwei Chen, Fangyuan Lei, Long Xu, Jiahao Huang, Xiaochen Yuan. Multi-granularity hypergraph-guided transformer learning framework for visual classification
2409 -- 2424Yueqian Pan, Qiaohong Chen, Xian Fang. DAMAF: dual attention network with multi-level adaptive complementary fusion for medical image segmentation
2425 -- 2437Wei Li, Bowen Li, Jingqi Wang, Weiliang Meng, Jiguang Zhang, Xiaopeng Zhang 0001. ROMOT: Referring-expression-comprehension open-set multi-object tracking
2439 -- 2459Longfeng Shen, Bin Hou, Yulei Jian, Xisong Tu, Yingjie Zhang, Lingying Shuai, Fangzhen Ge, Debao Chen. TransFGVC: transformer-based fine-grained visual classification
2461 -- 2476Avantika Saklani, Shailendra Tiwari, H. S. Pannu. Deep attentive multimodal learning for food information enhancement via early-stage heterogeneous fusion
2477 -- 2493Xiang Suo, Weidi Tang, Lijuan Mao, Zhen Li. Digital human and embodied intelligence for sports science: advancements, opportunities and prospects
2495 -- 2510Jiaxuan Zhu, Ming Shao, Libo Sun, Siyu Xia. ACL-SAR: model agnostic adversarial contrastive learning for robust skeleton-based action recognition
2511 -- 2527Jiayan Wen, YuanSheng Zhuang, Junyi Deng. EDM: a enhanced diffusion models for image restoration in complex scenes
2529 -- 2544Canlin Li, Xinyue Wang, Ran Yi, Wenjiao Zhang, Lihua Bi, Lizhuang Ma. MCLGAN: a multi-style cartoonization method based on style condition information
2545 -- 2561Haobo Dong, Tianyu Song 0003, Xuanyu Qi, Jiyu Jin, Guiyue Jin, Lei Fan 0004. Exploring high-quality image deraining Transformer via effective large kernel attention
2563 -- 2594Surendrabikram Thapa, Abhijit Sarkar. A deep dive into enhancing sharing of naturalistic driving data through face deidentification
2595 -- 2605Runtao Xi, Jiahao Lyu 0001, Kang Sun, Tian Ma. Learning kernel parameter lookup tables to implement adaptive bilateral filtering
2607 -- 2627Yi-Lun Wang, Yi-zheng Lang, Yunsheng Qian. Effective multi-scale enhancement fusion method for low-light images based on interest-area perception OCTM and "pixel healthiness" evaluation
2629 -- 2638Alireza Dehghanpour, Zahra Sharifi, Masoud Dehyadegari. Point cloud downsampling based on the transformer features
2639 -- 2654Yabo Wu, Wenting Li, Ziyang Chen 0002, Hui-Wen, Zhongwei Cui, Yongjun Zhang 0007. Distribution-decouple learning network: an innovative approach for single image dehazing with spatial and frequency decoupling
2655 -- 2667Yumei Tan, Haiying Xia, Shuxiang Song 0001. Robust consistency learning for facial expression recognition under label noise
2669 -- 2690Wen-Kai Tsai, Hsin-Chih Wang. Real-time salient object detection based on accuracy background and salient path source selection
2691 -- 2708Nauman Ullah Gilal, Marwa K. Qaraqe, Jens Schneider 0002, Marco Agus. Autocleandeepfood: auto-cleaning and data balancing transfer learning for regional gastronomy food computing
2709 -- 2720Ying Ni, Xiaoli Wang, Hanghang Peng, Yonzhi Li, Jinyang Wang, Haoxuan Li, Jin Huang. Dual-branch dilated context convolutional for table detection transformer in the document images
2721 -- 2736Yubo Zhang, Lei Xu, Haibin Xiang, Haihua Kong, Junhao Bi, Chao Han. LKSMN: Large Kernel Spatial Modulation Network for Lightweight Image Super-Resolution
2737 -- 2754Xiaoyu Song, Dezhi Han, Chongqing Chen, Xiang Shen, Huafeng Wu. Vman: visual-modified attention network for multimodal paradigms
2755 -- 2766Zekang Liu, Wei Feng 0005, Liqing Gao, Lianyu Hu 0003. DBL-SC: background-independent sign language recognition based on spatial channel separation computation
2767 -- 2782Ze Ouyang, Huihuang Zhao, Yudong Zhang, Long Chen. STVDNet: spatio-temporal interactive video de-raining network
2783 -- 2800R. Raja Sekar, T. Dhiliphan Rajkumar, Koteswara Rao Anne. Deep fake detection using an optimal deep learning model with multi head attention-based feature extraction scheme
2801 -- 2815Lirong Li, Jiang Ding, Hao Cui, Zhiqiang Chen, Guisheng Liao. LiteMSNet: a lightweight semantic segmentation network with multi-scale feature extraction for urban streetscape scenes
2817 -- 2834Saba Ghazanfar Ali, Xiaoxia Wang, Ping Li, Huating Li, Po Yang 0001, Younhyun Jung, Jing Qin 0001, Jinman Kim, Bin Sheng 0001. EGDNet: an efficient glomerular detection network for multiple anomalous pathological feature in glomerulonephritis
2835 -- 2856Pan Wu, Jin Tang. FHFN: content and context feature hierarchical fusion networks for multi-focus image fusion
2857 -- 2873Ling-Xiao Qin, Hong-mei Sun, Xiao-Meng Duan, Cheng-Yue Che, Rui-sheng Jia. Adaptive learning-enhanced lightweight network for real-time vehicle density estimation
2875 -- 2889Jit Chatterjee, Maria Torres Vega. 3D-Scene-Former: 3D scene generation from a single RGB image using Transformers
2891 -- 2906Xinyi Liu, Guoheng Huang, Xiaochen Yuan, Zewen Zheng, Guo-Zhong, Xuhang Chen 0002, Chi-Man Pun. Weakly supervised semantic segmentation via saliency perception with uncertainty-guided noise suppression
2907 -- 2921Jiazhe Miao, Tao Peng 0006, Fei Fang, Xinrong Hu, Li Li 0094. TDGar-Ani: temporal motion fusion model and deformation correction network for enhancing garment animation details
2923 -- 2937Wei Song, Kaili Yang. Dual adaptive local semantic alignment for few-shot fine-grained classification
2939 -- 2951Changhong Shi, Weirong Liu 0002, Jiahao Meng, Xiongfei Jia, Jie Liu. Self-prior guided generative adversarial network for image inpainting
2953 -- 2972Chunyu Liu, Yixiao Jin, Zhouyu Guan, Tingyao Li, Yiming Qin, Bo Qian, Zehua Jiang, Yilan Wu, Xiangning Wang, Ying-Feng Zheng, Dian Zeng. Visual-language foundation models in medicine
2973 -- 2985Xin Zhao, Yinhuang Chen, Chengzhuan Yang, Lincong Fang. FuseNet: a multi-modal feature fusion network for 3D shape classification
2987 -- 3001Hao Li, Guoheng Huang, Xiaochen Yuan, Zewen Zheng, Xuhang Chen 0002, Guo-Zhong, Chi-Man Pun. Psanet: prototype-guided salient attention for few-shot segmentation

Volume 41, Issue 3

1415 -- 1433Yanfeng Zhao, Zhenjian Yang, Yunjie Zhang, Yadong Chen. BGFNet: boundary information-aided graph structure fusion network for semantic segmentation of remote sensing images
1435 -- 1451Shuo Tong, Han Liu 0007, Runyuan Guo, Wenqing Wang 0001, Ding Liu. Context-Aware Enhanced Virtual Try-On Network with fabric adaptive registration
1453 -- 1466Pengshu Du, Xiao Wang, Qi Zheng, Xi Wang, Weigang Li, Xin Xu. Glare countering and exploiting via dual stream network for nighttime vehicle detection
1467 -- 1484Yongli Liu, Degang Yang, Tingting Song, Yichen Ye, Xin Zhang. YOLO-SSP: an object detection model based on pyramid spatial attention and improved downsampling strategy for remote sensing images
1485 -- 1498Robin G. C. Maack, Felix Raith, Juan F. Pérez, Gerik Scheuermann, Christina Gillmann. A workflow to systematically design uncertainty-aware visual analytics applications
1499 -- 1509Qiguang Zhu, Qiang Cen, Yuxin Wang, Weidong Chen 0001, Shuo Liu. An underwater target recognition algorithm incorporating improved attention mechanism and downsampling
1511 -- 1525Wenyue Sun, Jindong Zhang, Yitong Liu. Adversarial-based refinement dual-branch network for semi-supervised salient object detection of strip steel surface defects
1527 -- 1541Jun Yang, Zilu Wu, Renbiao Wu. Micro-expression recognition based on contextual transformer networks
1543 -- 1554Ya Li, Ziming Li, Huiwang Liu, Qing Wang. ZMNet: feature fusion and semantic boundary supervision for real-time semantic segmentation
1555 -- 1571Jindrich Adolf, Peter Kán, Tiare Feuchtner, Barbora Adolfová, Jaromír Dolezal, Lenka Lhotská. Offistretch: camera-based real-time feedback for daily stretching exercises
1573 -- 1589Qunpo Liu, Zhiwei Lu, Ruxin Gao, Xuhui Bu, Naohiko Hanajima. SimpleMask: parameter link and efficient instance segmentation
1591 -- 1608Xiao Fang, Xin Gao, Baofeng Li, Feng Zhai, Yu Qin, Zhihang Meng, Jiansheng Lu, Chun-Xiao. A non-uniform low-light image enhancement method with multi-scale attention transformer and luminance consistency loss
1609 -- 1620Haibin Li, Aodi Guo, Yaqian Li. CCMA: CapsNet for audio-video sentiment analysis using cross-modal attention
1621 -- 1635Xun Zhao, Feiyun Xu, Zheng Liu. TransDehaze: transformer-enhanced texture attention for end-to-end single image dehaze
1637 -- 1654Qi Zhao, Congxuan Zhang, Zhibo Rao, Zhen Chen, Zige Wang, Ke Lu. GPDF-Net: geometric prior-guided stereo matching with disparity fusion refinement
1655 -- 1671Haihua Ding, Chuan Lin, Fuzhang Li, Yongcai Pan. A feature aggregation network for contour detection inspired by complex cells properties
1673 -- 1688Zhengwu Yuan, Peixian Tang, Xinguang Sang, Fan Zhang, Zheqi Zhang. Visionary: vision-aware enhancement with reminding scenes generated by captions via multimodal transformer for embodied referring expression
1689 -- 1704Munish Bhardwaj, Nafis uddin Khan, Vikas Baghel. Road crack detection using pixel classification and intensity-based distinctive fuzzy C-means clustering
1705 -- 1717Houfu Peng, Xing Lu, Daoxun Xia, Xiaoyao Xie. A novel image restoration solution for cross-resolution person re-identification
1719 -- 1731Caifeng Liu, Fangjie Gu. Differential motion attention network for efficient action recognition
1733 -- 1755Gang Zhang, Yang Geng, Zhao G. Gong. A comprehensive review of deep learning approaches for group activity analysis
1757 -- 1775Huijuan Wang, Xinyue Chen, Quanbo Yuan, Peng Liu. A review of 3D object detection based on autonomous driving
1777 -- 1788Libo Sun, Yifan Li, Wenhu Qin. PEPillar: a point-enhanced pillar network for efficient 3D object detection in autonomous driving
1789 -- 1809Mohamed Charfeddine Mzoughi, Najib Ben Aoun, Sami Naouali. A review on kinship verification from facial information
1811 -- 1825Jiawei Chen, Wen Su, Mengjiao Ge, Ye He, Jun Yu. To-Former: semantic segmentation of transparent object with edge-enhanced transformer
1827 -- 1840Ying Ma, Meng Wang, Guangyun Lu, Yajun Sun. Multi-label semantic sharing based on graph convolutional network for image-to-text retrieval
1841 -- 1854Xiafan Li, Hongyan Quan. MVPCL: multi-view prototype consistency learning for semi-supervised medical image segmentation
1855 -- 1872Yihe Nie, Xingbo Zhao, Yongxiang Li, Qianwen Lu, Qingchuan Tao, Yanmei Yu. DEAR: a novel deep-level semantics feature reinforce framework for Infrared Small Object Segmentation
1873 -- 1889Aokun Mei, Hua Huo, Jiaxin Xu, Ningya Xu. Multistage attention region supplement transformer for fine-grained visual categorization
1891 -- 1905Tong Li, Zhaoxuan Zhang, Yuxin Wang, Yan Cui, Yuqi Li, Dongsheng Zhou, Baocai Yin, Xin Yang. Self-supervised indoor scene point cloud completion from a single panorama
1907 -- 1920Xuyuan Zhang, Chen Xu 0004, Yu Han 0001, George Baciu. Fabric image recolorization by fuzzy pretrained neural network
1921 -- 1938Shilong Wang, Qianwen Hou, Jiaang Li, Jianlei Liu. TSID-Net: a two-stage single image dehazing framework with style transfer and contrastive knowledge transfer
1939 -- 1956Xiaohong Zhang, Shengwu Xiong 0001, Zhaoyang Sun, Jianwen Xiang. Semi-hard constraint augmentation of triplet learning to improve image corruption classification
1957 -- 1969Huijuan Wang, Boyan Cui, Quanbo Yuan, Gangqiang Pu, Xueli Liu, Jie Zhu. Mini-3DCvT: a lightweight lip-reading method based on 3D convolution visual transformer
1971 -- 1986Zhigang Huang, Wanli Xue, Yuxi Zhou, Jinlu Sun, Yazhou Wu, Tiantian Yuan, Shengyong Chen. Dual-stage temporal perception network for continuous sign language recognition
1987 -- 1998Zixuan Yu, Zhenjun Tang, Xiaoping Liang, Hanyun Zhang, Ronghai Sun, Xianquan Zhang. A novel image hashing with low-rank sparse matrix decomposition and feature distance
1999 -- 2010Shiyu Li, Zehao Liu, Meijing Gao, Yang Bai, Haozheng Yin. MDSCN: multiscale depthwise separable convolutional network for underwater graphics restoration
2011 -- 2027Suyi Liu, Fang Xu, Chengdong Wu, Jianning Chi, Xiaosheng Yu, Longxing Wei, Chuanjiang Leng. CMT-6D: a lightweight iterative 6DoF pose estimation network based on cross-modal Transformer
2029 -- 2046Jun Wu, Wanyu Nie, Yu Zheng, Gan Zuo, Jiaming Dong, Siwei Wei. Malleable pruning meets more scaled wide-area of attention model for real-time crack detection
2047 -- 2060Qiwang Li, Mingwen Shao, Fukang Liu, Yuanjian Qiao, Zhiyong Hu. Contrastive local constraint for irregular image reconstruction and editability
2061 Xiang Suo, Weidi Tang, Lijuan Mao, Zhen Li. Correction: Digital human and embodied intelligence for sports science: advancements, opportunities and prospects
2063 Dhruv Meduri, Mohit Sharma, Vijay Natarajan. Correction to: Jacobi set simplification for tracking topological features in time-varying scalar fields

Volume 41, Issue 2

785 -- 798Jianliang Li, Jinming Zhang, Xiaohai Zhang, Ming Chen. Edge-guided generative network with attention for point cloud completion
799 -- 813Haowei Zhu, Suqin Bai, Jinlong Shi, Chenggen Wang, Yunhan Sun, Jiawen Lu, Xin Shu, Shucheng Huang. IOFusion: instance segmentation and optical-flow guided 3D reconstruction in dynamic scenes
815 -- 829Chao Yang, Meng Yang 0011, HongYu Li, Linlu Jiang, Xiang Suo, Lijuan Mao, Weiliang Meng, Zhen Li. A survey on soccer player detection and tracking with videos
831 -- 851Sameer Bhimrao Patil, Suresh Shirgave. Instructor emotion recognition system using manta ray foraging algorithm for improving the content delivery in video lecture
853 -- 867Ting Yu, Weiliang Meng, Zhongqi Wu, Jianwei Guo, Xiaopeng Zhang 0001. Diff-pcg: diffusion point cloud generation conditioned on continuous normalizing flow
869 -- 881Yasmeen Cheema, Muhammad Nadeem Cheema, Anam Nazir, Fahad Ahmed KhoKhar, Ping Li 0016, Ayaz Ahmed. A novel approach for improving open scene text translation with modified GAN
883 -- 900Pengbin Fu, Ganyun Xiao, Huirong Yang. SATD: syntax-aware handwritten mathematical expression recognition based on tree-structured transformer decoder
901 -- 919Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Pablo Carballeira. Per-class curriculum for Unsupervised Domain Adaptation in semantic segmentation
921 -- 943Supriya Agrawal, Prachi Natu. OBB detector: occluded object detection based on geometric modeling of video frames
945 -- 960Xin Wang, Jin Feng, Jiajia Ding, Jun Gao. Light field salient object detection based on discrete viewpoint selection and multi-feature fusion
961 -- 973Zhizhen Zhou, Yejing Huo, Guoheng Huang, An Zeng, Xuhang Chen 0002, Lian Huang, Zinuo Li. QEAN: quaternion-enhanced attention network for visual dance generation
975 -- 990Shunsuke Takao. Underwater image sharpening and color correction via dataset based on revised underwater image formation model
991 -- 1006Junqing Yuan, Mengting Fan, Zhenyang Liu, Tongxuan Han, Zhenzhong Kuang, Chihao Pan, Jiajun Ding. Collaborative neural radiance fields for novel view synthesis
1007 -- 1020Can Zhang, Feipeng Da, Shaoyan Gai. Point clouds feature frequency domain analysis based on multilayer perceptron
1021 -- 1036Lei Wang, Xue-song Tang, Kuangrong Hao. GFPE-ViT: vision transformer with geometric-fractal-based position encoding
1037 -- 1048Fahad Ahmed KhoKhar, Jamal Hussain Shah, Rabia Saleem, Anum Masood. Harnessing deep learning for faster water quality assessment: identifying bacterial contaminants in real time
1049 -- 1059Yixiao Jin, Fu Gui, Minghao Chen, Xiang Chen, Haoxuan Li, Jingfa Zhang. Deep learning-driven automated quality assessment of ultra-widefield optical coherence tomography angiography images for diabetic retinopathy
1061 -- 1077Bo Qian, Xiangning Wang, Zhouyu Guan, Dawei Yang, An-ran Ran, Tingyao Li, Zheyuan Wang, Yang Wen, Xinming Shu, Jinyang Xie, Shichang Liu, Guanyu Xing, Julio Silva-Rodríguez, Riadh Kobbi, Ping Li 0016, Tingli Chen, Lei Bi 0001, Jinman Kim, Weiping Jia, Huating Li, Jing Qin 0001, Ping Zhang 0016, Ching Yu Cheng, Pheng-Ann Heng, Tien Yin Wong, Carol Y. Cheung, Yih Chung Tham, Nadia Magnenat-Thalmann, Bin Sheng 0001. HRDC challenge: a public benchmark for hypertension and hypertensive retinopathy classification from fundus images
1079 -- 1096Dapeng Yan, Gangyi Ding, Kexiang Huang, Tianyu Huang. Generating natural pedestrian crowds by learning real crowd trajectories through a transformer-based GAN
1097 -- 1108Yan Zhou, Xiang Chen, Tingyao Li, Shiqun Lin, Bin Sheng 0001, Ruhan Liu, Rongping Dai. GAMNet: a gated attention mechanism network for grading myopic traction maculopathy in OCT images
1109 -- 1125Gang Liu, Jiebang Wang, Yao Qian, Yonghua Li. Infrared and visible image fusion method based on visual saliency objects and fuzzy region attributes
1127 -- 1140Shweta Saboo, Joyeeta Singha. Semantic hand gesture integration system using self-co-articulation and movement epenthesis detection
1141 -- 1154Lars Zawallich. Unfolding polyhedra via tabu search
1155 -- 1170Bo Qian, Hao Chen 0011, Yupeng Xu, Yang Wen, Huating Li, Yuan Xie 0006, David Dagan Feng, Jinman Kim, Lei Bi 0001, Xun Xu, Xiangui He, Bin Sheng 0001. Deep contour attention learning for scleral deformation from OCT images
1171 -- 1181Lan Wei, Nikolaos M. Freris. Multi-scale graph neural network for physics-informed fluid simulation
1183 -- 1196Mengsi Guo, Mingfu Xiong, Jin Huang, Xinrong Hu, Tao Peng 0006. Face photo-sketch portraits transformation via generation pipeline
1197 -- 1211Mengsi Wang, Yuan Mei 0001, Lichun Yang, Bin Tian, Kaijun Wu 0001. SDR: stepwise deep rectangling model for stitched images
1213 -- 1226Qingkuo Meng, Yongjian Huai, Fei Ma, Wentao Ye, Haifeng Xu, Siyu Yang. Visualization of the occurrence and spread of wildfires in three-dimensional natural scenes
1227 -- 1239Xuan Miao, Shijie Li, Zheng Li, Wenzheng Xu, Ning Yang 0001. Multi-scale gated network for efficient image super-resolution
1241 -- 1249Václav Skala. A new fully projective O(lg N) line convex polygon intersection algorithm
1251 -- 1271Gaoming Yang, Yifeng Ding, Xianjin Fang, Ji Zhang 0001, Yan Chu. Fast face swapping with high-fidelity lightweight generator assisted by online knowledge distillation
1273 -- 1291Wensheng Li, Jing Zhang, Jiafeng Li, Li Zhuo. Unpaved road segmentation of UAV imagery via a global vision transformer with dilated cross window self-attention for dynamic map
1293 -- 1302Xiangning Wang, Zhouyu Guan, Bo Qian, Tingli Chen, Qiang Wu. A deep learning system for the detection of optic disc neovascularization in diabetic retinopathy using optical coherence tomography angiography images
1303 -- 1317Mei Zhang, Lingling Liu, Yongtao Pei, Guojing Xie, JingHua Wen. Semantic segmentation of multi-scale remote sensing images with contextual feature enhancement
1319 -- 1333Ya'nan Guan, Shujiao Liao, Wenyuan Yang. AParC-DETR: Accelerate DETR training by introducing Adaptive Position-aware Circular Convolution
1335 -- 1350Yong Liu, Xingyuan Li, Yong Liu, Wei Zhong. SimpliFusion: a simplified infrared and visible image fusion network
1351 -- 1366Liping Zhu, Silin Wu, Xianxiang Chang, Yixuan Yang, Xuan Li. Rethinking group activity recognition under the open set condition
1367 -- 1378Yuanqi Hu, Jianqi Zhang, Ling Bai, Jing Li, Bing Li, Ying Zang, Wenjun Hu. From sketch to reality: precision-friendly 3D generation technology
1379 -- 1394Wenxuan Liu, Xuemei Jia, Yihao Ju, Yakun Ju, Kui Jiang, Shifeng Wu, Luo Zhong, Xian Zhong. Fragrant: frequency-auxiliary guided relational attention network for low-light action recognition
1395 -- 1408Wuzhen Shi, Fei Tao, Yang Wen. Joint super-resolution-based fast face image coding for human and machine vision
1409 -- 1411Shengzhou Luo, Jingxing Xu, John Dingliana, Mingqiang Wei, Lu Han, Lewei He, Jiahui Pan. Publisher Correction: Twinenet: coupling features for synthesizing volume rendered images via convolutional encoder-decoders and multilayer perceptrons
1413 -- 1414Liwen Huang, Shujiao Liao, Wenyuan Yang. Correction: DC-PSENet: a novel scene text detection method integrating double ResNet-based and changed channels recursive feature pyramid

Volume 41, Issue 10

7013 -- 7025Mengyao Liu, Ruhan Liu, Jia Shu, Qirong Liu, Yuan Zhang, Lixin Jiang. AutoDDH: A dual-attention multi-task network for grading developmental dysplasia of the hip in ultrasound images
7027 -- 7047Lakshita Agarwal, Bindu Verma. Enriching image description generation through multi-modal fusion of VGG16, scene graphs and BiGRU
7049 -- 7061Main Uddin, Zhangjie Fu, Xiang Zhang. Deepfake face detection via multi-level discrete wavelet transform and vision transformer
7063 -- 7078Mengnan Hu, Qianli Zhou, Rong Wang. Bridging visible and infrared modalities: a dual-level joint align network for person re-identification
7079 -- 7092Hao Liu, Ye Liu, Shuanglong Yao, Tongshuai Yu, Ke Gao, Pengcheng Hao, Shuqing He, Ji Chen, Xing Wang. ISTFormer: lightweight transformer for enhanced super-resolution of coal rock images via iterative feature extraction
7093 -- 7108Zhehang Qiu, Huijuan Zhang, Jie Zhou, Jianming Zhan. Image restoration for both deblurring and dehazing based on multi-channel frequency information using deep neural network
7109 -- 7121Xi Li, Yulong Feng, Xianguo Yu, Yirui Cong, Lili Chen. Epipolar constraint-guided differentiable keypoint detection and description
7123 -- 7139Wei Pan, Zhe Yang 0005. A lightweight enhanced YOLOv8 algorithm for detecting small objects in UAV aerial photography
7141 -- 7167Sung-Wook Park, Se-Hoon Jung, Chun-Bo Sim. NeXtSRGAN: enhancing super-resolution GAN with ConvNeXt discriminator for superior realism
7169 -- 7184Yuyan Liu, Qing Zhang, Yilin Zhao, Yanjiao Shi. A dual-stream learning framework for weakly supervised salient object detection with multi-strategy integration
7185 -- 7199Guoquan Jiang, Canyu Wang, Zhanqiang Huo, Huan Xu. Multi-channel correlated diffusion for text-driven artistic style transfer
7201 -- 7214Lihua Yang, Jinxian Zhao, Ziming Wang, Yuheng Liu, Dazhao Chi. M-KANUNet: enhanced defect segmentation in X-ray images of copper pipe welds via multi-scale representation and Kolmogorov-Arnold Networks
7215 -- 7232Xingyue Zou, Jiqiang Tang. Guided fusion of infrared and visible images using gradient-based attentive generative adversarial networks
7233 -- 7248Lei Dai, Wen Gao, Chengyu Tang, Min Wang, Zhihua Chen. MTMFNet: multi-threshold and multi-scale feature fusion network for text detection
7249 -- 7267Huaiguang Cai, Yang Yang 0056, Yongqiang Tang, Zhengya Sun, Wensheng Zhang 0002. Shapley value-based class activation mapping for improved explainability in neural networks
7269 -- 7283Wei Song, Yaobin Huang. Adaptive feature recalibration transformer for enhancing few-shot image classification
7285 -- 7302Jialin Zhang, Xiao Wang, Hui Wei, Kui Jiang, Nan Mu, Zheng Wang. Context-aware target texture perturbation attack for concealed object detection
7303 -- 7317Qida Cao, Jiajun Ding, Zhenyang Liu, Zhenzhong Kuang, Yijie Shao, Yilan Shen. VC-GS: view-consistent deblurring Gaussian splatting via alternating branch optimization
7319 -- 7340Fuqiang Gou, Yonglong Li, Yanpian Mao, Chunyao Hou, Gang Wan, Jialong Li, Haoran Wang, Yongcan Chen. Planar tunnel point cloud fine registration under multiple constraints
7341 -- 7350Haitian Ren, Quinten Kwok, Meng Sun, Xuyan Huang, Jianlin Zhu, Haoxuan Li. Toward artificial general intelligence in health care
7351 -- 7365Chen-Bin Feng, Qi Lai, Kangdao Liu, Houcheng Su, Hao Chen, Kaixi Luo, Chi-Man Vong. Learning few-shot semantic segmentation with error-filtered segment anything model
7367 -- 7377Peng Zhang, Yuming Yan, Yuangao Ai, Benhong Wang, Houming Shen, Zhonghan Peng. Unet-based image segmentation and binarization for water level detection
7379 -- 7397Manuel Silva, Antonio Seoane, Omar A. Mures, Antonio M. López 0001, José Antonio Iglesias Guitián. Exploring the effects of synthetic data generation: a case study on autonomous driving for semantic segmentation
7399 -- 7415Ronggui Wang, Hong Chen, Juan Yang, Lixia Xue. Adaptive sparse triple convolutional attention for enhanced visual question answering
7417 -- 7432Die Yu, Zhaoyan Fang, Yong Jiang. Alleviating category confusion in fine-grained visual classification
7433 -- 7446Haomiao Liu, Hao Xu, Chuhuai Yue, Bo Ma. Adaptive objectness learning for enhanced unknown object detection
7447 -- 7458Xinbiao Lu, Yisen Chen, Yudan Chen, Xing Gao, Tieliu Yang, Guiyun Chen. STIG-Net: a spatial-temporal interactive graph framework for recognizing violent behaviors in videos
7459 -- 7475Keqi Li, Yaping Wan, Gang Zou, Wangxiu Li, Jian Yang, Changyi Xie. Enhancing facial action unit recognition through topological feature integration and relational learning
7477 -- 7491Yuenan Wang, Hua Wang, Fan Zhang 0045. Mask autoencoder for enhanced image reconstruction with position coding offset and combined masking
7493 -- 7508Haowei Zhu, Suqin Bai, Jinlong Shi, Jiawen Lu, Xin Zuo, Shucheng Huang, Xu Yao. Ellipsoid-SLAM: enhancing dynamic scene understanding through ellipsoidal object representation and trajectory tracking
7509 -- 7520Daikun Qu, Hongwei Zhao, Mingzhu Zhou. Unsupervised video object segmentation with mask transformer: boosting accuracy and efficiency through feature fusion
7521 -- 7533Cheng Zhong, Xiaomin Yu, Huan Xia, Rongdong Xie, Qingyi Xu. Restoring intricate Miao embroidery patterns: a GAN-based U-Net with spatial-channel attention
7535 -- 7549Jinyang Wang, Jihong Wang, Haoxuan Li, Xiaojun Huang, Jun Xia, Zhen Li, Weibing Wu, Bin Sheng. Temporal goal-aware transformer assisted visual reinforcement learning for virtual table tennis agent
7551 -- 7565Junchi Ma, Yuanqing Wang, Guangmiao Ding, Wei Cao, Xiangyun Liao, Ping Zhang, Jianping Lv. Mamba-enhanced hierarchical attention network for precise visualization of hippocampus and amygdala
7567 -- 7584Yuhao Zhang, Jiaqi Tong, Honglin Liu. SCAP: enhancing image captioning through lightweight feature sifting and hierarchical decoding
7585 -- 7601Yan Zhang, Xueting Sang, Yemei Sun, Shudong Liu, Shengpei Zhou. DMTNet: dual-domain adaptive multi-scale feature fusion network with transformer for small target detection
7603 -- 7616Xiaochun Wu, Ning Guo. MGSLU-Net: a lightweight network for efficient detection of water leakage in subway tunnel linings
7617 -- 7640Kehao Chen, Zhiping Zhou, Kewei Li, Taoyong Su, Zhaozhong Zhang, Jinhua Liu, Chenghao Ying. Red green blue-depth salient object detection based on multi-scale refinement and cross-modalities fusion network
7641 -- 7656Fang Zhou, Tingting Yang, Liuyan Tan, Xiaolong Xu, Mengdao Xing. DAP-Net: enhancing SAR target recognition with dual-channel attention and polarimetric features
7657 -- 7670Cheng Jiang, Pengle Zhang, Ying Ni, Xiaoli Wang, Hanghang Peng, Sen Liu, Mengdi Fei, Yuxin He, Yaxuan Xiao, Jin Huang, Xingyu Ma, Tian Yang. Multimodal retrieval-augmented generation for financial documents: image-centric analysis of charts and tables with large language models
7671 -- 7685Zhaozhao Yang, Yuhai Yu, Yongdong Huang, Jiana Meng. Innovative approaches in image processing: enhancing feature extraction and recognition capabilities
7687 -- 7702Yihao Li, Junyu Liu, Xiaoyu Guan, Hanming Hou, Tianyu Huang. Introducing anisotropic fields for enhanced diversity in crowd simulation
7703 -- 7721Liming Wan, Lin Song, Ying Zhou, Chenrui Kang, Shijian Zheng, Guo Chen. Dynamic neighbourhood-enhanced UNet with interwoven fusion for medical image segmentation
7723 -- 7733Haomou Bai, Yue Sang. Ultra-lightweight convolutional network for efficient single-image super-resolution
7735 -- 7750Sathish Mothe, Srinivas Kankanala. Multi-stage residual network with two fold attention mechanisms for low-light image enhancement
7751 -- 7766Xie Chengjie, Lu Shuhua, Shi Yangyu, Zheng Diwen. Joint perturbation consistency across image and feature levels for cross-domain adaptive crowd counting
7767 -- 7780Pengyun Chen, Shuang Cui, Ning Cao, Wenhao Zhang, Pengfei Wang, Shaohui Jin, Mingliang Xu. Lightweight multi-scale feature fusion with attention guidance for passive non-line-of-sight imaging
7781 -- 7798Wu Shili, Guo Yongkun, Qian Chao, Li Ying, Zhang Xinyou. Global attention and context encoding for enhanced medical image segmentation
7799 -- 7815Xiang Shijie, Zhou Dong, Tian Dan. Multi-scale feature fusion network for real-time semantic segmentation of urban street scenes: enhancing detail retention and accuracy
7817 -- 7838Hao Li, Shengkun Wu, Lei Deng, Chenhua Liu, Yifan Chen, Hanrui Chen, Heng Yu, Mingli Dong, Lianqing Zhu. Enhancing infrared and visible image fusion through multiscale Gaussian total variation and adaptive local entropy
7839 -- 7854Duo Liu, Guoyin Zhang, Yiqi Shi, Ye Tian, Liguo Zhang. Efficient feature difference-based infrared and visible image fusion for low-light environments
7855 -- 7865Weichen Dai 0001, Hexing Wu, Xiaoyang Weng, Wanzeng Kong. Implicit guidance for enhancing low-light optical flow estimation via channel attention networks
7867 -- 7882Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Jose M. Martínez. Layer-wise model merging for unsupervised domain adaptation in segmentation tasks
7883 -- 7907Xinzhi Li, Yong Liu, Peng Yan. Optimizing feature map matching for marine benthic organism detection
7909 -- 7923Zhen Song, Jianhua Chen. Adaptive rate compression for distributed video sensing in wireless visual sensor networks
7925 -- 7938Jinxing Liang, Kaifang Han, Dongsheng Li, Ruixin Gao, Jiajia Peng, Tao Peng, Xinrong Hu. Enhancing low-frequency stitch code generation for knitted fabrics: an LFSCG-E-Net approach
7939 -- 7950Jiahao Wang, Yongqiang Wang, Congling Zhou, Jiawei Huang. LF-RTMDet: an instance segmentation algorithm for real-time detection of water-filled barriers
7951 -- 7963Xijun Wang, Xin Zhou, Yi Wang, Songto Zeng, Xinyu Liu, Haobo Shen, Song Fei, Lei Zhu. Msu-mamba: multi-scale defocus blur detection using cross-scale fusion and state-space models
7965 -- 7981Xite Wang, Changsheng Qin, Mei Bai, Qian Ma 0003, Guanyu Li. CAFormer: a connectivity-aware vision transformer for road extraction from remote sensing images
7983 -- 7995Zhenghao Xie, Junfen Chen, Yingying Wang, Bojun Xie. Enhanced fine-grained relearning for skeleton-based action recognition
7997 -- 8008Doudou Zhang, Junchi Ma, Jie Chen, Linxia Xiao, Xiangyun Liao, Yong Zhang, Weixin Si. MF-SAM: enhancing multi-modal fusion with Mamba in SAM-Med3D for GPi segmentation
8009 -- 8023Wubin Shi, Shaoyan Gai, Feipeng Da, Zeyu Cai, Jiaoling Wang. GRPoseNet: a generalizable and robust 6D object pose estimation network using sparse RGB views
8025 -- 8040Zongyu Ye, Hongjuan Yan, Yewang Sun, Bin Li, Lei Liu, Wenbo Wu. MSPNet: real-time semantic segmentation with large kernel and atrous convolutions
8041 -- 8053Zhengwei Guo, Bo Wang. Enhancing sandstorm images via color-guided spatial-frequency fusion network
8055 -- 8073Yu Pang, Yang Huang, Chenyu Weng, Jialin Lyu, Chuanyue Bai, Xiaosheng Yu. Enhanced RGB-T saliency detection via thermal-guided multi-stage attention network
8075 -- 8087Xiang Chen, Yuanqi Yao, Zhouyu Guan, Chenyang Li, Jian Guan, Jun Pu, Ruhan Liu, Bin Sheng 0001, Shankai Yin, Yiming Qin. DSTS-GF: a dual-stream temporal-spatial transformer with gated fusion for the classification of Obstructive Sleep Apnea
8089 -- 8101Yuanqi Yao, Zehua Jiang, Zhouyu Guan, Yilun Luxue, Seungmin Lee, Xiang Chen, Haodong Yang, Yiming Qin. A visual-language foundation model for disease diagnosis and doctor-patient co-decision
8103 -- 8116Shigang Hu, Darong Wu, Jianxin Wang, Shijun Huang. The image super-resolution network based on dual-branch feature interaction attention mechanism
8117 -- 0Tao Shi, Yao Ding 0012, Kui-feng Zhu, Yan-jie Su. Correction: DFP-YOLO: a lightweight machine tool workpiece defect detection algorithm based on computer vision
8119 -- 0Sung-Wook Park, Se-Hoon Jung, Chun-Bo Sim. Correction: NeXtSRGAN: enhancing super-resolution GAN with ConvNeXt discriminator for superior realism

Volume 41, Issue 1

1 -- 2Nadia Magnenat-Thalmann. Welcome to the Year 2025
3 -- 10Acknowledgement to reviewers 2024
11 -- 24Wenji Yang, Liping Xie, Wenbin Qian, Canghai Wu, Hongyun Yang. Coarse-to-fine cascaded 3D hand reconstruction based on SSGC and MHSA
25 -- 40Gusu Song, Shaoyan Gai, Feipeng Da. Memory-based gradient-guided progressive propagation network for video deblurring
41 -- 51Rohit Pratap Singh, Dolendro Singh Laiphrakpam. Dyhand: dynamic hand gesture recognition using BiLSTM and soft attention methods
53 -- 66Zhe Li, Hui Lv, Libo Cheng, Xiaoning Jia. Image deblocking algorithm based on GC and SSR
67 -- 78I-Chao Shen, Li-wen Su, Yu-Ting Wu, Bing-Yu Chen 0004. StylePart: image-based shape part manipulation
79 -- 97Youssef Ait Khouya, Mohammed Ait Oussous, Abdeslam Jakimi, Faouzi Ghorbel. Stable and invertible invariants description for gray-level images based on Radon transform
99 -- 114Mahmoud A. Eldosoky, Jianping Li 0002, Amin Ul Haq, Fanyu Zeng, Mao Xu, Shakir Khan, Inayat Khan. WallNet: Hierarchical Visual Attention-Based Model for Putty Bulge Terminal Points Detection
115 -- 128Rajendra Nagar. Robust extrinsic symmetry estimation in 3D point clouds
129 -- 140Chen Zhao, Weiling Cai, Zheng Yuan. Spectral normalization and dual contrastive regularization for image-to-image translation
141 -- 155Ziliang Feng, Ju Zhang, Xusong Ran, Donglu Li, Chengfang Zhang. Ghost-Unet: multi-stage network for image deblurring via lightweight subnet learning
157 -- 171Chunlu Li, Feipeng Da. Refined dense face alignment through image matching
173 -- 189Xiongbo Lu, Feng Liu, Yi Rong, Yaxiong Chen, Shengwu Xiong 0001. MakeupDiffuse: a double image-controlled diffusion model for exquisite makeup transfer
191 -- 208Junjie Liu, Junlong Liu, Rongxin Jiang, Boxuan Gu, Yaowu Chen, Chen Shen 0003. Boosted verification using siamese neural network with DiffBlock
209 -- 227Xujia Qin, Xinyu Li, Mengjia Li, Hongbo Zheng, Xiaogang Xu. Self-supervised single-image 3D face reconstruction method based on attention mechanism and attribute refinement
229 -- 241Xiaochun Lei, Zeyu Chen, Zhaoxin Yu, Zetao Jiang. BENet: boundary-enhanced network for real-time semantic segmentation
243 -- 255Feihu Bian, Suya Xiong, Ran Yi, Lizhuang Ma. Multi-view stereo-regulated NeRF for urban scene novel view synthesis
257 -- 270Hengrui Zhang, Yongfeng Qi, Huili Chen, Panpan Cao, Anye Liang, Shengcong Wen. LSDNet: lightweight stochastic depth network for human pose estimation
271 -- 280Zubair Ahmad Lone, Alwyn Roshan Pais. Salient object detection in HSI using MEV-SFS and saliency optimization
281 -- 301Clement Mailhe, Amine Ammar, Francisco Chinesta, Dominique Baillargeat. Towards improving synthetic-to-real image correlation for instance recognition in structure monitoring
303 -- 314Yue Yu, Yue Yang, Jingshuo Xing. PMGAN: pretrained model-based generative adversarial network for text-to-image generation
315 -- 330Haoyu Xiong, Yu Xiang. Robust gradient aware and reliable entropy minimization for stable test-time adaptation in dynamic scenarios
331 -- 343Zhixuan Tang, Haiyun Shen, Peng Yu, Kaisong Zhang, Jianyu Chen. Infrared tracking for accurate localization by capturing global context information
345 -- 358Yixiu Liu, Long Zhan, Yu Feng, Pengju Si, Shaowei Jiang, Qiang Zhao, Chenggang Yan 0001. Loose-tight cluster regularization for unsupervised person re-identification
359 -- 382Le-Anh Tran, Dong-Chul Park 0002. Encoder-decoder networks with guided transmission map for effective image dehazing
383 -- 397Yixiu Liu, Tao Jiang, Pengju Si, Shangdong Zhu, Chenggang Yan, Shuai Wang 0003, Haibing Yin. Unpaired semantic neural person image synthesis
399 -- 408Yan Huang, Xinchang Lu, Jia Fu. Single image reflection removal via self-attention and local discrimination
409 -- 421Ziyang Chen 0002, Yang Zhao, Junling He, Yujie Lu, Zhongwei Cui, Wenting Li, Yongjun Zhang 0007. Feature distribution normalization network for multi-view stereo
423 -- 435Dayu Jia, Yanwei Pang, Jiale Cao, Jing Pan. SSNet: a joint learning network for semantic segmentation and disparity estimation
437 -- 449Ye Li, Wu Zhang, Meiling Wu, Di Zhang, Zhiguo Wang, Changjiang You. Multi-keypoints matching network for clothing detection
451 -- 464Zhentao Zhang, Wenhao Li, Yuxi Cheng, Qingnan Huang, Taorong Qiu. An improved residual learning model and its application to hardware image classification
465 -- 479Ping Ma, Xinyi He, Yiyang Chen, Yuan Liu 0021. ISOD: improved small object detection based on extended scale feature pyramid network
481 -- 490Jian-xiong, Jie Wu, Ming Tang, Pengwen Xiong, Yushui Huang, Hang Guo. Combining YOLO and background subtraction for small dynamic target detection
491 -- 516Henry Senior, Gregory G. Slabaugh, Shanxin Yuan, Luca Rossi 0011. Graph neural networks in vision-language image understanding: a survey
517 -- 534Yuanhao Chai, Jingyu Gong, Xin Tan 0002, Jiachen Xu, Yuan Xie 0006, Lizhuang Ma. Learnable scene prior for point cloud semantic segmentation
535 -- 548Kunhong Xiong, Linbo Qing, Lindong Li, Li Guo 0018, Yonghong Peng. Facial expression recognition based on local-global information reasoning and spatial distribution of landmark features
549 -- 562Lixia Xue, Wenhao Wang, Ronggui Wang, Juan Yang 0001. Modular dual-stream visual fusion network for visual question answering
563 -- 577Jinguang Chen, Xin Zhang, Lili Ma, Bo Yang, Kaibing Zhang. CS-VITON: a realistic virtual try-on network based on clothing region alignment and SPM
579 -- 590Huihui Li, Junhao Zhu, Guihua Wen, Haoyang Zhong. Structural self-contrast learning based on adaptive weighted negative samples for facial expression recognition
591 -- 604Lihuan Zheng, Wanru Xu, Zhenjiang Miao, Xinxiu Qiu, Shanshan Gong. RESTHT: relation-enhanced spatial-temporal hierarchical transformer for video captioning
605 -- 624Yanxiang Hu, Panpan Wu, Bo Zhang 0101, Wenhao Sun, Yaru Gao, Caixia Hao, Xinran Chen. A new multi-focus image fusion quality assessment method with convolutional sparse representation
625 -- 638Shuyu Xiao, Yongfang Wang, Yihan Wang. SISIM: statistical information similarity-based point cloud quality assessment
639 -- 658Jing Wu, Hao Wu 0015, Guowu Yuan. Detail-aware image denoising via structure preserved network and residual diffusion model
659 -- 674Luhan Wang, Jun Li, Shangwei Guo, Shaokun Han. A cascaded graph convolutional network for point cloud completion
675 -- 693Zhongxu Li, Qihan He, Wenyuan Yang. E-FPN: an enhanced feature pyramid network for UAV scenarios detection
695 -- 708Jiakun Zhao, Yige Cai. SCAKD: a knowledge distillation framework based on spatial-corner attention for infrared and visible image fusion
709 -- 722Hao Zhou, Junjie Yin, Yilun Yang, Meie Fang, Ping Li. Topology-guided accelerated vector field streamline visualization
723 -- 737Kun Wu, Lei Zhu 0010, Weihang Shi, Wenwu Wang 0008. Automated fabric defect detection using multi-scale fusion MemAE
739 -- 757A. Lubna, Saidalavi Kalady, A. Lijiya. Visual question answering on blood smear images using convolutional block attention module powered object detection
759 -- 772Xiyu Wei, Yanmei Dong, Qin Liu, Lei Wang, Liantang Lou. Robust corner detection in continuous space
773 -- 783Jing Zhao, Yongjun He, Zheng Shi, Jian Qin, Yining Xie. A style-aware network based on multi-task learning for multi-domain image normalization