The Visual Computer - researchr journal

researchr

You are not signed in
Sign in
Sign up

6331	--	6333	Nadia Magnenat-Thalmann. Editorial issue July 2025
6335	--	6348	Wonjun Lee. Multilevel Monte Carlo for asymptotically efficient path tracing
6349	--	6361	Teng Zhang, Bo Yang, Jianlin Zhu, Xincheng Hu. Scene-Enhanced Social Interpretable Movement Behavior for Multimodal Pedestrian Trajectory Prediction
6363	--	6374	Naoki Kita. StencilQR: connectivity-enhanced fabricable QR codes for stencil
6375	--	6386	Yi Jiang, Yiqian Wu, Hao Xu, Xiwen Shi, Xiaogang Jin 0001. Geometry guidance diffusion image morphing with large shape difference
6387	--	6399	Yanping Fu, Yuting Zhang, Dengdi Sun, Shaojie Zhang, Haifeng Zhao 0001. Single image shadow removal using 2D signed distance field
6401	--	6412	Xiaonan Fang, Muhan Chang. Video sketching using multi-domain guidance and implicit encoding
6413	--	6424	Wenguang Chen, Dong Xiao, Renjie Chen. Bijective spherical parameterization via stereographic projection
6425	--	6437	Haipeng Wang. Submodular-based view selection for low-quality points rendering with multi-feature point-based NeRF
6439	--	6452	Shihao Zheng, Huisi Wu, Zhijian Gao, Ping Li. Few-shot medical image segmentation via query transformation learning
6453	--	6464	Yuan-Hao Jiang, Kezong Tang, Zi-Wei Chen, Yuang Wei, Tian-Yi Liu, Jiayi Wu. MAS-KCL: knowledge component graph structure learning with large language model-based agentic workflow
6465	--	6477	Xiaojiao Guo, Shenghong Luo, Yihang Dong, Zexiao Liang, Zimeng Li, Xiujun Zhang, Xuhang Chen. An asymmetric calibrated transformer network for underwater image restoration
6479	--	6491	Renjie Zhang, Xin Wang, George Baciu, Ping Li. Distilling complementary information from temporal context for enhancing human appearance in human-specific NeRF
6493	--	6505	Feiwei Qin, Liangzhe Zhu, Zijian Xu, Meie Fang, Ping Li. CADGCL: unsupervised retrieval of CAD models via boundary representations
6507	--	6519	Jie Zhao, Ju Dai, Feng Zhou 0007, JunJun Pan, HongWen Xu. Dual-path spatio-temporal Mamba for skeleton-based action recognition
6521	--	6532	Shu Liu, Yilin Huang, Hongyun Yu, Yan Xu. AMNet: an attention-enhanced multi-branch network for micro-expression recognition
6533	--	6546	Yun Pei, Lingbo Liu, Runqing Jiang, Ye Zhang, Pengpeng Yu, Liang Lin, Yulan Guo. Energy-guided test-time adaptation for data shifts in multi-modal perception
6547	--	6560	Cheng Fang, Siyan Zhu, JunJun Pan. Enhanced material point method with affine projection stabilizer for efficient hyperelastic simulations
6561	--	6569	Pengpei Hong, Chuhua Xian, Hongmin Cai, Jiazhou Chen, Guiqing Li. Batch Specular Manifold Sampling for caustics rendering
6571	--	6585	Yuhang Yi, Yan Gui, Zhuo Liu. Boosting memory network for video object segmentation in complex scenes
6587	--	6600	Yuval Onn, Haggai Maron, Ayellet Tal. Attention-guided self-supervised distinctive region detection in point clouds
6601	--	6615	Qingzheng Wang, Ning Li, Jiazhi Xie, Wenhui Liu, Xingqin Wang, Zengwei Mai. Unified cross-domain refinement network for camouflaged object detection
6617	--	6629	Runqiao Li, Qiujie Dong, Shuangmin Chen. RevolRecon: Neural Representation for Reconstructing Surface of Revolution
6631	--	6644	Kai Yang, Wenhao Zhang, Ping Li, Jinxing Liang, Tao Peng 0006, Jia Chen, Li Li, Xinrong Hu, Junping Liu. ViT-BF: vision transformer with border-aware features for visual tracking
6645	--	6656	Xijun Wang, Xin Zhou, Yi Wang, Songto Zeng, Xinyu Liu, Haobo Shen, Xianying Wang, Ping Li, Lei Zhu. RainRWKV: a deep RWKV model for video deraining
6657	--	6670	Sen Peng, Yihang Fu, Runjie Miu, Tianyi Lv, Baorong Yang, Xiao Dong. GenericAvatar: generic human modeling from monocular video based on mesh-guided Gaussians
6671	--	6685	Shengjun Liu, Ting Zhang, Ruoxi Deng, Xinru Liu, Hanchao Liu. Physics-guided deep learning framework with attention for image denoising
6687	--	6700	Qiuyue Zhang, Zhiwang Zhang, Shiting Wen, Chaoyi Pang, Fangyu Wu 0001. Boosting remote semantic segmentation using vision-and-language foundation model
6701	--	6714	Yixiao Feng, Weihua Tong, Zhangjin Huang. High-quality neural surface reconstruction from unoriented point clouds via multilevel tensor product B-spline hash encoding and viscosity regularization
6715	--	6727	Jian Lin, Chengze Li, Xueting Liu, Zhongping Ge. Instance-guided anime editing with a curated large-scale dataset
6729	--	6743	Baofeng Zhou, Xianyong Fang, Linbo Wang, Zhengyi Liu. SemanticAvatar: human surface reconstruction based on semantically consistent biplane features
6745	--	6755	Muyang Zhang, Weiliang Meng, Mingda Jia, Jiaming Gu, Yihua Shao, Changwei Wang 0001, Rongtao Xu, Zhihao Ma, Xiaopeng Zhang 0001. PDFT: parameter-diminish fine-tuning for transformer-based models
6757	--	6768	Taoqi Bao, Jiangnan Ye 0002, Zhankong Bao, Chee Siang Leow, Haoji Hu, Jianfeng Lu, Issei Fujishiro, Jiayi Xu. L2H-NeRF: low- to high-frequency-guided NeRF for 3D reconstruction with a few input scenes
6769	--	6781	Taishi Ito, Yuki Endo, Yoshihiro Kanamori. Selfage: personalized facial age transformation using self-reference images
6783	--	6794	Jianning Chi, Mingyang Sun, Zelan Li, Geng Lin, Ying Huang. Adaptive box-level supervision with superpixel shape guidance for ultrasound image segmentation
6795	--	6807	J. Antony, M. Reghunath, Safeer Babu Thayyil, M. Ramanathan 0001. ConDT: A 2D curve reconstruction algorithm based on a constrained neighbor proximity graph
6809	--	6821	Yiyi Wang, Jia Su, Song Zhang, Eisei Nakahara. RaEUNet: a retentive and efficient UNet for medical image segmentation
6823	--	6835	Zizhao Peng, Zihan Wang, Mengying Sun, Zheng Lv, Yan Wang, Ping Li, Fengwei An. Graph convolutional networks for 3D skeleton-based scoliosis screening using gait sequences
6837	--	6849	Min Shi 0005, Guo-Liang Zhao, Shi-sheng Guo, Bi-lian Sun, Dengming Zhu, Xiu-juan Chai, Zhao-Xin Li, Xinru Zhuo. Generating 3D fish motion skeleton via iterative optimization method and FishSkeletonNet
6851	--	6864	Peng Yu, Zhiyang Ji, Aimin Hao, Yang Gao 0032. Real-time immersive haptic sculpting with elastoplastic virtual clay
6865	--	6878	Enxu Zhao, Jianchi Sun, Fei Luo 0004, Chunxia Xiao. EE-Head: emotion estimation for precise facial expression in NeRF head avatars
6879	--	6890	Linling Jiang, Xin Wang, Fan Zhang 0045, Caiming Zhang. Transforming time and space: efficient video super-resolution with hybrid attention and deformable transformers
6891	--	6904	Huibiao Wen, Lei Wang, Shuang-Min Chen, Shiqing Xin, Chongyang Deng, Ying He 0001, Wenping Wang, Changhe Tu. ImS: implicit shell for the sandwich-walled space surrounding polygonal meshes
6905	--	6915	Tsukasa Fukusato, Akinobu Maejima, Takeo Igarashi. Locality-Preserving Free-Form Deformation
6917	--	6929	Jiawei Xu, Qiangqiang Zhou, Jiacong Yu, Chen Liao, Dandan Zhu. Semantic-Orthogonal Multi-modal Attention Network for RGB-D Salient Object Detection
6931	--	6941	Yunlong Liao, Yiting Lin, Zheng Xing, Xiaochen Yuan. Privacy Image Secrecy Scheme Based on Chaos-Driven Fractal Sorting Matrix and Fibonacci Q-Matrix
6943	--	6954	Ruiling Li, Ming Gao, Xiaogang Jin. Recognize Me If You Can: Two-stream Adversarial Transfer for Facial Privacy Protection using Fine-grained Makeup
6955	--	6967	MinJae Seo, Inhyung Jung, Jinhoon Choi, Kyoungju Park. PhysAvatar: physically plausible avatar generation from sparse tracking
6969	--	6982	Ruhao Wang, Yu Jiang, Huizhi Zhu, Fei Luo 0004, Chunxia Xiao. HumanIR-MGI: human inverse rendering via jointly optimizing geometry, material, and illumination
6983	--	6997	Bingchen Yang, Haiyong Jiang, Zhengda Lu, Jun Xiao 0005. Exploring Structural Lines for Interior Floorplan Segmentation
6999	--	7012	Haibo Wang, Qinsong Li, Ling Hu, Haojun Xu, Jing Meng, Xinru Liu, Yu-Kun Lai, Shengjun Liu. TriAlign: revisiting deep functional map from map representation alignment perspectives

5223	--	5233	Shiyun Zhang, Xing Deng, Haijian Shao, Yingtao Jiang. ImpRes: implicit residual diffusion models for image super-resolution
5235	--	5250	Imen Labiadh, Larbi Boubchir, Hassene Seddik. Optimization of 2D and 3D facial recognition through the fusion of CBAM AlexNet and ResNeXt models
5251	--	5266	He Yu, Kang Yan, Jiexi Chen, Xuan Li, Jinming Guo, Xiaoxue Xing, Tao Huang 0008. Study on the methods of hyperspectral image saliency detection based on MBCNN
5267	--	5282	Yanxiang Li, Wenzhe Meng, Dehua Ma, Siping Xu, Xiaoliang Zhu. MCGFF-Net: a multi-scale context-aware and global feature fusion network for enhanced polyp and skin lesion segmentation
5283	--	5298	Yusong Li, Bin Xie, Yuling Li, Jiahao Zhang. Multi-scale local regional attention fusion using visual transformers for fine-grained image classification
5299	--	5309	Yongpeng Zhao, Guangyuan Zhang, Kefeng Li, Zhenfang Zhu, Xiaotong Li, Yongshuo Zhang, Zhiming Fan. MFADU-Net: an enhanced DoubleU-Net with multi-level feature fusion and atrous decoder for medical image segmentation
5311	--	5322	Meichen Lu, Yi Chai, Kaixiong Xu, Weiqing Chen, Fei Ao, Wen Ji. Multimodal fusion and knowledge distillation for improved anomaly detection
5323	--	5345	Jihua Peng, Yanghong Zhou, P. Y. Mok 0001. EHFusion: an efficient heterogeneous fusion model for group-based 3D human pose estimation
5347	--	5359	Xizhuo Yu, Chaojie Fan, Jiandong Pan, Guoliang Xiang, Chunyang Chen, Tianjian Yu, Yong Peng 0002, Hanwen Deng. X-ray security inspection for real-world rail transit hubs: a wide-ranging dataset and detection model with incremental learning block
5361	--	5371	Junli Shen, Yuman Hai, Chongyu Lin. CT-UFormer: an improved hybrid decoder for image segmentation
5373	--	5389	Yufang Yang, Yining Xie, Jun Cao, Kaihua Yang. Attention-guided dual feature extraction approach for small target detection in infrared images
5391	--	5404	Honglin Wu, Xinyu Yu, Zhaobin Zeng. SSBFNet: a spectral-spatial fusion with BiFormer network for hyperspectral image classification
5405	--	5419	Fangfang Liang, Zilong Huang, Wenjian Wang, Zhenxue He, Qing En. Dynamic text prompt joint multimodal features for accurate plant disease image captioning
5421	--	5433	Wei Cao, Xin Chen, Jianping Lv, Liang Shao, Weixin Si. Semi-supervised intracranial aneurysm segmentation via reliable weight selection
5435	--	5445	Wei-jong Yang, Li-Yang Ho. CSA-Lanenet: a contiguous spatial attention lane detection network with vision transformer modules
5447	--	5459	Simin Yan, Shuchang Xu, Aiping Lei, Sanyuan Zhang. Advancing neural aesthetic assessment of artistic images based on bundle features integration
5461	--	5476	Donghui Wang, Jinhua Wang, Ning He, Jingzun Zhang, Sen Zhang, Shuai Liu. Enhancing unsupervised shadow removal via multi-intensity shadow generation and diffusion modeling
5477	--	5494	Yunfei Lu, Chenxia Chang, Song Gao, Shaowen Yao 0001, Ahmed Zahir. Boosting adversarial example detection via local histogram equalization and spectral feature analysis
5495	--	5515	Canlin Li, Haowen Su, Xin Tan, Lihua Bi, Xiangfei Zhang, Lizhuang Ma. Innovative collaborative multi-lookup table for real-time enhancement of low-light images
5517	--	5537	Zhao Liangjun, Yinqing Wang, Yueming Hu, Hui Dai, Xi Yubin, Feng Ning, He Zhongliang, Gang Liang, Yuanyang Zhang. An image fusion algorithm based on image clustering theory
5539	--	5562	Jie Yin, Tao Sun, Guorong Zhang, Yuhao Wu, Xiao Zhang. Deformation-aware image restoration from atmospheric turbulence based on quasiconformal geometry and pulse-coupled neural network
5563	--	5582	Hongwei Wei, Qi Li, Jie Pan, Junmei Chen, Yizhuo Zhang, Lizhuang Qi, Ying Zhou. SPSNet: semantic-guided perspective shift network for robust person re-identification in drone imagery
5583	--	5596	Shuai Su, Chengju Liu, Qijun Chen. Universally describing keypoints from a semi-global to local perspective, without any specific training
5597	--	5608	Yan Liu, Wenting Qi, Jingwen Wang, Yanqiu Xiao, Guangzhen Cui, Li Han. An efficient defogging network for RAW image sequences with high viewpoint
5609	--	5624	Yiyuan Ge, Mingxin Yu, Zhihao Chen, Wenshuai Lu, Yuxiang Dai, Huiyu Shi. Attention-enhanced controllable disentanglement for cloth-changing person re-identification
5625	--	5641	Maocheng Bai, Xiaosheng Yu, Ying Wang, Jubo Chen, Xiaofeng Zhang, Pengfei Lyu. Enhancing pixel-level analysis in medical imaging through visual instruction tuning: introducing PLAMi
5643	--	5660	Wei Liu, Cong Wang, Yongkang Zhang. Industrial surface defect detection by multi-scale Inpainting-GAN
5661	--	5674	Yanzheng He, Pengjun Wang, Xiaochun Guan, Han Li. Enhancing 3D Human Moiton Prediction with MSIGCN: A Novel Approach to Addressing Sensor Noise and State Accuracy
5675	--	5688	Saba Ghazanfar Ali, Xiangning Wang, Lei Bi 0001, Younhyun Jung, Tingli Chen, Haifang Zhang. Deep learning-based binocular system for automated diabetic retinopathy grading with prior clinical knowledge integration
5689	--	5700	Xuefeng Zhang, Bin Yan, Zhaohu Xing, Feng Gao, Yuandong Tao, Zhenyan Han, Weiming Wang, Lei Zhu 0003. HADiff: hierarchy aggregated diffusion model for pathology image segmentation
5701	--	5718	Zhaobin Chang, Xiong Gao, Dongyi Kong, Na Li, Yonggang Lu. Multi-prototype collaborative perception enhancement network for few-shot semantic segmentation
5719	--	5731	Kunyu Yan, Wenbin Zheng, Yujie Yang. Lightweight weed detection using re-parameterized partial convolution and collection-distribution feature fusion
5733	--	5749	Xin Zhang, Degang Yang, Tingting Song, Yichen Ye, Yingze Song, Jie Zhou, Jie Chen. A lightweight object detector based on changeable-size lightweight convolution and context augmentation module for images captured by UAVs
5751	--	5767	Cuiyun Lin, Chengxue Lao, Tianrun Jing, Wenxiao Wang 0004. Predicting game ownership dynamics: a novel POAFD-trend analysis approach
5769	--	5780	Jiaze He, Jian Xiao, Yuanjie Cao, Jing He, Siyu Li, Jin Huang, Ruhan He, Jianlin Zhu. Region-assisted line drawing colorization through diffusion model
5781	--	5798	Jinsong Zhang, Yu-Kun Lai, Jingyu Yang, Kun Li. PISE-V: person image and video synthesis with decoupled GAN
5799	--	5814	Zheyuan Wang, Ziyao Meng, Yiming Qin. MSPAN: lightweight image super-resolution with multi-semantic guidance
5815	--	5833	Zehao Cao, Zongji Wang, Yuanben Zhang, Cheng Jin, Weinan Cai, Zhihong Zeng, Junyi Liu. Enhancing 3D Gaussian splatting for low-quality images: semantically guided training and unsupervised quality assessment
5835	--	5854	Liangjun Zhao, Xi Yubin, Yinqing Wang, Feng Ning, He Zhongliang, Gang Liang, Yuanyang Zhang. MADNet: cropland change detection network for the complex terrain and dense vegetation hilly region in the Southwestern China
5855	--	5872	Qiaohong Chen, ZhenYang Xu, Xian Fang. CaVMamba: convolution-augmented VMamba for medical image segmentation
5873	--	5889	Runlong Cao, Jianqi Zhang, Yun Shen, Huanhuan Zhou, Peiying Zhou, Guowei Shen, Zhengwen Xia, Ying Zang, Qingshan Liu, Wenjun Hu. Dual-flow feature enhancement network for robust anomaly detection in stainless steel pipe welding
5891	--	5903	Yiming Chen, Yihang Liu, Gizem Kayar-Ceylan. CSG-based ML-supported 3D translation of sketches into game assets for game designers
5905	--	5917	Yuanchuan Duan, Peng Wang, Yan Huang, Yuxin Hang, Qi Sun, Haibo Shao, Jinzhu Yang. Optimizing semi-supervised medical image segmentation with imbalanced filtering and nnU-Net enhancement
5919	--	5933	Pengfei Zhao, Jianhua Ji, Yang Wen, Wuzhen Shi, Wenming Cao 0001. Dual prior guided depth image super-resolution with multi-scale transformer fusion network
5935	--	5947	Yaguang Lu, Yong Hu, Huiyan Feng, Pengshuai Duan, Xukun Shen. Generating reconstructable collaborative virtual environments via graph matching for mixed reality remote collaboration
5949	--	5960	Yingjie Fan, Bin Wen, Hongfei Deng. MRA-Net: an instance segmentation method based on multi-scale feature fusion for ethnic costumes images
5961	--	5977	Zhangmeng Chen, Ju Dai, JunJun Pan, Feng Zhou 0007. Diffusion model with temporal constraint for 3D human pose estimation
5979	--	5993	Zhenmin Yao, QianQian Hu. Accelerated local progressive-iterative approximation methods for curve and surface fitting
5995	--	6009	Ahmet Agaoglu, Nezih Topaloglu. Dynamic region of interest generation for maritime horizon line detection using time series analysis
6011	--	6025	Hu Wang, Hong-mei Sun, Wen-Long Zhang, Yu-Xiang Chen, Rui-sheng Jia. FANN: a novel frame attention neural network for student engagement recognition in facial video
6027	--	6039	Tongtong Liu, Chen Yang, Guoqiang Chen, Wenhui Li. Open-vocabulary multi-label classification with visual and textual features fusion
6041	--	6054	Shang Ma, Xiaoying Nie, Gang Yang, Chunqing Zhou. A robust and efficient model for the interaction of fluids with deformable solids
6055	--	6065	Guoyou Zhang, Zhixiang Hao, Lihu Pan, Wei Guo, Jiaxin Zuo, Xuenan Zhang. MeshBLS: mesh-based broad learning 3D object classification network
6067	--	6085	YaJuan Zhang, Yongquan Liang, Junjie Wang, Houying Zhu, Zhihui Wang 0003. Enhanced multi-object tracking via embedded graph matching and differentiable Sinkhorn assignment: addressing challenges in occlusion and varying object appearances
6087	--	6102	Xiao Li, Kai Wu, Haoran Chen, Wenjun Song, Hongwei Tao, Zuhe Li, Yanan Du. Deep residual PLSR model with manifold optimization and Gaussian filter for enhanced image classification
6103	--	6120	Hongzhi Li, Zhanghao Ren, Guoqing Zhu, Yaoju Liang, Han Cui, Chaozeyu Wang, Jiaxi Wang. Enhancing medical image segmentation with MA-UNet: a multi-scale attention framework
6121	--	6132	Jianbing Xu, Jiangxin Zhou, Dongxu Xu, Yu Chen. Local dual-branch attention feature learning framework from UAVs for visual defect detection
6133	--	6148	Zhanqiang Huo, Xiyan Zhan, Yingxu Qiao, Shan Zhao. D3-Dehaze: a divide-and-conquer framework for enhanced single image dehazing
6149	--	6167	Jingya Shi, Dezhi Han, Chongqing Chen, Xiang Shen. SAFFNet: self-attention based on Fourier frequency domain filter network for visual question answering
6169	--	6185	Xiaodong Wang, Jiangtao Fan, Fei Yan, Hongmin Hu, Zhiqiang Zeng, Haiyan Huang. Unsupervised fur anomaly detection with B-spline noise-guided Multi-directional Feature Aggregation
6187	--	6199	Tang Xu, Wenbin Wang 0001, Alin Zhong. HOIEdit: Human-object interaction editing with text-to-image diffusion model
6201	--	6217	Xiangyang Wang 0003, Kun Yang, Qiang Ding, Rui Wang 0034, Jinhua Sun. Tic action recognition for children tic disorder with end-to-end video semi-supervised learning
6219	--	6235	Elmira Bagheri, Amir Hossein Barshooi. Nighttime driver behavior prediction using taillight signal recognition via CNN-SVM classifier
6237	--	6249	Yanmei Li, Tao Yu, Jian Luo, Xiaoshuang Li, Jingshi Deng, Qibin Yang. JLEDNet: a nighttime UAV tracking method through joint low-light image enhancement using hybrid attention transformer and denoising
6251	--	6269	V. Karthikeyan 0004, S. Praveen, S. Sudeep Nandan. Lightweight deep hybrid CNN with attention mechanism for enhanced underwater image restoration
6271	--	6297	Qian Ye, Qingwu Li, Guanying Huo, Yan Liu, Yan Zhou. Boundary-guided multi-scale refinement network for camouflaged object detection
6299	--	6312	Qiuquan Zhao, Jianyuan Li. SPS-UNet: a super-pixel sampling UNet for extracting buildings from high-resolution satellite images
6313	--	6326	Enze Yang, Yuxin Liu, Shitao Zhao, Yiran Liu, Shuoyan Liu. Learn from restoration: exploiting task-oriented knowledge distillation in self-supervised person re-identification
6327	--	0	Daniel Jiménez Navarro, Ana Serrano, Sandra Malpica. Correction to: Minimally disruptive auditory cues: their impact on visual performance in virtual reality
6329	--	0	Satoshi Nishimura. Correction: Grid-induced bounding volume hierarchy for ray tracing dynamic scenes

4395	--	4403	Long Zhang, Qinghua Zhou, Shuai Tang, Yunxiang Chen. High-definition multi-scale voice-driven facial animation: enhancing lip-sync clarity and image detail
4405	--	4418	Qiaohong Chen, Shufan Xie, Xian Fang, Qi Sun. CTHFNet: contrastive translation and hierarchical fusion network for text-video-audio sentiment analysis
4419	--	4430	Xuanpeng Li, Hengshuo Cao, Jinming Li, Guangyu Li, Lin Zhao. A shoreline extraction method based on dual-loop network framework
4431	--	4448	Viktor Leonhardt, Alexander Wiebel, Christoph Garth. A framework for visual comparison of scalar fields with uncertainty
4449	--	4461	Ye Liu, Lei Zhu, Liang Wan, Xing Wang. Masked frequency-color fusion network for video instance-level hazy lane detection
4463	--	4480	Jibing Peng, Yaohua Yi, Ying Zhou. DPDTRN: a dynamic pixel-level difficulty-aware texture reconstruction network for document super-resolution
4481	--	4495	Huangyuan Wu, Bin Li, Lianfang Tian, Chao Dong. DDFA: a displacement and diffusion-based feature augmentation method for imbalanced image recognition
4497	--	4515	Yunfei Qiu, Shuai Jiao, Qingtang Su. Enhancing color image watermarking via fast quaternion Schur decomposition: a high-quality blind approach
4517	--	4532	Rui Sun, Xiaolu Yu, Huidong Feng, Fei Wang, Xudong Zhang. Motion-robust mask face presentation attack detection via dual-stream texture-rPPG network
4533	--	4546	Zhiwen Shao, Yifan Cheng, Yong Zhou 0003, Xiang Xiang 0001, Jian Li 0054, Bing Liu 0016, Dit-Yan Yeung. High-level LoRA and hierarchical fusion for enhanced micro-expression recognition
4547	--	4565	Kesai Wang, Xifan Yao, Nanfeng Ma, Guangjun Ran. PLMOT-SLAM: a point-line features fusion SLAM system with moving object tracking
4567	--	4580	Ping Lu, Youcheng Cai, Jiale Yang, Dong Wang, Tingting Wu. Uanet: uncertainty-aware cost volume aggregation-based multi-view stereo for 3D reconstruction
4581	--	4601	Zhengyan Liu, Huiwen Wang, Lihong Wang, Shanshan Wang. Locality-constrained double-layer structure scaled simplex multi-view subspace clustering
4603	--	4621	Tianxiang Huo, Zhenqi Liu, Shichao Zhang, Jiening Wu, Rui Yuan, Shukai Duan 0001, Lidan Wang 0001. CDNet: object detection based on cross-level aggregation and deformable attention for UAV aerial images
4623	--	4637	Krishnendu Maity, Susanta Mukhopadhyay. LPSIS: a lossless secret image sharing scheme based on Legendre polynomials with low-cost reconstruction
4639	--	4660	Yuesong Tian, Li Shen 0008, Xiang Tian 0002, Dacheng Tao, Zhifeng Li 0001, Wei Liu 0005, Yaowu Chen. DGL-GAN: discriminator-guided GAN compression
4661	--	4672	Javed Aymat Husen Shaikh, Shailendrakumar M. Mukane, Santosh Nagnath Randive. Lightweight progressive recurrent network for video de-hazing in adverse weather conditions
4673	--	4686	Jinchang Zhu, Dayang Sun, Yu Cheng, Hailong Wang, Yujing Chen, Yaowei Chen. GaitHF: enhancing appearance-based gait recognition through height fused images
4687	--	4702	Wanjun Zhong, Haohao Hu, Yuerong Wang, Li Li, Tianyu Han, Chunyong Li, Peng Zan. Hierarchical evidence aggregation in two dimensions for active water surface object detection
4703	--	4722	Julien Thomas, Boyu Kuang, Yizhong Wang, Stuart Barnes, Karl Jenkins. Advanced semantic segmentation of aircraft main components based on transfer learning and data-driven approach
4723	--	4739	Hongfei Li, Xueyang Li. Dim and small objects detection in aerial images with stacked attention mechanism and improved loss function
4741	--	4758	Yanliang Ge, Junchao Ren, Cong Zhang, Min He, Hongbo Bi, Qiao Zhang. Feature-aware and iterative refinement network for camouflaged object detection
4759	--	4778	Mohamad Haniff Junos, Anis Salwa Mohd Khairuddin. YOLO-MMS for aerial object detection model based on hybrid feature extractor and improved multi-scale prediction
4779	--	4798	Sardor Mamarasulov, Lianggangxu Chen, Changgu Chen, Yang Li, Changbo Wang. Data augmentation with attention framework for robust deepfake detection
4799	--	4813	Jian Ni, Zheng Wang, Yixiao Wang, Wenjian Tao, Ao Shen. DRCL: rethinking jigsaw puzzles for unsupervised medical image segmentation
4815	--	4838	Huanshuo Zhang, Guobiao Ren. Intelligent leaf disease diagnosis: image algorithms using Swin Transformer and federated learning
4839	--	4850	Václav Skala. A new fully projective O(log N) point-in-convex polygon algorithm: a new strategy
4851	--	4864	Jianuo Wang, Huawei Li, Yumin Chen. Seg-invRender: fusing semantic segmentation based on NeRF for inverse rendering considering shadows
4865	--	4877	Wuzhen Shi, Aixue Yin, Yingxiang Li, Bo Qian. Cross-view Transformer for enhanced multi-view 3D reconstruction
4879	--	4892	Jiaxing Yu, Zheng Chen 0014, Jingkai Wang, Linghe Kong, Jiajie Yan, Wei Gu. Enhancing Image Super-Resolution with Dual Compression Transformer
4893	--	4914	Saleha Masood, Mousa Ahmad Al Bashrawi, Muhammad Attique Khan, Anam Nazir. Exploring ChatGPT applications in healthcare: a comprehensive overview
4915	--	4930	Yaqi Sun, Xiaolan Xie, Zhi Li, Huihuang Zhao. Image style transfer with saliency constrained and SIFT feature fusion
4931	--	4955	Zean Jin, Yulong Bai, Wei Song, Qinghe Yu, Xiaoxin Yue. EduCodeVR: VR for programming teaching through simulated farm and traffic
4957	--	4974	Zeyu Cai, Ziyu Zhang, Chengqian Jin, Feipeng Da. DMDC: a cross-attention network for dynamic mask-based dual-camera snapshot hyperspectral Photography
4975	--	4990	Baokai Zu, Tong Cao, Yafang Li, Jianqiang Li 0002, Hongyuan Wang, Quanzeng Wang. RESwinT: enhanced pollen image classification with parallel window transformer and coordinate attention
4991	--	5003	Yaqian Li, Xin Zhan, Haibin Li, Wenming Zhang. Selection and guidance: high-dimensional identity consistency preservation for face inpainting
5005	--	5017	Yang Yang, Changming Zhu. Deep multi-view clustering based on global hybrid alignment with cross-contrastive learning
5019	--	5028	Tiago Madeira, Miguel Oliveira 0001, Paulo Dias. Reflection-aware 3D mirror segmentation and pose estimation
5029	--	5041	Tao Shi, Yao Ding, Kui-feng Zhu, Yan-jie Su. DFP-YOLO: a lightweight machine tool workpiece defect detection algorithm based on computer vision
5043	--	5057	Congying An, Jingjing Wu, Huanlong Zhang. Occlusion-aware segmentation via RCF-Pix2Pix generative network
5059	--	5073	Daniel Jiménez Navarro, Ana Serrano, Sandra Malpica. Minimally disruptive auditory cues: their impact on visual performance in virtual reality
5075	--	5086	Zidi Cao, Jiayi Han, Sipeng Yang, Xiaogang Jin 0001. Fast best viewpoint selection with geometry-enhanced multiple views and cross-modal distillation
5087	--	5104	Hongru Wang, Hu Cheng, Jingtao Zhang. Faster-PGYOLO: an efficient framework for floating debris detection in inland waters
5105	--	5121	Yanchen Liu, Changming Zhu. DMVMLC-VT: Deep incomplete multi-view multi-label image classification with view translation and pseudo-label enhancement
5123	--	5134	Miao Yang, Meng Yang 0011, Weiliang Meng, Ping Li 0016, Zhen Li. Msc-Net: multi-stage colorization network for real-world images with specular highlights
5135	--	5151	Kexuan Wang, Chenhua Liu, Rongfu Zhang. CMA-SOD: cross-modal attention fusion network for RGB-D salient object detection
5153	--	5169	Yanliang Ge, Taichuan Liang, Junchao Ren, Jiaxue Chen, Hongbo Bi. Enhanced salient object detection in remote sensing images via dual-stream semantic interactive network
5171	--	5187	Jianguo Ning, Lei Zhang, Xiangzhao Xu. Virtual simulation for the dynamic response of concrete blocks under blast loading
5189	--	5203	Shue Liu, Siwei Zhao, Yiying Wang, Jiaming Xin, Dashe Li. An enhanced underwater fish segmentation method in complex scenes using Swin transformer with cross-scale feature fusion
5205	--	5221	Zewei Zhao, Xiaotie Ma, Yingjie Shi, Xiaotong Yang. Multi-scale defect detection for plaid fabrics using scale sequence feature fusion and triple encoding

3679	--	3693	Zhaijuan Ding, Yanyu Liu, Sen Liu, Kangjian He, Dongming Zhou. $\hbox {KD}^{3}$mt: knowledge distillation-driven dynamic mixer transformer for medical image fusion
3695	--	3717	Lin Wang, Jie Li, Chun Qi, Fengping Wang, Pan Wang 0004. Progressive Crowd Enhancement De-Background Network for crowd counting
3719	--	3734	Baoan Li, Long Zhang, Shangzhi Teng, Xueqiang Lyu. Attribute correlation mask fusion network for pedestrian attribute recognition
3735	--	3783	Yasmin M. Alsakar, Nehal A. Sakr, Shaker H. Ali El-Sappagh, Tamer AbuHmed, Mohammed Elmogy. Underwater image restoration and enhancement: a comprehensive review of recent trends, challenges, and applications
3785	--	3800	Xiaopan Li, Shiqian Wu, Xin Yuan, Shoulie Xie, Sos S. Agaian. Hierarchical wavelet-guided diffusion model for single image deblurring
3801	--	3827	Yawen Xiang, Heng Zhou 0006, Chengyang Li, Fangwei Sun, Zhongbo Li, Yongqiang Xie. Deep learning in motion deblurring: current status, benchmarks and future prospects
3829	--	3842	Yunxi Chen, Yuanjie Cao, Fei Fang, Jin Huang, Xinrong Hu, Ruhan He, Junjie Zhang. SACANet: end-to-end self-attention-based network for 3D clothing animation
3843	--	3852	Yuanjie Dang, Jiangyun Chen, Peng Chen 0008, Nan Gao, Ruohong Huan, Dongdong Zhao. Generate anomalies from normal: a partial pseudo-anomaly augmented approach for video anomaly detection
3853	--	3865	Qian Wan, Bin Zhou, Yanjiang Wang. BSCGAN: structured minority class image generation under class-balanced pretraining
3867	--	3882	Shize Wang, Gang Wu, Jin Wang, Qing Zhu, Yunhui Shi, Baocai Yin. SBC-Net: semantic-guided brightness curve estimation network for low-light image enhancement
3883	--	3906	Xinzhe Xie, Buyu Guo, Peiliang Li 0003, Shuangyan He, Sangjun Zhou. SwinMFF: toward high-fidelity end-to-end multi-focus image fusion via swin transformer-based network
3907	--	3923	Zitao Gao, Xiangjian Liu, Anna K. Wang, Liyu Lin. A simulated two-stream network via multilevel distillation of reviewed features and decoupled logits for video action recognition
3925	--	3955	Ronghui Feng, Yuefei Wang, Jiajing Xue, Yuquan Xu, Yutong Zhang, Xi Yu. CLAC-Net: a composite medical image segmentation framework using self-attention and cross-layer asymmetric connections
3957	--	3972	Guowen Yue, Ge Jiao, Chen Li, Jiahao Xiang. When CNN meet with ViT: decision-level feature fusion for camouflaged object detection
3973	--	4000	Shuo Yang, Xiaoling Gu, Zhenzhong Kuang, Feiwei Qin, Zizhao Wu. Innovative AI techniques for photorealistic 3D clothed human reconstruction from monocular images or videos: a survey
4001	--	4016	Chen Li, Weiqi Yan, Hongwei Zhao, Shihua Zhou, Yueping Wang. TFFD-Net: an effective two-stage mixed feature fusion and detail recovery dehazing network
4017	--	4031	Kailin Liu, Yonghong Hou, Zihui Guo, Wenjie Yin, Yi Ren. Visual context learning based on cross-modal knowledge for continuous sign language recognition
4033	--	4045	Qiang Cen, Qiguang Zhu, Yuxin Wang, Weidong Chen 0001, Shuo Liu. YOLOv9-YX: lightweight algorithm for underwater target detection
4047	--	4066	Le-Anh Tran, Dong-Chul Park 0002. Lightweight image dehazing networks based on soft knowledge distillation
4067	--	4079	Haiyuan Cao, Deng Chen, Yanduo Zhang, Huabing Zhou, Dawei Wen, Congcong Cao. MFINet: a multi-scale feature interaction network for point cloud registration
4081	--	4095	Libo Sun, Jiahui Yan, Yongchun Qiu, Wenhu Qin. The crowd cooperation approach for formation maintenance and collision avoidance using multi-agent deep reinforcement learning
4097	--	4110	Guowei An, Yaonan Wang 0001, Kai Zeng 0010, Qing Zhu, Xiaofang Yuan. Deep spatial and discriminative feature enhancement network for stereo matching
4111	--	4127	Qiyang Liu, Yun Ge, Sijia Wang, Ting Wang, Jinlong Xu. Dynamic manifold-based sample selection in contrastive learning for remote sensing image retrieval
4129	--	4141	Ziwei Zeng, Lihong Li, Zoufei Zhao, Qingqing Liu. Improved fine-grained image classification in few-shot learning based on channel-spatial attention and grouped bilinear convolution
4143	--	4156	Yiqian Huang 0004, Shuqi Liu, Fei Dong, Xu Li, Xin Yang 0021, Ya Zhou, Jinxiang Huang, Yong Song. PL-MCT: pseudo-labeling and multi-frame consistency training for semi-supervised visual tracking
4157	--	4169	Yong Zhang, Qingguo Shan, Wenyun Chen, Wenzhe Liu. EEG emotion recognition approach using multi-scale convolution and feature fusion
4171	--	4181	Guowei Zhang, Weidong Zhang, Wuzhi Li, Li Wang, Huankang Cui. A dynamic attention mechanism for object detection in road or strip environments
4183	--	4198	Youjie Zhou, Runyu Jiao, Zhonghan Tao, Xichang Liang, Yi Wan 0002. Spatial-frequency attention-based optical and scene flow with cross-modal knowledge distillation
4199	--	4220	Pham Thanh Huu, Nguyen Thai An, Nguyen Ngoc Trung, Huynh Ngoc Thien, Nguyen Sy Duc, Nguyen Thi Ty. Judicial decision prediction using an integrated attention based bidirectional long-short term memory and dilated skip residual convolution neural network
4221	--	4238	Xinbiao Lu, Gaofan Zhan, Wen Wu, Wentao Zhang, Xiaolong Wu, Changjiang Han. Van-DETR: enhanced real-time object detection with vanillanet and advanced feature fusion
4239	--	4252	Chenchen Xu, Kaixin Han, Weiwei Xu. Image-aware layout generation with user constraints for poster design
4253	--	4267	Zhen Huang, Yongjian Zhu, Qiao Zhang, Hongyan Zang, Tengfei Lei. Exploration, fusion, and refinement: a multivariate features interaction network for visual camouflaged detection
4269	--	4285	Yongbo Yu, Weidong Li, Linyan Bai, Jinlong Duan, Xuehai Zhang. UTDM: a universal transformer-based diffusion model for multi-weather-degraded images restoration
4287	--	4300	Liping Zhu, Haibo Zhou, Silin Wu, Tianrong Cheng, Hongjun Sun. Polynomial for real-time rendering of neural radiance fields
4301	--	4320	Yong Zhang, Da Liu, Li Jiang, Huibing Wang, Wenzhe Liu. Feature decomposition and structural learning for multi-diverse and multi-view data clustering
4321	--	4346	Pengjie Liu, Yanzhan Chen, Fan Yu, Qian Zhang. Mastering adverse weather: a two-stage approach for robust semantic segmentation in autonomous driving
4347	--	4361	Yuqi Xiao, Yongjun Wu. A dual-channel correlation filtering tracker for real-time tracking based on deep features of improved CaffeNet and integrated manual features
4363	--	4376	Dejin Zhao, Yunjie Ma, Xiaolong Yuan, Tong Tong, Dechao Wang, Rui Sun, Lili Cheng, Jianhai Zhang. SME: Spatial multi-scale enhanced attention for automated detection of micro-defect on automobile complex paint surfaces
4377	--	4392	Yuanhong Zhong, Ting Chen, Daidi Zhong, Xiaoming Liu. Wavelet-guided network with fine-grained feature extraction for vessel segmentation
4393	--	4394	Ling-Xiao Qin, Hong-mei Sun, Xiao-Meng Duan, Cheng-Yue Che, Rui-sheng Jia. Correction: Adaptive learning-enhanced lightweight network for real-time vehicle density estimation

3003	--	3015	Liang Zhang, Shifeng Li, Xi Luo, Xiaoru Liu, Ruixuan Zhang. Video anomaly detection with both normal and anomaly memory modules
3017	--	3035	Hong Zhao, Wengai Li, Dailin Huang, Jinhai Huang, Lijun Zhang. M-GAN: multiattribute learning and multimodal feature fusion-based generative adversarial network for text-to-image synthesis
3037	--	3058	Xunan Tan, Xiang Suo, Wenjun Li, Lei Bi, Fangshu Yao. Data visualization in healthcare and medicine: a survey
3059	--	3076	Junding Sun, Chenxu Wang, Haifeng Sima, Xiaosheng Wu, Shuihua Wang, Yudong Zhang. Mfpenet: multistage foreground-perception enhancement network for remote-sensing scene classification
3077	--	3093	R. Varun Prakash, V. Karthikeyan 0005, S. Vishali, M. Karthika. Multi-level LSTM framework with hybrid sonic features for human-animal conflict evasion
3095	--	3107	Xintao Liu, Yan Gao, Changqing Zhan, Qiao Wangr, Yu Zhang, Yi He, Hongyan Quan. Directional latent space representation for medical image segmentation
3109	--	3128	Yan Zhou 0003, Haibin Zhou, Yin Yang, Jianxun Li, Richard Irampaye, Dongli Wang, Zhengpeng Zhang. Lunet: an enhanced upsampling fusion network with efficient self-attention for semantic segmentation
3129	--	3142	Fengling Li, Zheng Yang, Yan Gui. SES-yolov5: small object graphics detection and visualization applications
3143	--	3154	Xiaoying Chen, Weijie Ye. Dual representations network for few-shot learning based on local descriptor importance: integrating global and local features
3155	--	3171	Zezheng Tang, Yihua Wu, Xinming Xu. The study of recognizing ripe strawberries based on the improved YOLOv7-Tiny model
3173	--	3188	Daipeng Yang, Bo Peng 0006, Xi Wu. A bio-inspired edge and segment detection method by modeling multiple visual regions
3189	--	3204	Jianjun Zhu, Huihuang Zhao, Yudong Zhang. Filter-deform attention GAN: constructing human motion videos from few images
3205	--	3219	Mingjian Li, Younhyun Jung, Shaoli Song, Jinman Kim. Attention-driven visual emphasis for medical volumetric image visualization
3221	--	3238	Jun Wang, Honghui Cao, Chenhao Sun, Ziqing Huang, Yonghua Zhang. Motion perception-driven multimodal self-supervised video object segmentation
3239	--	3261	Gang Chen, Wenju Wang, Haoran Zhou, Xiaolin Wang. EGCT: enhanced graph convolutional transformer for 3D point cloud representation learning
3263	--	3281	Haojie Gao, Peishun Liu, Xiaolong Ma, Zikang Yan, Ningning Ma, Wenqiang Liu, Xuefang Wang, Ruichun Tang. TP-LSM: visual temporal pyramidal time modeling network to multi-label action detection in image-based AI
3283	--	3295	Guowei Zhang, Wuzhi Li, Yutong Tang, Shuixuan Chen, Li Wang. Lightweight CNN-ViT with cross-module representational constraint for express parcel detection
3297	--	3308	Jianglei Ye, Yigang Wang, Fengmao Xie, Qin Wang, Xiaoling Gu, Zizhao Wu. Slot-VTON: subject-driven diffusion-based virtual try-on with slot attention
3309	--	3327	Xingquan Cai, Haoyu Zhang, Lizhe Chen, Yijie Wu, Haiyan Sun. 3D human pose estimation using spatiotemporal hypergraphs and its public benchmark on opera videos
3329	--	3344	Zhiyuan Li, Xin Jin 0005, Qian Jiang, Puming Wang, Shin-Jye Lee, Shaowen Yao 0001, Wei Zhou 0011. Crafting imperceptible and transferable adversarial examples: leveraging conditional residual generator and wavelet transforms to deceive deepfake detection
3345	--	3357	Wan-He Kai, Kai-Xin Xing. Video-driven musical composition using large language model with memory-augmented state space
3359	--	3370	Wenzhe Shi, Ziqi Hu, Hao Chen, Hengjia Zhang, Jiale Yang, Li Li. Orhlr-net: one-stage residual learning network for joint single-image specular highlight detection and removal
3371	--	3412	Xu Liu, Tong Zhou, Chong Wang, Yuping Wang, Yuanxin Wang 0001, Qinjingwen Cao, Weizhi Du, Yonghuan Yang, Junjun He, Yu Qiao, Yiqing Shen 0003. Toward the unification of generative and discriminative visual foundation model: a survey
3413	--	3422	Yaping Deng, Yingjiang Li, Zibo Wei, Keying Li. GLDC: combining global and local consistency of multibranch depth completion
3423	--	3435	Weifeng Cao, Xiaoyan Lei, Jun Shi, Wanyong Liang, Jie Liu, Zongfei Bai. HASN: hybrid attention separable network for efficient image super-resolution
3437	--	3455	Sunhan Xu, Jinhua Wang, Ning He, Guangmei Xu, Geng Zhang. Optimizing underwater image enhancement: integrating semi-supervised learning and multi-scale aggregated attention
3457	--	3472	Yazhuo Fan, Jianhua Song, Lei Yuan, Yunlin Jia. HCT-Unet: multi-target medical image segmentation via a hybrid CNN-transformer Unet incorporating multi-axis gated multi-layer perceptron
3473	--	3486	Muhammad Fahad 0013, Tao Zhang 0025, Yasir Iqbal, Azaz Ikram, Fazeela Siddiqui, Bin Younas Abdullah, Malik Muhammad Nauman, Xin Zhao 0006, Yanzhang Geng. Advanced deepfake detection with enhanced Resnet-18 and multilayer CNN max pooling
3487	--	3501	Jiajun Yang, Xuesong Zhang, Cunli Song. Research on a small target object detection method for aerial photography based on improved YOLOv7
3503	--	3518	Pengbo Bo, Qingxiang Liu, Caiming Zhang. Topological structure extraction for computing surface-surface intersection curves
3519	--	3535	Wenji Yang, Hang An, Wenchao Hu, Xinxin Ma, Liping Xie. Text-guided floral image generation based on lightweight deep attention feature fusion GAN
3537	--	3551	Ali Salar, Ali Ahmadi. Enhancing high-vocabulary image annotation with a novel attention-based pooling
3553	--	3564	Yiting Wu, Pinqi Fang, Xiangning Wang, Jie Shen. Predicting pancreatic diseases from fundus images using deep learning
3565	--	3580	Shunzhou Wang, Yao Lu, Wang Xia, Peiqi Xia, Ziqi Wang, Wei Gao. Light field angular super-resolution by view-specific queries
3581	--	3593	Xiaohu Wang, Xin Yang, Hengrui Li, Tao Li. FDDCC-VSR: a lightweight video super-resolution network based on deformable 3D convolution and cheap convolution
3595	--	3610	Minsoo Choi, Christos Mousas, Nicoletta Adamo, Sanjeevani Patankar, Klay Hauser, Fangzheng Zhao, Richard E. Mayer. ASAP: animation system for agent-based presentations
3611	--	3626	Dinghao Guo, Dali Chen, Xin Lin, Zheng Xue, Wei Zheng, Xianling Li. Semi-supervised image semantic segmentation method with semantic regions patching and uncertainty-guided loss
3627	--	3644	Yating Liu, ChengDong Lan, Wanjian Feng. DLKN: enhanced lightweight image super-resolution with dynamic large kernel network
3645	--	3662	Andrea Bodonyi, István Csoba, Roland Kunkli. Real-time ray transfer for lens flare rendering using sparse polynomials
3663	--	3678	Shijie Li, Shanhua Yao, Zhonggen Wang, Juan Wu. FFCANet: a frequency channel fusion coordinate attention mechanism network for lane detection

2065	--	2077	Hanqin Wang, Alexei Sourin. Visual signatures for music mood and timbre
2079	--	2089	Khawla Ben Salah, Mohamed Othmani, Jihen Fourati, Monji Kherallah. Advancing spatial mapping for satellite image road segmentation with multi-head attention
2091	--	2105	Mikolaj Maik, Jakub Flotynski, Krzysztof Walczak 0001. Knowledge-based approach to adaptive XR interface design for non-programmers
2107	--	2122	Max Reimann, Martin Büßemeyer, Benito Buchheim, Amir Semmo, Jürgen Döllner, Matthias Trapp 0001. Artistic style decomposition for texture and shape editing
2123	--	2142	Hiba Mzoughi, Ines Njeh, Mohamed Ben Slima, Nouha Farhat, Chokri Mhiri. Vision transformers (ViT) and deep convolutional neural network (D-CNN)-based models for MRI brain primary tumors images multi-classification supported by explainable artificial intelligence (XAI)
2143	--	2156	Dingning Long, Rongrong Chen. Cognitive capacity and aesthetics: the influence of visual working memory on landscape ink painting preference
2157	--	2169	Liangwei Wang, Zhan Wang, Xi Zhao 0003, Fugee Tsung, Wei Zeng 0004. Antarctica storytelling: creating interactive story maps for polar regions with graphic-based approach
2171	--	2185	Chuang Wu, Tingqin He. Efficient minor defects detection on steel surface via res-attention and position encoding
2187	--	2202	Junjie Zhang 0002, Yi Lin, Xin Zhou, Pangrong Shi, Xiaoqiang Zhu, Dan Zeng 0001. Precision in pursuit: a multi-consistency joint approach for infrared anti-UAV tracking
2203	--	2217	Jiayi Xu 0002, Xuan Tan, Yixuan Ju, Xiaoyang Mao, Shanqing Zhang. High similarity controllable face anonymization based on dynamic identity perception
2219	--	2232	Mohamed ElSayed, Mohamed Reda, Ahmed S. Mashaly, Ahmed Saleh 0004. LERFNet: an enlarged effective receptive field backbone network for enhancing visual drone detection
2233	--	2249	Jialin Zhu, He Wang 0002, David Hogg 0001, Tom Kelly. Learning to sculpt neural cityscapes
2251	--	2270	Suresh Cheekaty, G. Muneeswari. Advancing autism prediction through visual-based AI approaches: integrating advanced eye movement analysis and shape recognition with Kalman filtering
2271	--	2283	Huaping Zhou, Bin Deng, Kelei Sun, Shunxiang Zhang, Yongqi Zhang. UTE-CrackNet: transformer-guided and edge feature extraction U-shaped road crack image segmentation
2285	--	2297	Xiaoyang Zhao, Zhuo Wang 0008, Zhongchao Deng, Hongde Qin, Zhongben Zhu. Transmission-guided multi-feature fusion Dehaze network
2299	--	2322	Randa I. Elanwar, Margrit Betke. Generative adversarial networks for handwriting image generation: a review
2323	--	2337	Yixi Li, Yanzhe Liu, Rong Chen 0003, Hui Li, Na Zhao. Point cloud upsampling via a coarse-to-fine network with transformer-encoder
2339	--	2376	Neil Patrick Del Gallego, Joel Ilao, Macario O. Cordel II, Conrado R. Ruiz Jr.. Training a shadow removal network using only 3D primitive occluders
2377	--	2390	Qunpo Liu, Qi Tang, Bo Su, Xuhui Bu, Naohiko Hanajima, Manli Wang. Wire rope damage detection based on a uniform-complementary binary pattern with exponentially weighted guide image filtering
2391	--	2408	Jianjian Jiang, Ziwei Chen, Fangyuan Lei, Long Xu, Jiahao Huang, Xiaochen Yuan. Multi-granularity hypergraph-guided transformer learning framework for visual classification
2409	--	2424	Yueqian Pan, Qiaohong Chen, Xian Fang. DAMAF: dual attention network with multi-level adaptive complementary fusion for medical image segmentation
2425	--	2437	Wei Li, Bowen Li, Jingqi Wang, Weiliang Meng, Jiguang Zhang, Xiaopeng Zhang 0001. ROMOT: Referring-expression-comprehension open-set multi-object tracking
2439	--	2459	Longfeng Shen, Bin Hou, Yulei Jian, Xisong Tu, Yingjie Zhang, Lingying Shuai, Fangzhen Ge, Debao Chen. TransFGVC: transformer-based fine-grained visual classification
2461	--	2476	Avantika Saklani, Shailendra Tiwari, H. S. Pannu. Deep attentive multimodal learning for food information enhancement via early-stage heterogeneous fusion
2477	--	2493	Xiang Suo, Weidi Tang, Lijuan Mao, Zhen Li. Digital human and embodied intelligence for sports science: advancements, opportunities and prospects
2495	--	2510	Jiaxuan Zhu, Ming Shao, Libo Sun, Siyu Xia. ACL-SAR: model agnostic adversarial contrastive learning for robust skeleton-based action recognition
2511	--	2527	Jiayan Wen, YuanSheng Zhuang, Junyi Deng. EDM: a enhanced diffusion models for image restoration in complex scenes
2529	--	2544	Canlin Li, Xinyue Wang, Ran Yi, Wenjiao Zhang, Lihua Bi, Lizhuang Ma. MCLGAN: a multi-style cartoonization method based on style condition information
2545	--	2561	Haobo Dong, Tianyu Song 0003, Xuanyu Qi, Jiyu Jin, Guiyue Jin, Lei Fan 0004. Exploring high-quality image deraining Transformer via effective large kernel attention
2563	--	2594	Surendrabikram Thapa, Abhijit Sarkar. A deep dive into enhancing sharing of naturalistic driving data through face deidentification
2595	--	2605	Runtao Xi, Jiahao Lyu 0001, Kang Sun, Tian Ma. Learning kernel parameter lookup tables to implement adaptive bilateral filtering
2607	--	2627	Yi-Lun Wang, Yi-zheng Lang, Yunsheng Qian. Effective multi-scale enhancement fusion method for low-light images based on interest-area perception OCTM and "pixel healthiness" evaluation
2629	--	2638	Alireza Dehghanpour, Zahra Sharifi, Masoud Dehyadegari. Point cloud downsampling based on the transformer features
2639	--	2654	Yabo Wu, Wenting Li, Ziyang Chen 0002, Hui-Wen, Zhongwei Cui, Yongjun Zhang 0007. Distribution-decouple learning network: an innovative approach for single image dehazing with spatial and frequency decoupling
2655	--	2667	Yumei Tan, Haiying Xia, Shuxiang Song 0001. Robust consistency learning for facial expression recognition under label noise
2669	--	2690	Wen-Kai Tsai, Hsin-Chih Wang. Real-time salient object detection based on accuracy background and salient path source selection
2691	--	2708	Nauman Ullah Gilal, Marwa K. Qaraqe, Jens Schneider 0002, Marco Agus. Autocleandeepfood: auto-cleaning and data balancing transfer learning for regional gastronomy food computing
2709	--	2720	Ying Ni, Xiaoli Wang, Hanghang Peng, Yonzhi Li, Jinyang Wang, Haoxuan Li, Jin Huang. Dual-branch dilated context convolutional for table detection transformer in the document images
2721	--	2736	Yubo Zhang, Lei Xu, Haibin Xiang, Haihua Kong, Junhao Bi, Chao Han. LKSMN: Large Kernel Spatial Modulation Network for Lightweight Image Super-Resolution
2737	--	2754	Xiaoyu Song, Dezhi Han, Chongqing Chen, Xiang Shen, Huafeng Wu. Vman: visual-modified attention network for multimodal paradigms
2755	--	2766	Zekang Liu, Wei Feng 0005, Liqing Gao, Lianyu Hu 0003. DBL-SC: background-independent sign language recognition based on spatial channel separation computation
2767	--	2782	Ze Ouyang, Huihuang Zhao, Yudong Zhang, Long Chen. STVDNet: spatio-temporal interactive video de-raining network
2783	--	2800	R. Raja Sekar, T. Dhiliphan Rajkumar, Koteswara Rao Anne. Deep fake detection using an optimal deep learning model with multi head attention-based feature extraction scheme
2801	--	2815	Lirong Li, Jiang Ding, Hao Cui, Zhiqiang Chen, Guisheng Liao. LiteMSNet: a lightweight semantic segmentation network with multi-scale feature extraction for urban streetscape scenes
2817	--	2834	Saba Ghazanfar Ali, Xiaoxia Wang, Ping Li, Huating Li, Po Yang 0001, Younhyun Jung, Jing Qin 0001, Jinman Kim, Bin Sheng 0001. EGDNet: an efficient glomerular detection network for multiple anomalous pathological feature in glomerulonephritis
2835	--	2856	Pan Wu, Jin Tang. FHFN: content and context feature hierarchical fusion networks for multi-focus image fusion
2857	--	2873	Ling-Xiao Qin, Hong-mei Sun, Xiao-Meng Duan, Cheng-Yue Che, Rui-sheng Jia. Adaptive learning-enhanced lightweight network for real-time vehicle density estimation
2875	--	2889	Jit Chatterjee, Maria Torres Vega. 3D-Scene-Former: 3D scene generation from a single RGB image using Transformers
2891	--	2906	Xinyi Liu, Guoheng Huang, Xiaochen Yuan, Zewen Zheng, Guo-Zhong, Xuhang Chen 0002, Chi-Man Pun. Weakly supervised semantic segmentation via saliency perception with uncertainty-guided noise suppression
2907	--	2921	Jiazhe Miao, Tao Peng 0006, Fei Fang, Xinrong Hu, Li Li 0094. TDGar-Ani: temporal motion fusion model and deformation correction network for enhancing garment animation details
2923	--	2937	Wei Song, Kaili Yang. Dual adaptive local semantic alignment for few-shot fine-grained classification
2939	--	2951	Changhong Shi, Weirong Liu 0002, Jiahao Meng, Xiongfei Jia, Jie Liu. Self-prior guided generative adversarial network for image inpainting
2953	--	2972	Chunyu Liu, Yixiao Jin, Zhouyu Guan, Tingyao Li, Yiming Qin, Bo Qian, Zehua Jiang, Yilan Wu, Xiangning Wang, Ying-Feng Zheng, Dian Zeng. Visual-language foundation models in medicine
2973	--	2985	Xin Zhao, Yinhuang Chen, Chengzhuan Yang, Lincong Fang. FuseNet: a multi-modal feature fusion network for 3D shape classification
2987	--	3001	Hao Li, Guoheng Huang, Xiaochen Yuan, Zewen Zheng, Xuhang Chen 0002, Guo-Zhong, Chi-Man Pun. Psanet: prototype-guided salient attention for few-shot segmentation

1415	--	1433	Yanfeng Zhao, Zhenjian Yang, Yunjie Zhang, Yadong Chen. BGFNet: boundary information-aided graph structure fusion network for semantic segmentation of remote sensing images
1435	--	1451	Shuo Tong, Han Liu 0007, Runyuan Guo, Wenqing Wang 0001, Ding Liu. Context-Aware Enhanced Virtual Try-On Network with fabric adaptive registration
1453	--	1466	Pengshu Du, Xiao Wang, Qi Zheng, Xi Wang, Weigang Li, Xin Xu. Glare countering and exploiting via dual stream network for nighttime vehicle detection
1467	--	1484	Yongli Liu, Degang Yang, Tingting Song, Yichen Ye, Xin Zhang. YOLO-SSP: an object detection model based on pyramid spatial attention and improved downsampling strategy for remote sensing images
1485	--	1498	Robin G. C. Maack, Felix Raith, Juan F. Pérez, Gerik Scheuermann, Christina Gillmann. A workflow to systematically design uncertainty-aware visual analytics applications
1499	--	1509	Qiguang Zhu, Qiang Cen, Yuxin Wang, Weidong Chen 0001, Shuo Liu. An underwater target recognition algorithm incorporating improved attention mechanism and downsampling
1511	--	1525	Wenyue Sun, Jindong Zhang, Yitong Liu. Adversarial-based refinement dual-branch network for semi-supervised salient object detection of strip steel surface defects
1527	--	1541	Jun Yang, Zilu Wu, Renbiao Wu. Micro-expression recognition based on contextual transformer networks
1543	--	1554	Ya Li, Ziming Li, Huiwang Liu, Qing Wang. ZMNet: feature fusion and semantic boundary supervision for real-time semantic segmentation
1555	--	1571	Jindrich Adolf, Peter Kán, Tiare Feuchtner, Barbora Adolfová, Jaromír Dolezal, Lenka Lhotská. Offistretch: camera-based real-time feedback for daily stretching exercises
1573	--	1589	Qunpo Liu, Zhiwei Lu, Ruxin Gao, Xuhui Bu, Naohiko Hanajima. SimpleMask: parameter link and efficient instance segmentation
1591	--	1608	Xiao Fang, Xin Gao, Baofeng Li, Feng Zhai, Yu Qin, Zhihang Meng, Jiansheng Lu, Chun-Xiao. A non-uniform low-light image enhancement method with multi-scale attention transformer and luminance consistency loss
1609	--	1620	Haibin Li, Aodi Guo, Yaqian Li. CCMA: CapsNet for audio-video sentiment analysis using cross-modal attention
1621	--	1635	Xun Zhao, Feiyun Xu, Zheng Liu. TransDehaze: transformer-enhanced texture attention for end-to-end single image dehaze
1637	--	1654	Qi Zhao, Congxuan Zhang, Zhibo Rao, Zhen Chen, Zige Wang, Ke Lu. GPDF-Net: geometric prior-guided stereo matching with disparity fusion refinement
1655	--	1671	Haihua Ding, Chuan Lin, Fuzhang Li, Yongcai Pan. A feature aggregation network for contour detection inspired by complex cells properties
1673	--	1688	Zhengwu Yuan, Peixian Tang, Xinguang Sang, Fan Zhang, Zheqi Zhang. Visionary: vision-aware enhancement with reminding scenes generated by captions via multimodal transformer for embodied referring expression
1689	--	1704	Munish Bhardwaj, Nafis uddin Khan, Vikas Baghel. Road crack detection using pixel classification and intensity-based distinctive fuzzy C-means clustering
1705	--	1717	Houfu Peng, Xing Lu, Daoxun Xia, Xiaoyao Xie. A novel image restoration solution for cross-resolution person re-identification
1719	--	1731	Caifeng Liu, Fangjie Gu. Differential motion attention network for efficient action recognition
1733	--	1755	Gang Zhang, Yang Geng, Zhao G. Gong. A comprehensive review of deep learning approaches for group activity analysis
1757	--	1775	Huijuan Wang, Xinyue Chen, Quanbo Yuan, Peng Liu. A review of 3D object detection based on autonomous driving
1777	--	1788	Libo Sun, Yifan Li, Wenhu Qin. PEPillar: a point-enhanced pillar network for efficient 3D object detection in autonomous driving
1789	--	1809	Mohamed Charfeddine Mzoughi, Najib Ben Aoun, Sami Naouali. A review on kinship verification from facial information
1811	--	1825	Jiawei Chen, Wen Su, Mengjiao Ge, Ye He, Jun Yu. To-Former: semantic segmentation of transparent object with edge-enhanced transformer
1827	--	1840	Ying Ma, Meng Wang, Guangyun Lu, Yajun Sun. Multi-label semantic sharing based on graph convolutional network for image-to-text retrieval
1841	--	1854	Xiafan Li, Hongyan Quan. MVPCL: multi-view prototype consistency learning for semi-supervised medical image segmentation
1855	--	1872	Yihe Nie, Xingbo Zhao, Yongxiang Li, Qianwen Lu, Qingchuan Tao, Yanmei Yu. DEAR: a novel deep-level semantics feature reinforce framework for Infrared Small Object Segmentation
1873	--	1889	Aokun Mei, Hua Huo, Jiaxin Xu, Ningya Xu. Multistage attention region supplement transformer for fine-grained visual categorization
1891	--	1905	Tong Li, Zhaoxuan Zhang, Yuxin Wang, Yan Cui, Yuqi Li, Dongsheng Zhou, Baocai Yin, Xin Yang. Self-supervised indoor scene point cloud completion from a single panorama
1907	--	1920	Xuyuan Zhang, Chen Xu 0004, Yu Han 0001, George Baciu. Fabric image recolorization by fuzzy pretrained neural network
1921	--	1938	Shilong Wang, Qianwen Hou, Jiaang Li, Jianlei Liu. TSID-Net: a two-stage single image dehazing framework with style transfer and contrastive knowledge transfer
1939	--	1956	Xiaohong Zhang, Shengwu Xiong 0001, Zhaoyang Sun, Jianwen Xiang. Semi-hard constraint augmentation of triplet learning to improve image corruption classification
1957	--	1969	Huijuan Wang, Boyan Cui, Quanbo Yuan, Gangqiang Pu, Xueli Liu, Jie Zhu. Mini-3DCvT: a lightweight lip-reading method based on 3D convolution visual transformer
1971	--	1986	Zhigang Huang, Wanli Xue, Yuxi Zhou, Jinlu Sun, Yazhou Wu, Tiantian Yuan, Shengyong Chen. Dual-stage temporal perception network for continuous sign language recognition
1987	--	1998	Zixuan Yu, Zhenjun Tang, Xiaoping Liang, Hanyun Zhang, Ronghai Sun, Xianquan Zhang. A novel image hashing with low-rank sparse matrix decomposition and feature distance
1999	--	2010	Shiyu Li, Zehao Liu, Meijing Gao, Yang Bai, Haozheng Yin. MDSCN: multiscale depthwise separable convolutional network for underwater graphics restoration
2011	--	2027	Suyi Liu, Fang Xu, Chengdong Wu, Jianning Chi, Xiaosheng Yu, Longxing Wei, Chuanjiang Leng. CMT-6D: a lightweight iterative 6DoF pose estimation network based on cross-modal Transformer
2029	--	2046	Jun Wu, Wanyu Nie, Yu Zheng, Gan Zuo, Jiaming Dong, Siwei Wei. Malleable pruning meets more scaled wide-area of attention model for real-time crack detection
2047	--	2060	Qiwang Li, Mingwen Shao, Fukang Liu, Yuanjian Qiao, Zhiyong Hu. Contrastive local constraint for irregular image reconstruction and editability
2061	--	0	Xiang Suo, Weidi Tang, Lijuan Mao, Zhen Li. Correction: Digital human and embodied intelligence for sports science: advancements, opportunities and prospects
2063	--	0	Dhruv Meduri, Mohit Sharma, Vijay Natarajan. Correction to: Jacobi set simplification for tracking topological features in time-varying scalar fields

785	--	798	Jianliang Li, Jinming Zhang, Xiaohai Zhang, Ming Chen. Edge-guided generative network with attention for point cloud completion
799	--	813	Haowei Zhu, Suqin Bai, Jinlong Shi, Chenggen Wang, Yunhan Sun, Jiawen Lu, Xin Shu, Shucheng Huang. IOFusion: instance segmentation and optical-flow guided 3D reconstruction in dynamic scenes
815	--	829	Chao Yang, Meng Yang 0011, HongYu Li, Linlu Jiang, Xiang Suo, Lijuan Mao, Weiliang Meng, Zhen Li. A survey on soccer player detection and tracking with videos
831	--	851	Sameer Bhimrao Patil, Suresh Shirgave. Instructor emotion recognition system using manta ray foraging algorithm for improving the content delivery in video lecture
853	--	867	Ting Yu, Weiliang Meng, Zhongqi Wu, Jianwei Guo, Xiaopeng Zhang 0001. Diff-pcg: diffusion point cloud generation conditioned on continuous normalizing flow
869	--	881	Yasmeen Cheema, Muhammad Nadeem Cheema, Anam Nazir, Fahad Ahmed KhoKhar, Ping Li 0016, Ayaz Ahmed. A novel approach for improving open scene text translation with modified GAN
883	--	900	Pengbin Fu, Ganyun Xiao, Huirong Yang. SATD: syntax-aware handwritten mathematical expression recognition based on tree-structured transformer decoder
901	--	919	Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Pablo Carballeira. Per-class curriculum for Unsupervised Domain Adaptation in semantic segmentation
921	--	943	Supriya Agrawal, Prachi Natu. OBB detector: occluded object detection based on geometric modeling of video frames
945	--	960	Xin Wang, Jin Feng, Jiajia Ding, Jun Gao. Light field salient object detection based on discrete viewpoint selection and multi-feature fusion
961	--	973	Zhizhen Zhou, Yejing Huo, Guoheng Huang, An Zeng, Xuhang Chen 0002, Lian Huang, Zinuo Li. QEAN: quaternion-enhanced attention network for visual dance generation
975	--	990	Shunsuke Takao. Underwater image sharpening and color correction via dataset based on revised underwater image formation model
991	--	1006	Junqing Yuan, Mengting Fan, Zhenyang Liu, Tongxuan Han, Zhenzhong Kuang, Chihao Pan, Jiajun Ding. Collaborative neural radiance fields for novel view synthesis
1007	--	1020	Can Zhang, Feipeng Da, Shaoyan Gai. Point clouds feature frequency domain analysis based on multilayer perceptron
1021	--	1036	Lei Wang, Xue-song Tang, Kuangrong Hao. GFPE-ViT: vision transformer with geometric-fractal-based position encoding
1037	--	1048	Fahad Ahmed KhoKhar, Jamal Hussain Shah, Rabia Saleem, Anum Masood. Harnessing deep learning for faster water quality assessment: identifying bacterial contaminants in real time
1049	--	1059	Yixiao Jin, Fu Gui, Minghao Chen, Xiang Chen, Haoxuan Li, Jingfa Zhang. Deep learning-driven automated quality assessment of ultra-widefield optical coherence tomography angiography images for diabetic retinopathy
1061	--	1077	Bo Qian, Xiangning Wang, Zhouyu Guan, Dawei Yang, An-ran Ran, Tingyao Li, Zheyuan Wang, Yang Wen, Xinming Shu, Jinyang Xie, Shichang Liu, Guanyu Xing, Julio Silva-Rodríguez, Riadh Kobbi, Ping Li 0016, Tingli Chen, Lei Bi 0001, Jinman Kim, Weiping Jia, Huating Li, Jing Qin 0001, Ping Zhang 0016, Ching Yu Cheng, Pheng-Ann Heng, Tien Yin Wong, Carol Y. Cheung, Yih Chung Tham, Nadia Magnenat-Thalmann, Bin Sheng 0001. HRDC challenge: a public benchmark for hypertension and hypertensive retinopathy classification from fundus images
1079	--	1096	Dapeng Yan, Gangyi Ding, Kexiang Huang, Tianyu Huang. Generating natural pedestrian crowds by learning real crowd trajectories through a transformer-based GAN
1097	--	1108	Yan Zhou, Xiang Chen, Tingyao Li, Shiqun Lin, Bin Sheng 0001, Ruhan Liu, Rongping Dai. GAMNet: a gated attention mechanism network for grading myopic traction maculopathy in OCT images
1109	--	1125	Gang Liu, Jiebang Wang, Yao Qian, Yonghua Li. Infrared and visible image fusion method based on visual saliency objects and fuzzy region attributes
1127	--	1140	Shweta Saboo, Joyeeta Singha. Semantic hand gesture integration system using self-co-articulation and movement epenthesis detection
1141	--	1154	Lars Zawallich. Unfolding polyhedra via tabu search
1155	--	1170	Bo Qian, Hao Chen 0011, Yupeng Xu, Yang Wen, Huating Li, Yuan Xie 0006, David Dagan Feng, Jinman Kim, Lei Bi 0001, Xun Xu, Xiangui He, Bin Sheng 0001. Deep contour attention learning for scleral deformation from OCT images
1171	--	1181	Lan Wei, Nikolaos M. Freris. Multi-scale graph neural network for physics-informed fluid simulation
1183	--	1196	Mengsi Guo, Mingfu Xiong, Jin Huang, Xinrong Hu, Tao Peng 0006. Face photo-sketch portraits transformation via generation pipeline
1197	--	1211	Mengsi Wang, Yuan Mei 0001, Lichun Yang, Bin Tian, Kaijun Wu 0001. SDR: stepwise deep rectangling model for stitched images
1213	--	1226	Qingkuo Meng, Yongjian Huai, Fei Ma, Wentao Ye, Haifeng Xu, Siyu Yang. Visualization of the occurrence and spread of wildfires in three-dimensional natural scenes
1227	--	1239	Xuan Miao, Shijie Li, Zheng Li, Wenzheng Xu, Ning Yang 0001. Multi-scale gated network for efficient image super-resolution
1241	--	1249	Václav Skala. A new fully projective O(lg N) line convex polygon intersection algorithm
1251	--	1271	Gaoming Yang, Yifeng Ding, Xianjin Fang, Ji Zhang 0001, Yan Chu. Fast face swapping with high-fidelity lightweight generator assisted by online knowledge distillation
1273	--	1291	Wensheng Li, Jing Zhang, Jiafeng Li, Li Zhuo. Unpaved road segmentation of UAV imagery via a global vision transformer with dilated cross window self-attention for dynamic map
1293	--	1302	Xiangning Wang, Zhouyu Guan, Bo Qian, Tingli Chen, Qiang Wu. A deep learning system for the detection of optic disc neovascularization in diabetic retinopathy using optical coherence tomography angiography images
1303	--	1317	Mei Zhang, Lingling Liu, Yongtao Pei, Guojing Xie, JingHua Wen. Semantic segmentation of multi-scale remote sensing images with contextual feature enhancement
1319	--	1333	Ya'nan Guan, Shujiao Liao, Wenyuan Yang. AParC-DETR: Accelerate DETR training by introducing Adaptive Position-aware Circular Convolution
1335	--	1350	Yong Liu, Xingyuan Li, Yong Liu, Wei Zhong. SimpliFusion: a simplified infrared and visible image fusion network
1351	--	1366	Liping Zhu, Silin Wu, Xianxiang Chang, Yixuan Yang, Xuan Li. Rethinking group activity recognition under the open set condition
1367	--	1378	Yuanqi Hu, Jianqi Zhang, Ling Bai, Jing Li, Bing Li, Ying Zang, Wenjun Hu. From sketch to reality: precision-friendly 3D generation technology
1379	--	1394	Wenxuan Liu, Xuemei Jia, Yihao Ju, Yakun Ju, Kui Jiang, Shifeng Wu, Luo Zhong, Xian Zhong. Fragrant: frequency-auxiliary guided relational attention network for low-light action recognition
1395	--	1408	Wuzhen Shi, Fei Tao, Yang Wen. Joint super-resolution-based fast face image coding for human and machine vision
1409	--	1411	Shengzhou Luo, Jingxing Xu, John Dingliana, Mingqiang Wei, Lu Han, Lewei He, Jiahui Pan. Publisher Correction: Twinenet: coupling features for synthesizing volume rendered images via convolutional encoder-decoders and multilayer perceptrons
1413	--	1414	Liwen Huang, Shujiao Liao, Wenyuan Yang. Correction: DC-PSENet: a novel scene text detection method integrating double ResNet-based and changed channels recursive feature pyramid

7013	--	7025	Mengyao Liu, Ruhan Liu, Jia Shu, Qirong Liu, Yuan Zhang, Lixin Jiang. AutoDDH: A dual-attention multi-task network for grading developmental dysplasia of the hip in ultrasound images
7027	--	7047	Lakshita Agarwal, Bindu Verma. Enriching image description generation through multi-modal fusion of VGG16, scene graphs and BiGRU
7049	--	7061	Main Uddin, Zhangjie Fu, Xiang Zhang. Deepfake face detection via multi-level discrete wavelet transform and vision transformer
7063	--	7078	Mengnan Hu, Qianli Zhou, Rong Wang. Bridging visible and infrared modalities: a dual-level joint align network for person re-identification
7079	--	7092	Hao Liu, Ye Liu, Shuanglong Yao, Tongshuai Yu, Ke Gao, Pengcheng Hao, Shuqing He, Ji Chen, Xing Wang. ISTFormer: lightweight transformer for enhanced super-resolution of coal rock images via iterative feature extraction
7093	--	7108	Zhehang Qiu, Huijuan Zhang, Jie Zhou, Jianming Zhan. Image restoration for both deblurring and dehazing based on multi-channel frequency information using deep neural network
7109	--	7121	Xi Li, Yulong Feng, Xianguo Yu, Yirui Cong, Lili Chen. Epipolar constraint-guided differentiable keypoint detection and description
7123	--	7139	Wei Pan, Zhe Yang 0005. A lightweight enhanced YOLOv8 algorithm for detecting small objects in UAV aerial photography
7141	--	7167	Sung-Wook Park, Se-Hoon Jung, Chun-Bo Sim. NeXtSRGAN: enhancing super-resolution GAN with ConvNeXt discriminator for superior realism
7169	--	7184	Yuyan Liu, Qing Zhang, Yilin Zhao, Yanjiao Shi. A dual-stream learning framework for weakly supervised salient object detection with multi-strategy integration
7185	--	7199	Guoquan Jiang, Canyu Wang, Zhanqiang Huo, Huan Xu. Multi-channel correlated diffusion for text-driven artistic style transfer
7201	--	7214	Lihua Yang, Jinxian Zhao, Ziming Wang, Yuheng Liu, Dazhao Chi. M-KANUNet: enhanced defect segmentation in X-ray images of copper pipe welds via multi-scale representation and Kolmogorov-Arnold Networks
7215	--	7232	Xingyue Zou, Jiqiang Tang. Guided fusion of infrared and visible images using gradient-based attentive generative adversarial networks
7233	--	7248	Lei Dai, Wen Gao, Chengyu Tang, Min Wang, Zhihua Chen. MTMFNet: multi-threshold and multi-scale feature fusion network for text detection
7249	--	7267	Huaiguang Cai, Yang Yang 0056, Yongqiang Tang, Zhengya Sun, Wensheng Zhang 0002. Shapley value-based class activation mapping for improved explainability in neural networks
7269	--	7283	Wei Song, Yaobin Huang. Adaptive feature recalibration transformer for enhancing few-shot image classification
7285	--	7302	Jialin Zhang, Xiao Wang, Hui Wei, Kui Jiang, Nan Mu, Zheng Wang. Context-aware target texture perturbation attack for concealed object detection
7303	--	7317	Qida Cao, Jiajun Ding, Zhenyang Liu, Zhenzhong Kuang, Yijie Shao, Yilan Shen. VC-GS: view-consistent deblurring Gaussian splatting via alternating branch optimization
7319	--	7340	Fuqiang Gou, Yonglong Li, Yanpian Mao, Chunyao Hou, Gang Wan, Jialong Li, Haoran Wang, Yongcan Chen. Planar tunnel point cloud fine registration under multiple constraints
7341	--	7350	Haitian Ren, Quinten Kwok, Meng Sun, Xuyan Huang, Jianlin Zhu, Haoxuan Li. Toward artificial general intelligence in health care
7351	--	7365	Chen-Bin Feng, Qi Lai, Kangdao Liu, Houcheng Su, Hao Chen, Kaixi Luo, Chi-Man Vong. Learning few-shot semantic segmentation with error-filtered segment anything model
7367	--	7377	Peng Zhang, Yuming Yan, Yuangao Ai, Benhong Wang, Houming Shen, Zhonghan Peng. Unet-based image segmentation and binarization for water level detection
7379	--	7397	Manuel Silva, Antonio Seoane, Omar A. Mures, Antonio M. López 0001, José Antonio Iglesias Guitián. Exploring the effects of synthetic data generation: a case study on autonomous driving for semantic segmentation
7399	--	7415	Ronggui Wang, Hong Chen, Juan Yang, Lixia Xue. Adaptive sparse triple convolutional attention for enhanced visual question answering
7417	--	7432	Die Yu, Zhaoyan Fang, Yong Jiang. Alleviating category confusion in fine-grained visual classification
7433	--	7446	Haomiao Liu, Hao Xu, Chuhuai Yue, Bo Ma. Adaptive objectness learning for enhanced unknown object detection
7447	--	7458	Xinbiao Lu, Yisen Chen, Yudan Chen, Xing Gao, Tieliu Yang, Guiyun Chen. STIG-Net: a spatial-temporal interactive graph framework for recognizing violent behaviors in videos
7459	--	7475	Keqi Li, Yaping Wan, Gang Zou, Wangxiu Li, Jian Yang, Changyi Xie. Enhancing facial action unit recognition through topological feature integration and relational learning
7477	--	7491	Yuenan Wang, Hua Wang, Fan Zhang 0045. Mask autoencoder for enhanced image reconstruction with position coding offset and combined masking
7493	--	7508	Haowei Zhu, Suqin Bai, Jinlong Shi, Jiawen Lu, Xin Zuo, Shucheng Huang, Xu Yao. Ellipsoid-SLAM: enhancing dynamic scene understanding through ellipsoidal object representation and trajectory tracking
7509	--	7520	Daikun Qu, Hongwei Zhao, Mingzhu Zhou. Unsupervised video object segmentation with mask transformer: boosting accuracy and efficiency through feature fusion
7521	--	7533	Cheng Zhong, Xiaomin Yu, Huan Xia, Rongdong Xie, Qingyi Xu. Restoring intricate Miao embroidery patterns: a GAN-based U-Net with spatial-channel attention
7535	--	7549	Jinyang Wang, Jihong Wang, Haoxuan Li, Xiaojun Huang, Jun Xia, Zhen Li, Weibing Wu, Bin Sheng. Temporal goal-aware transformer assisted visual reinforcement learning for virtual table tennis agent
7551	--	7565	Junchi Ma, Yuanqing Wang, Guangmiao Ding, Wei Cao, Xiangyun Liao, Ping Zhang, Jianping Lv. Mamba-enhanced hierarchical attention network for precise visualization of hippocampus and amygdala
7567	--	7584	Yuhao Zhang, Jiaqi Tong, Honglin Liu. SCAP: enhancing image captioning through lightweight feature sifting and hierarchical decoding
7585	--	7601	Yan Zhang, Xueting Sang, Yemei Sun, Shudong Liu, Shengpei Zhou. DMTNet: dual-domain adaptive multi-scale feature fusion network with transformer for small target detection
7603	--	7616	Xiaochun Wu, Ning Guo. MGSLU-Net: a lightweight network for efficient detection of water leakage in subway tunnel linings
7617	--	7640	Kehao Chen, Zhiping Zhou, Kewei Li, Taoyong Su, Zhaozhong Zhang, Jinhua Liu, Chenghao Ying. Red green blue-depth salient object detection based on multi-scale refinement and cross-modalities fusion network
7641	--	7656	Fang Zhou, Tingting Yang, Liuyan Tan, Xiaolong Xu, Mengdao Xing. DAP-Net: enhancing SAR target recognition with dual-channel attention and polarimetric features
7657	--	7670	Cheng Jiang, Pengle Zhang, Ying Ni, Xiaoli Wang, Hanghang Peng, Sen Liu, Mengdi Fei, Yuxin He, Yaxuan Xiao, Jin Huang, Xingyu Ma, Tian Yang. Multimodal retrieval-augmented generation for financial documents: image-centric analysis of charts and tables with large language models
7671	--	7685	Zhaozhao Yang, Yuhai Yu, Yongdong Huang, Jiana Meng. Innovative approaches in image processing: enhancing feature extraction and recognition capabilities
7687	--	7702	Yihao Li, Junyu Liu, Xiaoyu Guan, Hanming Hou, Tianyu Huang. Introducing anisotropic fields for enhanced diversity in crowd simulation
7703	--	7721	Liming Wan, Lin Song, Ying Zhou, Chenrui Kang, Shijian Zheng, Guo Chen. Dynamic neighbourhood-enhanced UNet with interwoven fusion for medical image segmentation
7723	--	7733	Haomou Bai, Yue Sang. Ultra-lightweight convolutional network for efficient single-image super-resolution
7735	--	7750	Sathish Mothe, Srinivas Kankanala. Multi-stage residual network with two fold attention mechanisms for low-light image enhancement
7751	--	7766	Xie Chengjie, Lu Shuhua, Shi Yangyu, Zheng Diwen. Joint perturbation consistency across image and feature levels for cross-domain adaptive crowd counting
7767	--	7780	Pengyun Chen, Shuang Cui, Ning Cao, Wenhao Zhang, Pengfei Wang, Shaohui Jin, Mingliang Xu. Lightweight multi-scale feature fusion with attention guidance for passive non-line-of-sight imaging
7781	--	7798	Wu Shili, Guo Yongkun, Qian Chao, Li Ying, Zhang Xinyou. Global attention and context encoding for enhanced medical image segmentation
7799	--	7815	Xiang Shijie, Zhou Dong, Tian Dan. Multi-scale feature fusion network for real-time semantic segmentation of urban street scenes: enhancing detail retention and accuracy
7817	--	7838	Hao Li, Shengkun Wu, Lei Deng, Chenhua Liu, Yifan Chen, Hanrui Chen, Heng Yu, Mingli Dong, Lianqing Zhu. Enhancing infrared and visible image fusion through multiscale Gaussian total variation and adaptive local entropy
7839	--	7854	Duo Liu, Guoyin Zhang, Yiqi Shi, Ye Tian, Liguo Zhang. Efficient feature difference-based infrared and visible image fusion for low-light environments
7855	--	7865	Weichen Dai 0001, Hexing Wu, Xiaoyang Weng, Wanzeng Kong. Implicit guidance for enhancing low-light optical flow estimation via channel attention networks
7867	--	7882	Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Jose M. Martínez. Layer-wise model merging for unsupervised domain adaptation in segmentation tasks
7883	--	7907	Xinzhi Li, Yong Liu, Peng Yan. Optimizing feature map matching for marine benthic organism detection
7909	--	7923	Zhen Song, Jianhua Chen. Adaptive rate compression for distributed video sensing in wireless visual sensor networks
7925	--	7938	Jinxing Liang, Kaifang Han, Dongsheng Li, Ruixin Gao, Jiajia Peng, Tao Peng, Xinrong Hu. Enhancing low-frequency stitch code generation for knitted fabrics: an LFSCG-E-Net approach
7939	--	7950	Jiahao Wang, Yongqiang Wang, Congling Zhou, Jiawei Huang. LF-RTMDet: an instance segmentation algorithm for real-time detection of water-filled barriers
7951	--	7963	Xijun Wang, Xin Zhou, Yi Wang, Songto Zeng, Xinyu Liu, Haobo Shen, Song Fei, Lei Zhu. Msu-mamba: multi-scale defocus blur detection using cross-scale fusion and state-space models
7965	--	7981	Xite Wang, Changsheng Qin, Mei Bai, Qian Ma 0003, Guanyu Li. CAFormer: a connectivity-aware vision transformer for road extraction from remote sensing images
7983	--	7995	Zhenghao Xie, Junfen Chen, Yingying Wang, Bojun Xie. Enhanced fine-grained relearning for skeleton-based action recognition
7997	--	8008	Doudou Zhang, Junchi Ma, Jie Chen, Linxia Xiao, Xiangyun Liao, Yong Zhang, Weixin Si. MF-SAM: enhancing multi-modal fusion with Mamba in SAM-Med3D for GPi segmentation
8009	--	8023	Wubin Shi, Shaoyan Gai, Feipeng Da, Zeyu Cai, Jiaoling Wang. GRPoseNet: a generalizable and robust 6D object pose estimation network using sparse RGB views
8025	--	8040	Zongyu Ye, Hongjuan Yan, Yewang Sun, Bin Li, Lei Liu, Wenbo Wu. MSPNet: real-time semantic segmentation with large kernel and atrous convolutions
8041	--	8053	Zhengwei Guo, Bo Wang. Enhancing sandstorm images via color-guided spatial-frequency fusion network
8055	--	8073	Yu Pang, Yang Huang, Chenyu Weng, Jialin Lyu, Chuanyue Bai, Xiaosheng Yu. Enhanced RGB-T saliency detection via thermal-guided multi-stage attention network
8075	--	8087	Xiang Chen, Yuanqi Yao, Zhouyu Guan, Chenyang Li, Jian Guan, Jun Pu, Ruhan Liu, Bin Sheng 0001, Shankai Yin, Yiming Qin. DSTS-GF: a dual-stream temporal-spatial transformer with gated fusion for the classification of Obstructive Sleep Apnea
8089	--	8101	Yuanqi Yao, Zehua Jiang, Zhouyu Guan, Yilun Luxue, Seungmin Lee, Xiang Chen, Haodong Yang, Yiming Qin. A visual-language foundation model for disease diagnosis and doctor-patient co-decision
8103	--	8116	Shigang Hu, Darong Wu, Jianxin Wang, Shijun Huang. The image super-resolution network based on dual-branch feature interaction attention mechanism
8117	--	0	Tao Shi, Yao Ding 0012, Kui-feng Zhu, Yan-jie Su. Correction: DFP-YOLO: a lightweight machine tool workpiece defect detection algorithm based on computer vision
8119	--	0	Sung-Wook Park, Se-Hoon Jung, Chun-Bo Sim. Correction: NeXtSRGAN: enhancing super-resolution GAN with ConvNeXt discriminator for superior realism

1	--	2	Nadia Magnenat-Thalmann. Welcome to the Year 2025
3	--	10	. Acknowledgement to reviewers 2024
11	--	24	Wenji Yang, Liping Xie, Wenbin Qian, Canghai Wu, Hongyun Yang. Coarse-to-fine cascaded 3D hand reconstruction based on SSGC and MHSA
25	--	40	Gusu Song, Shaoyan Gai, Feipeng Da. Memory-based gradient-guided progressive propagation network for video deblurring
41	--	51	Rohit Pratap Singh, Dolendro Singh Laiphrakpam. Dyhand: dynamic hand gesture recognition using BiLSTM and soft attention methods
53	--	66	Zhe Li, Hui Lv, Libo Cheng, Xiaoning Jia. Image deblocking algorithm based on GC and SSR
67	--	78	I-Chao Shen, Li-wen Su, Yu-Ting Wu, Bing-Yu Chen 0004. StylePart: image-based shape part manipulation
79	--	97	Youssef Ait Khouya, Mohammed Ait Oussous, Abdeslam Jakimi, Faouzi Ghorbel. Stable and invertible invariants description for gray-level images based on Radon transform
99	--	114	Mahmoud A. Eldosoky, Jianping Li 0002, Amin Ul Haq, Fanyu Zeng, Mao Xu, Shakir Khan, Inayat Khan. WallNet: Hierarchical Visual Attention-Based Model for Putty Bulge Terminal Points Detection
115	--	128	Rajendra Nagar. Robust extrinsic symmetry estimation in 3D point clouds
129	--	140	Chen Zhao, Weiling Cai, Zheng Yuan. Spectral normalization and dual contrastive regularization for image-to-image translation
141	--	155	Ziliang Feng, Ju Zhang, Xusong Ran, Donglu Li, Chengfang Zhang. Ghost-Unet: multi-stage network for image deblurring via lightweight subnet learning
157	--	171	Chunlu Li, Feipeng Da. Refined dense face alignment through image matching
173	--	189	Xiongbo Lu, Feng Liu, Yi Rong, Yaxiong Chen, Shengwu Xiong 0001. MakeupDiffuse: a double image-controlled diffusion model for exquisite makeup transfer
191	--	208	Junjie Liu, Junlong Liu, Rongxin Jiang, Boxuan Gu, Yaowu Chen, Chen Shen 0003. Boosted verification using siamese neural network with DiffBlock
209	--	227	Xujia Qin, Xinyu Li, Mengjia Li, Hongbo Zheng, Xiaogang Xu. Self-supervised single-image 3D face reconstruction method based on attention mechanism and attribute refinement
229	--	241	Xiaochun Lei, Zeyu Chen, Zhaoxin Yu, Zetao Jiang. BENet: boundary-enhanced network for real-time semantic segmentation
243	--	255	Feihu Bian, Suya Xiong, Ran Yi, Lizhuang Ma. Multi-view stereo-regulated NeRF for urban scene novel view synthesis
257	--	270	Hengrui Zhang, Yongfeng Qi, Huili Chen, Panpan Cao, Anye Liang, Shengcong Wen. LSDNet: lightweight stochastic depth network for human pose estimation
271	--	280	Zubair Ahmad Lone, Alwyn Roshan Pais. Salient object detection in HSI using MEV-SFS and saliency optimization
281	--	301	Clement Mailhe, Amine Ammar, Francisco Chinesta, Dominique Baillargeat. Towards improving synthetic-to-real image correlation for instance recognition in structure monitoring
303	--	314	Yue Yu, Yue Yang, Jingshuo Xing. PMGAN: pretrained model-based generative adversarial network for text-to-image generation
315	--	330	Haoyu Xiong, Yu Xiang. Robust gradient aware and reliable entropy minimization for stable test-time adaptation in dynamic scenarios
331	--	343	Zhixuan Tang, Haiyun Shen, Peng Yu, Kaisong Zhang, Jianyu Chen. Infrared tracking for accurate localization by capturing global context information
345	--	358	Yixiu Liu, Long Zhan, Yu Feng, Pengju Si, Shaowei Jiang, Qiang Zhao, Chenggang Yan 0001. Loose-tight cluster regularization for unsupervised person re-identification
359	--	382	Le-Anh Tran, Dong-Chul Park 0002. Encoder-decoder networks with guided transmission map for effective image dehazing
383	--	397	Yixiu Liu, Tao Jiang, Pengju Si, Shangdong Zhu, Chenggang Yan, Shuai Wang 0003, Haibing Yin. Unpaired semantic neural person image synthesis
399	--	408	Yan Huang, Xinchang Lu, Jia Fu. Single image reflection removal via self-attention and local discrimination
409	--	421	Ziyang Chen 0002, Yang Zhao, Junling He, Yujie Lu, Zhongwei Cui, Wenting Li, Yongjun Zhang 0007. Feature distribution normalization network for multi-view stereo
423	--	435	Dayu Jia, Yanwei Pang, Jiale Cao, Jing Pan. SSNet: a joint learning network for semantic segmentation and disparity estimation
437	--	449	Ye Li, Wu Zhang, Meiling Wu, Di Zhang, Zhiguo Wang, Changjiang You. Multi-keypoints matching network for clothing detection
451	--	464	Zhentao Zhang, Wenhao Li, Yuxi Cheng, Qingnan Huang, Taorong Qiu. An improved residual learning model and its application to hardware image classification
465	--	479	Ping Ma, Xinyi He, Yiyang Chen, Yuan Liu 0021. ISOD: improved small object detection based on extended scale feature pyramid network
481	--	490	Jian-xiong, Jie Wu, Ming Tang, Pengwen Xiong, Yushui Huang, Hang Guo. Combining YOLO and background subtraction for small dynamic target detection
491	--	516	Henry Senior, Gregory G. Slabaugh, Shanxin Yuan, Luca Rossi 0011. Graph neural networks in vision-language image understanding: a survey
517	--	534	Yuanhao Chai, Jingyu Gong, Xin Tan 0002, Jiachen Xu, Yuan Xie 0006, Lizhuang Ma. Learnable scene prior for point cloud semantic segmentation
535	--	548	Kunhong Xiong, Linbo Qing, Lindong Li, Li Guo 0018, Yonghong Peng. Facial expression recognition based on local-global information reasoning and spatial distribution of landmark features
549	--	562	Lixia Xue, Wenhao Wang, Ronggui Wang, Juan Yang 0001. Modular dual-stream visual fusion network for visual question answering
563	--	577	Jinguang Chen, Xin Zhang, Lili Ma, Bo Yang, Kaibing Zhang. CS-VITON: a realistic virtual try-on network based on clothing region alignment and SPM
579	--	590	Huihui Li, Junhao Zhu, Guihua Wen, Haoyang Zhong. Structural self-contrast learning based on adaptive weighted negative samples for facial expression recognition
591	--	604	Lihuan Zheng, Wanru Xu, Zhenjiang Miao, Xinxiu Qiu, Shanshan Gong. RESTHT: relation-enhanced spatial-temporal hierarchical transformer for video captioning
605	--	624	Yanxiang Hu, Panpan Wu, Bo Zhang 0101, Wenhao Sun, Yaru Gao, Caixia Hao, Xinran Chen. A new multi-focus image fusion quality assessment method with convolutional sparse representation
625	--	638	Shuyu Xiao, Yongfang Wang, Yihan Wang. SISIM: statistical information similarity-based point cloud quality assessment
639	--	658	Jing Wu, Hao Wu 0015, Guowu Yuan. Detail-aware image denoising via structure preserved network and residual diffusion model
659	--	674	Luhan Wang, Jun Li, Shangwei Guo, Shaokun Han. A cascaded graph convolutional network for point cloud completion
675	--	693	Zhongxu Li, Qihan He, Wenyuan Yang. E-FPN: an enhanced feature pyramid network for UAV scenarios detection
695	--	708	Jiakun Zhao, Yige Cai. SCAKD: a knowledge distillation framework based on spatial-corner attention for infrared and visible image fusion
709	--	722	Hao Zhou, Junjie Yin, Yilun Yang, Meie Fang, Ping Li. Topology-guided accelerated vector field streamline visualization
723	--	737	Kun Wu, Lei Zhu 0010, Weihang Shi, Wenwu Wang 0008. Automated fabric defect detection using multi-scale fusion MemAE
739	--	757	A. Lubna, Saidalavi Kalady, A. Lijiya. Visual question answering on blood smear images using convolutional block attention module powered object detection
759	--	772	Xiyu Wei, Yanmei Dong, Qin Liu, Lei Wang, Liantang Lou. Robust corner detection in continuous space
773	--	783	Jing Zhao, Yongjun He, Zheng Shi, Jian Qin, Yining Xie. A style-aware network based on multi-task learning for multi-domain image normalization

External Links

Journal: The Visual Computer

Volume 41, Issue 9

Volume 41, Issue 8

Volume 41, Issue 7

Volume 41, Issue 6

Volume 41, Issue 5

Volume 41, Issue 4

Volume 41, Issue 3

Volume 41, Issue 2

Volume 41, Issue 10

Volume 41, Issue 1