- RSID: A Remote Sensing Image Dehazing Network. Yuan Li, Yafeng Zhao. 3-14
- ContextNet: Learning Context Information for Texture-Less Light Field Depth Estimation. Wentao Chao, Xuechun Wang, Yiming Kan, Fuqing Duan. 15-27
- An Efficient Way for Active None-Line-of-Sight: End-to-End Learned Compressed NLOS Imaging. Chen Chang, Tao Yue 0003, Siqi Ni, Xuemei Hu. 28-40
- DFAR-Net: Dual-Input Three-Branch Attention Fusion Reconstruction Network for Polarized Non-Line-of-Sight Imaging. Hao Liu, Pengfei Wang, Xin He, Ke Wang, Shaohu Jin, Pengyun Chen, Xiaoheng Jiang, Mingliang Xu. 41-52
- EVCPP: Example-Driven Virtual Camera Pose Prediction for Cloud Performing Arts Scenes. Jucheng Qiu, Xiaoyu Wu, Boshu Jia. 53-64
- RBSR: Efficient and Flexible Recurrent Network for Burst Super-Resolution. Renlong Wu, Zhilu Zhang, Shuohao Zhang, Hongzhi Zhang, Wangmeng Zuo. 65-78
- WDU-Net: Wavelet-Guided Deep Unfolding Network for Image Compressed Sensing Reconstruction. Xinlu Wang, Lijun Zhao 0002, Jinjing Zhang, Yufeng Zhang, Anhong Wang. 79-91
- Memory-Augmented Spatial-Temporal Consistency Network for Video Anomaly Detection. Zhangxun Li, Mengyang Zhao, Xinhua Zeng, Tian Wang, Chengxin Pang. 95-107
- Frequency and Spatial Domain Filter Network for Visual Object Tracking. Manqi Zhao, Shenyang Li, Han Wang. 108-120
- Enhancing Feature Representation for Anomaly Detection via Local-and-Global Temporal Relations and a Multi-stage Memory. Xuan Li, Ding Ma, Xiangqian Wu. 121-133
- DFAformer: A Dual Filtering Auxiliary Transformer for Efficient Online Action Detection in Streaming Videos. Shicheng Jing, Liping Xie. 134-145
- Relation-Guided Multi-stage Feature Aggregation Network for Video Object Detection. Tingting Yao, Fuxiao Cao, Fuheng Mi, Danmeng Li. 146-157
- Multimodal Local Feature Enhancement Network for Video Summarization. Zhaoyun Li, Xiwei Ren, Fengyi Du. 158-169
- Asymmetric Attention Fusion for Unsupervised Video Object Segmentation. Hongfan Jiang, Xiaojun Wu, Tianyang Xu. 170-182
- Flow-Guided Diffusion Autoencoder for Unsupervised Video Anomaly Detection. Aoni Zhu, Wenjun Wang, Cheng Yan. 183-194
- Prototypical Transformer for Weakly Supervised Action Segmentation. Tao Lin, Xiaobin Chang, Wei Sun, Wei-Shi Zheng 0001. 195-206
- Unimodal-Multimodal Collaborative Enhancement for Audio-Visual Event Localization. Huilin Tian, Jingke Meng, Yuhan Yao, Wei-Shi Zheng 0001. 207-219
- Dual-Memory Feature Aggregation for Video Object Detection. Diwei Fan, Huicheng Zheng, Jisheng Dang. 220-232
- Going Beyond Closed Sets: A Multimodal Perspective for Video Emotion Analysis. Hao Pu, Yuchong Sun, Ruihua Song, Xu Chen, Hao Jiang, Yi Liu, Zhao Cao. 233-244
- Temporal-Semantic Context Fusion for Robust Weakly Supervised Video Anomaly Detection. Yuan Zeng, Yuanyuan Wu, Jing Liang, Wu Zeng. 245-256
- A Survey: The Sensor-Based Method for Sign Language Recognition. Tian Yang, Cong Shen, Xinyue Wang, Xiaoyu Ma, Chen Ling. 257-268
- Utilizing Video Word Boundaries and Feature-Based Knowledge Distillation Improving Sentence-Level Lip Reading. Hongzhong Zhen, Chenglong Jiang, Jiyong Zhou, Liming Liang, Ying Gao. 269-281
- Denoised Temporal Relation Network for Temporal Action Segmentation. Zhichao Ma 0002, Kan Li 0001. 282-294
- 3D Lightweight Spatial-Spectral Attention Network for Hyperspectral Image Classification. Ziyou Zheng, Shuzhen Zhang, Hailong Song, Qi Yan. 297-308
- Deepfake Detection via Fine-Grained Classification and Global-Local Information Fusion. Tonghui Li, Yuanfang Guo, Yunhong Wang. 309-321
- Unsupervised Image-to-Image Translation with Style Consistency. Binxin Lai, Yuan-Gen Wang. 322-334
- SemanticCrop: Boosting Contrastive Learning via Semantic-Cropped Views. Ya Fang, Zipeng Chen, Weixuan Tang, Yuan-Gen Wang. 335-346
- Transformer-Based Multi-object Tracking in Unmanned Aerial Vehicles. Jiaxin Li, Hongjun Li. 347-358
- HEI-GAN: A Human-Environment Interaction Based GAN for Multimodal Human Trajectory Prediction. Zihao Wang, Xuguang Chen, Sichao Wen, YaoNong Wang. 359-370
- CenterMatch: A Center Matching Method for Semi-supervised Facial Expression Recognition. Linhuang Wang, Xin Kang, Satoshi Nakagawa, Fuji Ren. 371-383
- Cross-Dataset Distillation with Multi-tokens for Image Quality Assessment. Timin Gao, Weixuan Jin, Bokai Lai, Zhen Chen, Runze Hu, Yan Zhang, Pingyang Dai. 384-395
- Quality-Aware CLIP for Blind Image Quality Assessment. Wensheng Pan, Zhifu Yang, Dingming Liu, Chenxin Fang, Yan Zhang, Pingyang Dai. 396-408
- Multi-agent Perception via Co-attentive Communication Mechanism. Ning Gong, Zhi Li, Shaohui Li, Yuxin Ke, Zhizhuo Jiang, Yaowen Li, Yu Liu. 409-421
- DBRNet: Dual-Branch Real-Time Segmentation NetWork for Metal Defect Detection. Tianpeng Zhang, Xiumei Wei, Xiaoming Wu, Xuesong Jiang. 422-434
- MaskDiffuse: Text-Guided Face Mask Removal Based on Diffusion Models. Jingxia Lu, Xianxu Hou, Hao Li, Zhibin Peng, LinLin Shen, Lixin Fan. 435-446
- Image Generation Based Intra-class Variance Smoothing for Fine-Grained Visual Classification. Zihan Yan, Ruoyi Du, Kongming Liang, Tao Wei, Wei Chen, Zhanyu Ma. 447-459
- Cross-Domain Soft Adaptive Teacher for Syn2Real Object Detection. Weijie Guo 0002, Boyong He, Yaoyuan Wu, Xianjiang Li, Liaoni Wu. 460-472
- Dynamic Graph-Driven Heat Diffusion: Enhancing Industrial Semantic Segmentation. Jiaquan Li, Min Jiang, Minghui Shi. 473-484
- EKGRL: Entity-Based Knowledge Graph Representation Learning for Fact-Based Visual Question Answering. Yongjian Ren, Xiaotang Chen, Kaiqi Huang. 485-496
- Disentangled Attribute Features Vision Transformer for Pedestrian Attribute Recognition. Caihua Liu, Jiaxian Guo, Sichu Chen, Xia Feng. 497-509
- A High-Resolution Network Based on Feature Redundancy Reduction and Attention Mechanism. YuQing Pan, Weiming Lan, Feng Xu, Qinghua Ren. 510-521