Abstract is missing.
- DCI-Net: Remote Sensing Image-Based Object DetectorQuanyue Cui, Jun Lu. 1-15 [doi]
- Cross-Modal Ship Grounding: Towards Large Model for Enhanced Few-Shot LearningQuan Hu, Li Chen, Zhida Feng, YaoJie Chen. 16-28 [doi]
- STNet: Small Target Detection Network for IR ImageryNikhil Kumar, Pranav Singh Chib, Pravendra Singh. 29-44 [doi]
- FF-Yolo: A Feature-Fusion Yolo Model for Small Scale FODs Detection in Airport RunwaysSoumen Biswas, Ananth Ganesh. 45-60 [doi]
- Weakly Aligned Multi-spectral Pedestrian Detection via Cross-Modality Differential Enhancement and Multi-scale Spatial AlignmentZhenzhou Shao, Yongxin Chen, Yibo Zou, Jie Zhang, Yong Guan. 61-73 [doi]
- CrackUDA: Incremental Unsupervised Domain Adaptation for Improved Crack Segmentation in Civil StructuresKushagra Srivastava, Damodar Datta Kancharla, Rizvi Tahereen, Pradeep Kumar Ramancharla, Ravi Kiran Sarvadevabhatla, Harikumar Kandath 0001. 74-89 [doi]
- DS MYOLO: A Reliable Object Detector Based on SSMs for Driving ScenariosYang Li, Jianli Xiao. 90-104 [doi]
- Robust Single-Cam Surround View Object Detection and Localization Using Memory MapsYitong Quan, Benjamin Kiefer, Martin Messmer, Charan Ram Akupati, Rainer Graser, Andreas Zell. 105-118 [doi]
- Exploring the Reliability of Foundation Model-Based Frontier Selection in Zero-Shot Object Goal NavigationShuaihang Yuan, Halil Utku Unlu, Hao Huang 0003, Congcong Wen, Anthony Tzes, Yi Fang 0006. 119-134 [doi]
- Reliable Semantic Understanding for Real World Zero-Shot Object Goal NavigationHalil Utku Unlu, Shuaihang Yuan, Congcong Wen, Hao Huang 0003, Anthony Tzes, Yi Fang 0006. 135-150 [doi]
- AllWeather-Net: Unified Image Enhancement for Autonomous Driving Under Adverse Weather and Low-Light ConditionsChenghao Qian, Mahdi Rezaei 0001, Saeed Anwar, Wenjing Li 0005, Tanveer Hussain, Mohsen Azarmi, Wei Wang 0335. 151-166 [doi]
- Uni4DAL: A Unified Baseline for Multi-dataset 4D Auto-LabelingZhiyuan Yang, Xuekuan Wang, Wei Zhang 0197, Xiao Tan 0001, Jinchen Lu, Jingdong Wang 0001, Errui Ding, Zhihui Lai, Cairong Zhao. 167-182 [doi]
- Dual-Attention Fusion Network with Edge and Content Guidance for Remote Sensing Images SegmentationShuaipeng Ding, Jianan Shui, Xin Li, Mingyong Li. 183-197 [doi]
- Distortion Correction Sub-network for Semantic Segmentation Based on Deep Hough TransformWanpeng Geng, Jing Liu, Dexin Zhang, Hui Zhang. 198-218 [doi]
- MemoFlow: Modifying Explicit Motion of Inconsistency in Optical FlowMengfei Wang, Wenjun Shi, Dongchen Zhu, Lei Wang, Jiamao Li. 219-234 [doi]
- Enhanced Brain Tumor Segmentation Using Preprocessing Techniques and 3D U-NetAbdelrahman Telib, Mohamed Gabr. 235-248 [doi]
- Joint Top-Down and Bottom-Up Frameworks for 3D Visual GroundingYang Liu, Daizong Liu, Wei Hu 0003. 249-264 [doi]
- Anticipating Future Object Compositions Without ForgettingYoussef Zahran, Gertjan J. Burghouts, Yke Bauke Eisma. 265-279 [doi]
- SPK: Semantic and Positional Knowledge for Zero-Shot Referring Expression ComprehensionZetao Du, Jianhua Yang, Junbo Wang, Yan Huang, Liang Wang. 280-295 [doi]
- Can Language Improve Visual Features For Distinguishing Unseen Plant Diseases?Jerad Zherui Liaw, Abel Yu Hao Chai, Sue Han Lee, Pierre Bonnet, Alexis Joly. 296-311 [doi]
- Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text TranslationShreyas Vaidya, Arvind Kumar Sharma, Prajwal Gatti, Anand Mishra 0001. 312-328 [doi]
- iGrasp: An Interactive 2D-3D Framework for 6-DoF Grasp DetectionJian-Jian Jiang, Xiao-Ming Wu 0002, Zibo Chen, Yi-Lin Wei, Wei-Shi Zheng 0001. 329-345 [doi]
- Goal-Driven Transformer for Robot Behavior Learning from Play DataCongcong Wen, Jiazhao Liang, Shuaihang Yuan, Hao Huang 0003, Yu Hao, Hui Lin, Yu-Shen Liu, Yi Fang 0006. 346-359 [doi]
- Adaptive Dynamic VSLAM: Refining Semantic-Geometric Fusion and Static Background InpaintingQi Mu, Baizhang Guo, Shuai Guo, Zhanli Li. 360-376 [doi]
- Hierarchical Visual Place Recognition with Semantic-Guided AttentionWenwen Ming, Xucan Chen, Zhe Liu, Ruihao Li, Wei Yi. 377-392 [doi]
- Dense Reconstruction and Localization in Scenes with Glass Surfaces Based on ORB-SLAM2Zeyuan Chen, Ziquan Wang, Qiang Gao, Masahiko Mikawa, Makoto Fujisawa. 393-410 [doi]
- Content-Aware Feature Upsampling for Voxel-Based 3D Semantic SegmentationYu Song, Ruigang Fu, Qingyong Hu, Biao Li, Ping Zhong. 411-426 [doi]
- Enhancing 3D Referential Grounding by Learning Coarse Spatial RelationshipsSoham Joshi, Aditay Tripathi, Viswanath Gopalakrishnan, Anirban Chakraborty 0001. 427-442 [doi]
- PointGADM: Geometry Acquainted Deep Model for 3D Point Cloud AnalysisSeema Kumari, Samay Kalpesh Patel, Raja Muthalagu, Shanmuganathan Raman. 443-458 [doi]
- CroMA: Cross-Modal Attention for Visual Question Answering in Robotic SurgeryGreetta Antonio, Jobin Jose, Sudhish N. George, Kiran B. Raja. 459-471 [doi]