- DepthFisheye: Efficient Fine-Tuning of Depth Estimation Models for Fisheye Cameras. Wenbin Wu, Zhiwei Zhang, Xin Tan 0002, Zhizhong Zhang 0001, Lizhuang Ma. 3-18
- DIMATrack: Dimension Aware Data Association for Multi-Object Tracking. Shu Liu 0002, Melikamu Liyih Sinishaw, Luo Zheng. 19-36
- Efficient Transformer Network for Visible and Ultraviolet Object Tracking. Qinghua Song, Xiaolei Wang. 37-51
- LightGR-Transformer: Light Grouped Residual Transformer for Multispectral Object Detection. Mingming Li, Fei Wu 0004, Yinjie Wang. 52-71
- ADMMOA: Attribute-Driven Multimodal Optimization for Face Recognition Adversarial Attacks. Ruizhong Du, Luman Zhao, Mingyue Li, Yidan Li, Shenyu Li, Caixia Ma. 72-88
- Training-Free Language-Guided Video Summarization via Multi-Grained Saliency Scoring. Wei Ge, Yongwei Nie, Fei Ma 0006, Keke Tang, Fei Richard Yu, Hongmin Cai, Ping Li 0016. 89-104
- Reinforced Label Denoising for Weakly-Supervised Audio-Visual Video Parsing. Yongbiao Gao, Xiangcheng Sun, Guohua Lv, Deng Yu, Sijiu Niu. 107-124
- Bridging the Modality Gap: Advancing Multimodal Human Pose Estimation with Modality-Adaptive Pose Estimator and Novel Benchmark Datasets. Jiangnan Xia, Zhiyuan Zhang 0004, Yanyin Guo, Qilong Wu, Yi Li, Jianghan Cheng, Junwei Li. 125-153
- Momentum-Based Uni-modal Soft-Label Alignment and Multi-modal Latent Projection Networks for Optimizing Image-Text Retrieval. Xiaole Zhu, Zongtao Duan, Junchen Huang, Xing-sheng. 154-176
- Multi-granularity and Multi-modal Prompt Learning for Person Re-Identification. Hao Tong, Jiawei Liu 0001, Yong Wu, Guozhi Zhao, Fanrui Zhang, Zheng-Jun Zha. 177-200
- Local and Global Feature Cross-Attention Multimodal Place Recognition. Lu Xu, Shuaixin Li, Xin Zhou, Xiaozhou Zhu, Wen Yao. 201-220
- IML-CMM - A Multimodal Sentiment Analysis Framework Integrating Intra-modal Learning and Cross-Modal Mixup Enhancement. Zheng Zhang, Ruiqing Yang, Chuanlei Zhang. 221-243
- MCFG with GUMAP: A Simple and Effective Clustering Framework on Grassmann Manifold. Benchao Li, Yun Zou, Ruisheng Ran. 247-265
- Joint UMAP for Visualization of Time-Dependent Data. Yun Zou, Benchao Li, Ruisheng Ran. 266-288
- Unsupervised Domain Adaptation on Point Cloud Classification via Imposing Structural Manifolds into Representation Space. Hongchao Zhong, Li Yu 0004, Longkun Zou, Ke Chen 0004. 289-307
- Learning Adaptive Basis Fonts to Fuse Content Features for Few-Shot Font Generation. Keyang Lin, Zhijun Fang, Sicong Zang, Hang Wu. 311-332
- TaiCrowd: A High-Performance Simulation Framework for Massive Crowd. Xiaoyu Guan, Yihao Li, Tianyu Huang. 333-350
- Feature Disentanglement and Fusion Model for Multi-source Domain Adaptation with Domain-Specific Features. Chengrong Yang, Qiwen Jin, Xiaoguo Zhang, Yujue Zhou. 351-372
- A Trademark Retrieval Method Based on Self-supervised Learning. Kailang Hu, Yixiao Lu, Huibing Li, Xuan Song. 373-398
- Weaken Noisy Feature: Boosting Semi-supervised Learning by Noise Estimation. Junjiang Liu, Dandan Sun, Hailun Xia, Jiangtao Bai, Xinyue Fan. 399-418
- Multi-dimension Full Scene Integrated Visual Emotion Analysis Network. Weiye Peng, Shenghua Zhong. 419-434
- Gap-KD: Bridging the Significant Capacity Gap Between Teacher and Student Model. Shan Huang, Wenhua Qian. 435-453