- Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion Based Classification. Chengguo Yuan, Yu Jin, Zongzhen Wu, Fanting Wei, Yangzirui Wang, Lan Chen, Xiao Wang. 3-15
- Multi-scale Dilated Attention Graph Convolutional Network for Skeleton-Based Action Recognition. Yang Shu, Wanggen Li, Doudou Li, Kun Gao, Biao Jie. 16-28
- Auto-Learning-GCN: An Ingenious Framework for Skeleton-Based Action Recognition. Wentian Xin, Yi Liu, Ruyi Liu, Qiguang Miao, Cheng Shi, Chi-Man Pun. 29-42
- Skeleton-Based Action Recognition with Combined Part-Wise Topology Graph Convolutional Networks. Xiaowei Zhu, Qian Huang, Chang Li, Jingwen Cui, Yingying Chen. 43-59
- Segmenting Key Clues to Induce Human-Object Interaction Detection. Mingliang Xue, Siwei Wang, Bing Fu, Zhengyang Zhao, Tao Liu, Lingfeng Lai. 60-71
- Lightweight Multispectral Skeleton and Multi-stream Graph Attention Networks for Enhanced Action Prediction with Multiple Modalities. Teng Huang, Weiqing Kong, Jiaming Liang, Ziyu Ding, Hui Li, Xi Zhang. 72-83
- Spatio-Temporal Self-supervision for Few-Shot Action Recognition. Wanchuan Yu, Hanyu Guo, Yan Yan 0001, Jie Li, Hanzi Wang. 84-96
- A Fuzzy Error Based Fine-Tune Method for Spatio-Temporal Recognition Model. Jiulin Li, Mengyu Yang, Yang Liu, Gongli Xi, Lanshan Zhang, Ye Tian 0008. 97-108
- Temporal-Channel Topology Enhanced Network for Skeleton-Based Action Recognition. Jinzhao Luo, Lu Zhou, Guibo Zhu, Guojing Ge, Beiying Yang, Jinqiao Wang. 109-119
- HFGCN-Based Action Recognition System for Figure Skating. Ying Zhou, Yana Zhang, Aiqiu Wu. 120-130
- Image Priors Assisted Pre-training for Point Cloud Shape Analysis. Zhengyu Li, Yao Wu, Yanyun Qu. 133-145
- AMM-GAN: Attribute-Matching Memory for Person Text-to-Image Generation. Wei Yue. 146-158
- RecFormer: Recurrent Multi-modal Transformer with History-Aware Contrastive Learning for Visual Dialog. Liucun Lu, Jinghui Qin, Zequn Jie, Lin Ma 0002, Liang Lin, Xiaodan Liang. 159-171
- KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing. Jiancheng Huang, Yifan Liu, Jin Qin, Shifeng Chen. 172-184
- Enhancing Text-Image Person Retrieval Through Nuances Varied Sample. Jiaer Xia, Haozhe Yang, Yan Zhang, Pingyang Dai. 185-196
- Unsupervised Prototype Adapter for Vision-Language Models. Yi Zhang, Ce Zhang 0009, Xueting Hu, Zhihai He. 197-209
- Multimodal Causal Relations Enhanced CLIP for Image-to-Text Retrieval. Wenjun Feng, Dazhen Lin, Donglin Cao. 210-221
- Exploring Cross-Modal Inconsistency in Entities and Emotions for Multimodal Fake News Detection. Longzheng Wang, Chuang Zhang, Hongbo Xu, Yongxiu Xu, Siqi Wang. 222-234
- Deep Consistency Preserving Network for Unsupervised Cross-Modal Hashing. Mengluan Li, Yanqing Guo, Haiyan Fu, Yi Li, Hong Su. 235-246
- Learning Adapters for Text-Guided Portrait Stylization with Pretrained Diffusion Models. Mintu Yang, Xianxu Hou, Hao Li, LinLin Shen, Lixin Fan. 247-258
- EdgeFusion: Infrared and Visible Image Fusion Algorithm in Low Light. Zikun Song, Pinle Qin, Jianchao Zeng 0001, Shuangjiao Zhai, Rui Chai, Junyi Yan. 259-270
- An Efficient Momentum Framework for Face-Voice Association Learning. Yuanyuan Qiu, Zhenning Yu, Zhenguo Gao. 271-283
- Multi-modal Instance Refinement for Cross-Domain Action Recognition. Yuan Qing, Naixing Wu, Shaohua Wan 0001, Lixin Duan. 284-296
- Modality Interference Decoupling and Representation Alignment for Caricature-Visual Face Recognition. Yang Xu, Junyi Wu, Yan Yan, Xinsheng Du, Huiji Zhang, Jianqiang Zhao, Zhipeng Gao. 297-308
- Plugging Stylized Controls in Open-Stylized Image Captioning. Jie Wang, Yixiao Zheng, Ruoyi Du, Yiming Zhang, Kongming Liang, Zhanyu Ma. 309-320
- MGT: Modality-Guided Transformer for Infrared and Visible Image Fusion. Taoying Zhang, Hesong Li, Qiankun Liu, Xiaoyong Wang, Ying Fu. 321-332
- Multimodal Rumor Detection by Using Additive Angular Margin with Class-Aware Attention for Hard Samples. Chenyu Zhou, Xiuhong Li, Zhe Li, Fan Chen, Xiaofan Wang, Dan Yang, Bin Chen, Songlin Li. 333-344
- An Effective Dynamic Reweighting Method for Unbiased Scene Graph Generation. Lingfeng Hu, Si Liu, Hanzi Wang. 345-356
- Multi-modal Graph and Sequence Fusion Learning for Recommendation. Zejun Wang, Xinglong Wu, Hongwei Yang, Hui He, Yu Tai, Weizhe Zhang. 357-369
- Co-attention Guided Local-Global Feature Fusion for Aspect-Level Multimodal Sentiment Analysis. Guoyong Cai, Shunjie Wang, Guangrui Lv. 370-382
- Discovering Multimodal Hierarchical Structures with Graph Neural Networks for Multi-modal and Multi-hop Question Answering. Qing Zhang, Haocheng Lv, Jie Liu, Zhiyun Chen, Jianyong Duan, Mingying Xv, Hao Wang. 383-394
- Enhancing Recommender System with Multi-modal Knowledge Graph. Chengjie Sun, Weiwei Chen, Lei Lin 0001, Lili Shan. 395-407
- Location Attention Knowledge Embedding Model for Image-Text Matching. Guoqing Xu, Min Hu, Xiaohua Wang 0002, Jiaoyun Yang, Nan Li, Qingyu Zhang. 408-421
- Pedestrian Attribute Recognition Based on Multimodal Transformer. Dan Liu, Wei Song, Xiaobing Zhao. 422-433
- RGB-D Road Segmentation Based on Geometric Prior Information. Xinyi Wu, Xia Yuan, YanChao Cui, Chunxia Zhao. 434-445
- Contrastive Perturbation Network for Weakly Supervised Temporal Sentence Grounding. Tingting Han 0003, Yuanxin Lv, Zhou Yu 0001, Jun Yu 0002, Jianping Fan 0001, Liu Yuan. 446-460
- MLDF-Net: Metadata Based Multi-level Dynamic Fusion Network. Feng Li, Enguang Zuo, Chen Chen, Cheng Chen, Mingrui Ma, Yunling Wang, Xiaoyi Lv, Min Li. 461-473
- Efficient Adversarial Training with Membership Inference Resistance. Ran Yan, RuiYing Du, Kun He 0008, Jing Chen 0003. 474-486
- Enhancing Image Comprehension for Computer Science Visual Question Answering. Hongyu Wang, Pengpeng Qiang, Hongye Tan, Jingchang Hu. 487-498
- Cross-Modal Attentive Recalibration and Dynamic Fusion for Multispectral Pedestrian Detection. Wei Bao, Jingjing Hu, Meiyu Huang, Xueshuang Xiang. 499-510