- Global-to-Local Feature Mining Network for RGB-Infrared Person Re-Identification. Qiang Chen, Fuxiao He, Guoqiang Xiao 0001. 1-13 [doi]
- Semantic Transition Detection for Self-supervised Video Scene Segmentation. Lu Chen, Jiawei Tan, Pingan Yang, Hongxing Wang 0001. 14-27 [doi]
- Multi-task Collaborative Network for Image-Text Retrieval. Xueyang Qin, Lishuang Li, Jing Hao, Meiling Ge, Jiayi Huang, Guangyao Pang. 28-42 [doi]
- FGENet: Fine-Grained Extraction Network for Congested Crowd Counting. Hao-Yuan Ma, Li Zhang, Xiang-Yi Wei. 43-56 [doi]
- MSMV-UNet: A 2.5D Stroke Lesion Segmentation Method Based on Multi-slice Feature Fusion. Jingjing Xie, Jixuan Hong, Manjin Sheng, Chenhui Yang. 57-69 [doi]
- Non-Local Spatial-Wise and Global Channel-Wise Transformer for Efficient Image Super-Resolution. Xiang Gao, Sining Wu, Fan Wang, Xiaopeng Hu. 70-85 [doi]
- MobileViT-FocR: MobileViT with Fixed-One-Centre Loss and Gradient Reversal for Generalised Fake Face Detection. Ting Peng, Yihang Zhou, Rong Sun, Yizhi Luo, Yuqi Li. 86-100 [doi]
- ASF-Conformer: Audio Scoring Conformer with FFC for Speaker Verification in Noisy Environments. Xiran Zhang, Haiyan Liu, Caixia Liu, Haiyang Zhang, Zhiwei Huo. 101-111 [doi]
- Prior-Knowledge-Free Video Frame Interpolation with Bidirectional Regularized Implicit Neural Representations. Yuanjian He, Weile Zhang, Junyuan Deng, Yulai Cong. 112-126 [doi]
- Two-Stage Reasoning Network with Modality Decomposition for Text VQA. Shengrong Ling, Sisi You, Bing-Kun Bao. 127-140 [doi]
- Localization and Local Motion Magnification of Pulsatile Regions in Endoscopic Surgery Videos. Honglei Zheng, Wenkang Fan, Yinran Chen, Xiongbiao Luo. 141-154 [doi]
- Co-speech Gesture Generation with Variational Auto Encoder. Shinichi Ka, Koichi Shinoda. 155-168 [doi]
- Differentiable Neural Architecture Search Based on Efficient Architecture for Lightweight Image Super-Resolution. Chunyin Sheng, Xiang Gao, Xiaopeng Hu, Fan Wang. 169-183 [doi]
- Learning Collaborative Reinforcement Attention for 3D Face Reconstruction and Dense Alignment. Zhengwei Yang, Yange Wang, Lei Ma, Xiangzheng Li. 184-197 [doi]
- Exploring Multi-modal Fusion for Image Manipulation Detection and Localization. Konstantinos Triaridis, Vasileios Mezaris. 198-211 [doi]
- Appearance-Motion Dual-Stream Heterogeneous Network for VideoQA. Feifei Xu, Zheng Zhong, Yitao Zhu, Yingchen Zhou, Guangzhen Li. 212-227 [doi]
- Adaptive Token Selection and Fusion Network for Multimodal Sentiment Analysis. Xiang Li, Ming Lu, Ziming Guo, Xiaoming Zhang. 228-241 [doi]
- r Color Space. Pei Chen, Zhiyong Feng 0002, Meng-xing, Yiming Zhang, Jinqing Zheng. 242-256 [doi]
- Fractional-Order Image Moments and Applications. Liyun Xu, Min Zhang. 257-269 [doi]
- Time-Quality Tradeoff of MuseHash Query Processing Performance. Maria Pegia, Ferran Agullo Lopez, Anastasia Moumtzidou, Alberto Gutierrez-Torre, Björn Þór Jónsson 0001, Josep Lluis Berral-Garcia, Ilias Gialampoukidis, Stefanos Vrochidis, Ioannis Kompatsiaris. 270-283 [doi]
- Dual-Fisheye Image Stitching via Unsupervised Deep Learning. Zhanjie Jin, Anming Dong, Jiguo Yu, Shuxiang Dong, You Zhou 0006. 284-298 [doi]
- CA-GAN: Conditional Adaptive Generative Adversarial Network for Text-to-Image Synthesis. Junpeng Liu, Hengkang Bao. 299-312 [doi]
- RDC-YOLOv5: Improved Safety Helmet Detection in Adverse Weather. Dexu Yao, Aimin Li, Deqi Liu, Mengfan Cheng. 313-326 [doi]
- Sustainable Commercial Fishery Control Using Multimedia Forensics Data from Non-trusted, Mobile Edge Nodes. Aril Bernhard Ovesen, Tor-Arne Schmidt Nordmo, Michael Alexander Riegler, Pål Halvorsen, Dag Johansen. 327-340 [doi]
- MC-TCMNER: A Multi-modal Fusion Model Combining Contrast Learning Method for Traditional Chinese Medicine NER. Shan Cao, Qingfeng Wu. 341-354 [doi]
- C3-PO: A Convolutional Neural Network for COVID Onset Prediction from Cough Sounds. Xiangyu Chen, Md Ayshik Rahman Khan, Md. Rakibul Hasan 0001, Tom Gedeon, Md. Zakir Hossain. 355-368 [doi]
- Pseudo-label Based Unsupervised Momentum Representation Learning for Multi-domain Image Retrieval. Mingyuan Ge, Jianan Shui, Junyu Chen, Mingyong Li. 369-380 [doi]
- DFGait: Decomposition Fusion Representation Learning for Multimodal Gait Recognition. Jianbo Xiong, Shinan Zou, Jin Tang 0001. 381-395 [doi]
- MoPE: Mixture of Pooling Experts Framework for Image-Text Retrieval. Jiangfeng Li, Bowen Wang, Yongrui Qin, Chenxi Zhang, Gang Yu, Qinpei Zhao. 396-409 [doi]
- Multi-modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation. Linzi Xing, Quan Hung Tran, Fabian Caba, Franck Dernoncourt, Seunghyun Yoon 0002, Zhaowen Wang, Trung Bui, Giuseppe Carenini. 410-424 [doi]
- Unsupervised Multi-collaborative Learning Network for 3D Face Reconstruction. Wenlong Lu, Suping Wu, Xitie Zhang, Shengjia Zhang. 425-436 [doi]
- A Region Based Non-overlapping Reference Speech Estimation Method for Speaker Extraction. Yiru Zhang, Zeke Li, Bijing Liu, Haiwei Fan, Yong Yang, Qun Yang. 437-447 [doi]
- Self-supervised Edge Structure Learning for Multi-view Stereo and Parallel Optimization. Pan Li, Suping Wu, Xitie Zhang, Yuxin Peng, Boyang Zhang, Bin Wang. 448-461 [doi]
- Prototype-Enhanced Hypergraph Learning for Heterogeneous Information Networks. Shuai Wang, Jiayi Shen, Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring. 462-476 [doi]
- A Language-Based Solution to Enable Metaverse Retrieval. Ali Abdari, Alex Falcon, Giuseppe Serra 0001. 477-488 [doi]
- Part-Aware Prompt Tuning for Weakly Supervised Referring Expression Grounding. Chenlin Zhao, Jiabo Ye, Yaguang Song, Ming Yan, Xiaoshan Yang, Changsheng Xu. 489-502 [doi]
- Adversarially Robust Deepfake Detection via Adversarial Feature Similarity Learning. Sarwar Khan, Jun-Cheng Chen, Wen-Hung Liao, Chu-Song Chen. 503-516 [doi]
- A Multidimensional Taxonomy Model for Music Tangible User Interfaces. Adriano Baratè, Luca Andrea Ludovico. 517-531 [doi]