0 | -- | 0 | Jie Nie, Lei Huang 0010, Chengyu Zheng, Xiaowei Lv, Rui Wang. Cross-scale Graph Interaction Network for Semantic Segmentation of Remote Sensing Images |
0 | -- | 0 | Xiaohan Lan, Yitian Yuan, Xin Wang 0019, Long Chen 0016, Zhi Wang 0001, Lin Ma 0002, Wenwu Zhu 0001. A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach |
0 | -- | 0 | Xiaolong Liu, Yang Yu, Xiaolong Li 0001, Yao Zhao 0001, Guodong Guo. TCSD: Triple Complementary Streams Detector for Comprehensive Deepfake Detection |
0 | -- | 0 | Zheming Xu, Lili Wei, Congyan Lang, Songhe Feng, Tao Wang 0011, Adrian G. Bors, Hongzhe Liu. SSR-Net: A Spatial Structural Relation Network for Vehicle Re-identification |
0 | -- | 0 | Wu Liu, Hailin Shi, Yunchao Wei, Dan Zeng 0001, Nicu Sebe, Jiebo Luo. Introduction to the Special Issue on Trustworthy Multimedia Computing and Applications in Urban Scenes |
0 | -- | 0 | Ruoyu Chen, Jingzhi Li, Hua Zhang 0008, Changchong Sheng, Li Liu 0002, Xiaochun Cao. Sim2Word: Explaining Similarity with Representative Attribute Words via Counterfactual Explanations |
0 | -- | 0 | Xingyu Gao, Jinyang Xie, Zhenyu Chen 0003, An-An Liu, Zhenan Sun, Lei Lyu. Dilated Convolution-based Feature Refinement Network for Crowd Localization |
0 | -- | 0 | Hao Li, Jinwei Wang, Neal Xiong 0001, Yi Zhang 0026, Athanasios V. Vasilakos, Xiangyang Luo. A Siamese Inverted Residuals Network Image Steganalysis Scheme based on Deep Learning |
0 | -- | 0 | Zhuming Wang, Yaowen Xu, Lifang Wu, Hu Han 0001, Yukun Ma, Zun Li. Improving Face Anti-spoofing via Advanced Multi-perspective Feature Learning |
0 | -- | 0 | Weigang Zhang, Zhaobo Qi, Shuhui Wang, Chi Su, Li Su 0003, Qingming Huang. Temporal Dynamic Concept Modeling Network for Explainable Video Event Recognition |
1 | -- | 23 | Bingzheng Liu, Jianjun Lei, Bo Peng 0007, Chuanbo Yu, Wanqing Li 0001, Nam Ling. Novel View Synthesis from a Single Unposed Image via Unsupervised Learning |
1 | -- | 20 | Bo Li, Yong Zhang, Chengyang Zhang, Xinglin Piao, Baocai Yin. Hypergraph Association Weakly Supervised Crowd Counting |
1 | -- | 21 | Boqiang Xu, Jian Liang 0001, Lingxiao He, Jinlin Wu, Chao Fan, Zhenan Sun. Color-Unrelated Head-Shoulder Networks for Fine-Grained Person Re-identification |
1 | -- | 21 | Rui Li 0059, Baopeng Zhang, Wei Liu, Zhu Teng, Jianping Fan 0007. PANet: An End-to-end Network Based on Relative Motion for Online Multi-object Tracking |
1 | -- | 21 | Zhen Chen, Ming Yang 0007, Shiliang Zhang. Complementary Coarse-to-Fine Matching for Video Object Segmentation |
1 | -- | 15 | Jin Xie 0005, Yanwei Pang, Jing Pan, Jing Nie 0001, Jiale Cao, Jungong Han. Complementary Feature Pyramid Network for Object Detection |
1 | -- | 19 | Kun Li 0008, Jiaxiu Li, Dan Guo 0001, Xun Yang 0001, Meng Wang 0001. Transformer-Based Visual Grounding with Cross-Modality Interaction |
1 | -- | 23 | Yikun Xu, Xingxing Wei, Pengwen Dai, Xiaochun Cao. 2SC: Adversarial Attacks on Subspace Clustering |
1 | -- | 18 | Cong Huang, Xiulian Peng, Dong Liu 0002, Yan Lu 0001. Text Image Super-Resolution Guided by Text Structure and Embedding Priors |
1 | -- | 23 | Puneet Kumar 0003, Gaurav Bhatt, Omkar Ingle, Daksh Goyal, Balasubramanian Raman. Affective Feedback Synthesis Towards Multimodal Text and Image Data |
1 | -- | 21 | Patrick P. K. Chan, Xiaoman Hu, Haorui Song, Peng Peng 0005, Keke Chen. Learning Disentangled Features for Person Re-identification under Clothes Changing |
1 | -- | 20 | Federico Becattini, Pietro Bongini, Luana Bulla, Alberto Del Bimbo, Ludovica Marinucci, Misael Mongiovì, Valentina Presutti. VISCOUNTH: A Large-scale Multilingual Visual Question Answering Dataset for Cultural Heritage |
1 | -- | 20 | Meng Wang 0017, Jizheng Xu, Li Zhang 0006, Junru Li, Kai Zhang 0007, Shiqi Wang 0001, Siwei Ma. Compressed Screen Content Image Super Resolution |
1 | -- | 20 | Geyu Tang, Xingyu Gao 0001, Zhenyu Chen. Learning Semantic Representation on Visual Attribute Graph for Person Re-identification and Beyond |
1 | -- | 18 | Zijun Deng, Xiangteng He, Yuxin Peng. LFR-GAN: Local Feature Refinement based Generative Adversarial Network for Text-to-Image Generation |
1 | -- | 19 | Jiayuan Xie, Jiali Chen, Yi Cai 0001, Qingbao Huang, Qing Li 0001. Visual Paraphrase Generation with Key Information Retained |
1 | -- | 24 | Zhenjun Tang, Zhiyuan Chen, Zhixin Li 0001, Bineng Zhong, Xianquan Zhang, Xinpeng Zhang. Unifying Dual-Attention and Siamese Transformer Network for Full-Reference Image Quality Assessment |
1 | -- | 22 | Hongguang Zhu, Yunchao Wei, Yao Zhao 0001, Chunjie Zhang, Shujuan Huang. AMC: Adaptive Multi-expert Collaborative Network for Text-guided Image Retrieval |
1 | -- | 20 | Xiumei Chen, Xiangtao Zheng, Xiaoqiang Lu. Identity Feature Disentanglement for Visible-Infrared Person Re-Identification |
1 | -- | 22 | Kankanala Srinivas, Ashish Kumar Bhandari. Context-Based Novel Histogram Bin Stretching Algorithm for Automatic Contrast Enhancement |
1 | -- | 25 | Tomaso Fontanini, Luca Donati, Massimo Bertozzi, Andrea Prati 0001. Unsupervised Discovery and Manipulation of Continuous Disentangled Factors of Variation |
1 | -- | 20 | Yichun Tai, Hailin Shi, Dan Zeng 0001, Hang Du, Yibo Hu 0003, Zicheng Zhang, Zhijiang Zhang, Tao Mei 0001. Multi-Agent Semi-Siamese Training for Long-Tail and Shallow Face Learning |
1 | -- | 19 | Zhenyu Shu, Ling Gao, Shun-yi, Fangyu Wu, Xin Ding, Ting Wan, Shiqing Xin. Context-Aware 3D Points of Interest Detection via Spatial Attention Mechanism |
1 | -- | 17 | Jie Zhu, Bo Peng 0007, Wanqing Li 0001, Haifeng Shen, Qingming Huang, Jianjun Lei. Modeling Long-range Dependencies and Epipolar Geometry for Multi-view Stereo |
1 | -- | 25 | Rongfei Zeng, Mai Su, Ruiyun Yu, Xingwei Wang 0001. 2 : Fine-grained 3D Mesh Reconstruction with Twice Chamfer Distance |
1 | -- | 20 | Tianyi Wang 0006, Harry Cheng 0002, Kam-Pui Chow, Liqiang Nie. Deep Convolutional Pooling Transformer for Deepfake Detection |
1 | -- | 21 | Wei-Yen Hsu, Pei-Wen Jian. Recurrent Multi-scale Approximation-Guided Network for Single Image Super-Resolution |
1 | -- | 19 | Yongchao Du, Min Wang 0019, Zhenbo Lu, Wengang Zhou 0001, Houqiang Li. Weakly Supervised Hashing with Reconstructive Cross-modal Attention |
1 | -- | 23 | Mingliang Zhou, Hongyue Leng, Bin Fang 0001, Tao Xiang 0001, Xuekai Wei, Weijia Jia 0001. Low-light Image Enhancement via a Frequency-based Model with Structure and Texture Decomposition |
1 | -- | 21 | Tian-Zi Niu, Shan-Shan Dong, Zhen-Duo Chen 0001, Xin Luo 0006, Shanqing Guo, Zi Huang, Xin-Shun Xu. Semantic Enhanced Video Captioning with Multi-feature Fusion |
1 | -- | 23 | Ye Yuan, Jiawan Zhang. Shot Boundary Detection Using Color Clustering and Attention Mechanism |
1 | -- | 28 | Xianhua Zeng, Saiyuan Chen, Yicai Xie, Tianxing Liao. 3V3D: Three-View Contextual Cross-slice Difference Three-dimensional Medical Image Segmentation Adversarial Network |