Journal: Vis. Intell.

Volume 2, Issue 1

0 -- 0Gang Li, Xiang Li, Shanshan Zhang, Jian Yang. Towards more reliable evaluation in pedestrian detection by rethinking "ignore regions"
0 -- 0Rui Qian, Weiyao Lin, John See, Dian Li. Controllable augmentations for video representation learning
0 -- 0Zihao Chen, Kunhong Li 0001, Haoran Li 0009, Zhiheng Fu, Hanmo Zhang, Yulan Guo. Metric localization for lunar rovers via cross-view image matching
0 -- 0Yizhou Wang, Longguang Wang, Qingyong Hu, Yan Liu 0043, Ye Zhang, Yulan Guo. Panoptic segmentation of 3D point clouds with Gaussian mixture model in outdoor scenes
0 -- 0Wenqing Zhao, Lijiao Xu. Weakly supervised target detection based on spatial attention
0 -- 0Yaonan Wang. In Memoriam: Professor Edwin R. Hancock
0 -- 0Xingyu Xie, Jianlong Wu, Guangcan Liu, Zhouchen Lin. SSCNet: learning-based subspace clustering
0 -- 0Bin Fan 0002, Yuchao Dai, Yongduek Seo, Mingyi He. A revisit of the normalized eight-point algorithm and a self-supervised deep solution
0 -- 0Jia-Mu Sun, Tong Wu 0009, Lin Gao 0004. Recent advances in implicit representation-based 3D shape generation
0 -- 0Zihao Jia, Shengkun Sun, Guangcan Liu, Bo Liu. MSSD: multi-scale self-distillation for object detection
0 -- 0Lichun Tang, Zhaoxia Yin, Hang Su, Wanli Lyu, Bin Luo 0001. WFSS: weighted fusion of spectral transformer and spatial self-attention for robust hyperspectral image classification against adversarial attacks
0 -- 0Li Fang, Qian Wang, Long Ye. GLGNet: light field angular superresolution with arbitrary interpolation rates
13 -- 0Huaizhou Lin, Dan Cai, Zengmin Xu, Jinsong Wu, Lixian Sun, Haibin Jia. Fabric4show: real-time vision system for fabric defect detection and post-processing
14 -- 0Yuliang Sun, Xudong Zhang 0003, Yongwei Miao. A review of point cloud segmentation for understanding 3D indoor scenes
15 -- 0Zhiqiang Yan, Yupeng Zheng, Deng-Ping Fan, Xiang Li 0041, Jun Li 0027, Jian Yang 0003. Learnable differencing center for nighttime depth perception
16 -- 0Chang Liu 0072, Xudong Jiang 0001, Henghui Ding. PrimitiveNet: decomposing the global constraints for referring segmentation
17 -- 0Yao Jiang 0002, Xinyu Yan, Ge-Peng Ji, Keren Fu, Meijun Sun, Huan Xiong, Deng-Ping Fan, Fahad Shahbaz Khan. Effectiveness assessment of recent large vision-language models
18 -- 0Zelong Zeng, Fan Yang, Hong Liu 0009, Shin'ichi Satoh 0001. Improving deep metric learning via self-distillation and online batch diffusion process
19 -- 0Qingjie Zeng, Yutong Xie, Zilin Lu, Yong Xia 0001. A human-in-the-loop method for pulmonary nodule detection in CT scans
20 -- 0Lei Cao, Zirui Shen, Sheng Xu 0003. Efficient forest fire detection based on an improved YOLO model
21 -- 0Siran Peng, Xiangyu Zhu, Dong Yi, Chen Qian 0006, Zhen Lei 0001. Formulating facial mesh tracking as a differentiable optimization problem: a backpropagation-based solution
22 -- 0Fan Yu, Yaqun Fang, Zhixiang Zhao, Jia Bei, Tongwei Ren, Gangshan Wu. CAGNet: a context-aware graph neural network for detecting social relationships in videos
23 -- 0Junpei Liao, Liang Yi, Wenxin Shi, Wenyuan Yang, Yanmei Fang, Xin Yang. Imperceptible backdoor watermarks for speech recognition model copyright protection
24 -- 0Yichao Yan, Zanwei Zhou, Zi Wang, Jingnan Gao, Xiaokang Yang. DialogueNeRF: towards realistic avatar face-to-face conversation video generation
25 -- 0Kaiwen Guo, Chaoyang Zhao, Jinqiao Wang. A fast mask synthesis method for face recognition
26 -- 0Zonglin Li, Xiaoqian Lv, Wei Yu 0004, Qinglin Liu, Jingbo Lin, Shengping Zhang. Face shape transfer via semantic warping
27 -- 0Megani Rajendran, Chek Tien Tan, Indriyati Atmosukarto, Aik Beng Ng, Simon See. Review on synergizing the Metaverse and AI-driven synthetic data: enhancing virtual realms and activity recognition in computer vision
28 -- 0Fangyi Liu, Mang Ye, Bo Du 0001. Learning a generalizable re-identification model from unlabelled data with domain-agnostic expert
29 -- 0Yong Li 0032, Menglin Liu, Lingjie Lao, Yuanzhi Wang, Zhen Cui 0001. Counterfactual discriminative micro-expression recognition
30 -- 0Xiyao Liu 0001, Jiaxin Hu, Qingying Yang, Ming Jiang, Jianbiao He, Hui Fang 0003. A divide-and-conquer reconstruction method for defending against adversarial example attacks
31 -- 0Yuehao Song, Xinggang Wang, Jingfeng Yao, Wenyu Liu 0001, Jinglin Zhang, Xiangmin Xu. ViTGaze: gaze following with interaction features in vision transformers
32 -- 0Zhangwei Gao, Zhe Chen 0017, Erfei Cui, Yiming Ren, Weiyun Wang, Jinguo Zhu, Hao Tian 0006, Shenglong Ye, Junjun He, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao 0001, Jifeng Dai, Wenhai Wang. Mini-InternVL: a flexible-transfer pocket multi-modal model with 5% parameters and 90% performance
33 -- 0Dehong Kong, Siyuan Liang, Xiaopeng Zhu, Yuansheng Zhong, Wenqi Ren. Patch is enough: naturalistic adversarial patch against vision-language pre-training models
34 -- 0Xiaoguang Tu, Zhi He, Yi Huang, Zhi-hao Zhang, Ming Yang, Jian Zhao 0006. An overview of large AI models and their applications
35 -- 0Chang Liu, Yongsheng Yuan, Xin Chen 0032, Huchuan Lu, Dong Wang 0004. Spatial-temporal initialization dilemma: towards realistic visual tracking
36 -- 0Wei Huang, Xingyu Zheng, Xudong Ma, Haotong Qin, Chengtao Lv, Hong Chen 0004, Jie Luo 0004, Xiaojuan Qi 0001, Xianglong Liu 0001, Michele Magno. An empirical study of LLaMA3 quantization: from LLMs to MLLMs
37 -- 0Xinyu Xie, Yawen Cui, Tao Tan, Xubin Zheng, Zitong Yu. FusionMamba: dynamic feature enhancement for multimodal image fusion with Mamba
38 -- 0Chi Zhang 0020, Meng Yuan, Xiaoning Ma, Yu Liu, Haoang Lu, Le Wang 0003, Yuanqi Su, Yuehu Liu. Unified regularity measures for sample-wise learning and generalization