Journal: Multimedia Syst.

Volume 29, Issue 6

3151 -- 3168Chhavi Dixit, Shashank Mouli Satapathy. A customizable framework for multimodal emotion recognition using ensemble of deep neural network models
3169 -- 3177Ce Zhang, Xiao Yao, Changfeng Shi, Min Gu. Kronecker-factored Approximate Curvature with adaptive learning rate for optimizing model-agnostic meta-learning
3179 -- 3191Humaira Shafiq, Ghulam Gilanie, Muhammad Sajid, Muhammad Ahsan. Dental radiology: a convolutional neural network-based approach to detect dental disorders from dental images in a real-time environment
3193 -- 3207Yuhan Huang, Jiacheng Lu, Nianzhe Chen, Hui Ding, Yuanyuan Shang. A deep learning image inpainting method based on stationary wavelet transform
3209 -- 3221Chuanwang Wen, Shucheng Huang. A LiDAR point cloud registration method combining linear feature extraction and TrICP algorithm
3223 -- 3243Baoying Zheng, Fang Liu 0002, Mohan Zhang, Tongqing Zhou, Shenglan Cui, Yunfan Ye, Yeting Guo. Image captioning for cultural artworks: a case study on ceramics
3245 -- 3258Huimin Qian, Wenyu Shen, Zhengqi Wang, Shuwei Xu. Hotspot defect detection for photovoltaic modules under complex backgrounds
3259 -- 3276Liyan Xiong, Zhida Li, Xiaohui Huang, Yijuan Zeng, Peng Huang. TFA-CNN: an efficient method for dealing with crowding and noise problems in crowd counting
3277 -- 3290Xiaohui Guan, Qiqi Shao, Yaguan Qian, Tengteng Yao, Bin Wang 0062. Adversarial training in logit space against tiny perturbations
3291 -- 3303Zekang Wang, Li Liu 0031, Huaxiang Zhang 0001, Dongmei Liu, Yu Song. Generative adversarial text-to-image generation with style image constraint
3305 -- 3328Mamta Gehlot, Rakesh Kumar Saxena, Geeta Chhabra Gandhi. "Tomato-Village": a dataset for end-to-end tomato disease detection in a real-world environment
3329 -- 3339Xin Wang, Ning He, Chen Hong, Fengxi Sun, Wenjing Han, Qi Wang. YOLO-ERF: lightweight object detector for UAV aerial images
3341 -- 3356Yongwei Gai, Jinglei Liu. Clustering by sparse orthogonal NMF and interpretable neural network
3357 -- 3367Mingju Shao, Guodong Wang. Class-agnostic counting with feature augmentation and similarity comparison
3369 -- 3384Ugur Berk Sahin, Fatih Kamisli. Image compression with learned lifting-based DWT and learned tree-based entropy models
3385 -- 3402Amal Bouatrous, Abdelkrim Meziane, Nadia Zenati, Chafiaâ Hamitouche. A new adaptive VR-based exergame for hand rehabilitation after stroke
3403 -- 3419V. Praveena, L. R. Sujithra, S. Karthik, Muthu Subash Kavitha. Bio-Inspired ensemble feature selection and deep auto-encoder approach for rapid diagnosis of breast cancer
3421 -- 3430Dayong Tian, Yiqin Cao, Yiwen Wei, Deyun Zhou. Narrowing the variance of variational cross-encoder for cross-modal hashing
3431 -- 3446Xin Zhang, Xiaotian Cao, Jun Wang, Lei Wan. G-UNeXt: a lightweight MLP-based network for reducing semantic gap in medical image segmentation
3447 -- 3466Jennil Thiyam, Sanasam Ranbir Singh, Prabin Kumar Bora. Integrated document segmentation and region identification: textual, equation and graphical
3467 -- 3480Tuo Li, Yahong Han. Improving transferable adversarial attack for vision transformers via global attention and local drop
3481 -- 3504Jakub Lokoc, Stelios Andreadis, Werner Bailer, Aaron Duane, Cathal Gurrin, Zhixin Ma, Nicola Messina, Thao-Nhu Nguyen, Ladislav Peska, Luca Rossetto, Loris Sauter, Konstantin Schall, Klaus Schoeffmann, Omar Shahbaz Khan, Florian Spiess, Lucia Vadicamo, Stefanos Vrochidis. Interactive video retrieval in the age of effective joint embedding deep models: lessons from the 11th VBS
3505 -- 3520Shijie Jia, Yan Cui, Xiaoyan Su, Zongzheng Liang. A social-aware video sharing solution using demand prediction of epidemic-based propagation in wireless networks
3521 -- 3530Quan Lin Gu, Sai Yang, TianXing Yu. Lite general network and MagFace CNN for micro-expression spotting in long videos
3531 -- 3547Li Han, Jinhai He, Feng Dou, Huiwen Ma, Xinyang Xie, Wanwen Yang. A viewpoint-guided prototype network for 3D shape classification
3549 -- 3557Zhiwei Ma, Guilin Yao. Deep portrait matting via double-grained segmentation
3559 -- 3577Nan Xie, Zhaojie Liu, Zhengxu Li, Wei Pang, Beier Lu. Student engagement detection in online environment using computer vision and multi-dimensional feature fusion
3579 -- 3597Fanqiang Kong, Jiahui Tang, Yunsong Li, Dan Li 0014, Kedi Hu. Dual-branch spectral-spatial feature extraction network for multispectral image compression
3599 -- 3608Jun Wu, Tianliang Zhu, Jiahui Zhu, Tianyi Li, Chunzhi Wang. Hierarchical multiples self-attention mechanism for multi-modal analysis
3609 -- 3623Yizhong Yang, Tingting Xia, Dajin Li, Zhang Zhang, Guangjun Xie. A multi-scale feature fusion spatial-channel attention model for background subtraction
3625 -- 3638Tao Hu, Xuyu Xiang, Jiaohua Qin, Yun Tan. Audio-text retrieval based on contrastive learning and collaborative attention mechanism
3639 -- 3653Kaisi Yang, Lianyu Zhao, Chenglin Wang. Workpiece tracking based on improved SiamFC++ and virtual dataset
3655 -- 3668Xinglin Pan, Mingxin Gan. Multi-behavior recommendation based on intent learning
3669 -- 3684Bin Liu, Siyan Fang. Multi-aggregation network based on non-separable lifting wavelet for single image deraining
3685 -- 3701Deeksha Gupta, Akashdeep Sharma. A two-stage attention augmented fully convolutional network-based dynamic video summarization
3703 -- 3720Honglin Li, Qinghua Huang. MAF-Net: multidimensional attention fusion network for multichannel speech separation
3721 -- 3744Reza Khodadadi, Gholamreza Ardeshir, Hadi Grailu. Compression of face images using meta-heuristic algorithms based on curvelet transform with variable bit allocation
3745 -- 3755Haiyan Zhang, Quan Wang, Guorui Feng. Artistic image adversarial attack via style perturbation
3757 -- 3770Xin Zheng, Xin He, Yimo Ren, Jinfa Wang, Junyang Yu. Owner named entity recognition in website based on multidimensional text guidance and space alignment co-attention
3771 -- 3780Jiangpeng Zheng, Fan Shi, Meng Zhao, Chen Jia, Congcong Wang. Learning intra-inter-modality complementary for brain tumor segmentation
3781 -- 3804Bowen Xin, Ning Xu 0003, Yingchen Zhai, Tingting Zhang, Zimu Lu, Jing Liu, Weizhi Nie, Xuanya Li, An-An Liu. A comprehensive survey on deep-learning-based visual captioning
3805 -- 3818Wei Xiong, Haoliang Liu, Siya Mi, Yu Zhang 0004. Asymmetric bi-encoder for image-text retrieval
3819 -- 3832Fengjun Xiao, Zhuxi Zhang, Ye Yao. CTNet: hybrid architecture based on CNN and transformer for image inpainting detection
3833 -- 3845Emrah Dönmez, Serhat Kiliçarslan, Cemil Közkurt, Aykut Diker, Fahrettin Burak Demir, Abdullah Elen. Identification of haploid and diploid maize seeds using hybrid transformer model
3847 -- 3861Na Ta, Haipeng Chen, Xianzhu Liu, Nuo Jin. LET-Net: locally enhanced transformer network for medical image segmentation
3863 -- 3876Haoliang Zhou, Shucheng Huang, Yuqiao Xu. Inceptr: micro-expression recognition integrating inception-CBAM and vision transformer
3877 -- 3890Noor Ahmed, Rozina, Ahmad Ali, Abdul Raziq. Images denoising for COVID-19 chest X-ray based on multi-scale parallel convolutional neural network
3891 -- 3901Jiacheng Chang, Lanyong Zhang, Zhuang Shao. View-target relation-guided unsupervised 2D image-based 3D model retrieval via transformer
3903 -- 3930Reza Khodadadi, Gholamreza Ardeshir, Hadi Grailu. Variable bit allocation method based on meta-heuristic algorithms for facial image compression
3931 -- 3949Hüseyin Yasar, Murat Ceylan. A novel study for automatic two-class COVID-19 diagnosis (between COVID-19 and Healthy, Pneumonia) on X-ray images using texture analysis and 2-D/3-D convolutional neural networks
3951 -- 3969Alison Reboud, Ismail Harrando, Pasquale Lisena, Raphaël Troncy. Stories of love and violence: zero-shot interesting events' classification for unsupervised TV series summarization
3971 -- 0Akash Tayal, Jivansha Gupta, Arun Solanki, Khyati Bisht, Anand Nayyar, Mehedi Masud. Correction to: DL‑CNN‑based approach with image processing techniques for diagnosis of retinal diseases
3973 -- 0Hwei Teeng Chong, Chen Kim Lim, Ahmad Rafi, Kian Lam Tan, Mazlin Mokhtar. Correction: Comprehensive systematic review on virtual reality for cultural heritage practices: coherent taxonomy and motivations

Volume 29, Issue 5

2455 -- 2467Ajay Sharma, Bhavana P. Shrivastava, Aayushi Priya. Multilevel progressive recursive dilated networks with correlation filter (MPRDNCF) for image super-resolution
2469 -- 2482Maosheng Zhong, Youde Chen, Hao Zhang, Hao Xiong, Zhixiang Wang. Multimodal-enhanced hierarchical attention network for video captioning
2483 -- 2494Yongzhen Ke, Yin Wang, Kai Wang, Fan Qin, Jing Guo, Shuai Yang. Image aesthetics assessment using composite features from transformer and CNN
2495 -- 2509Susmi Jacob, P. Vinod 0001, Arjun Subramanian, Varun G. Menon. Affect sensing from smartphones through touch and motion contexts
2511 -- 2526Yuqiang Li, Xinyi Shangguan, Chun Liu, Haochen Meng. I2I translation model based on CondConv and spectral domain realness measurement: BCS-StarGAN
2527 -- 2543Chuan Liu, Ying-Ying Tan, Tian-Tian Xia, Jiajing Zhang, Ming Zhu. Co-attention graph convolutional network for visual question answering
2545 -- 2562Zhenying Fang, Jianping Fan 0001, Jun Yu 0002. LPR: learning point-level temporal action localization through re-training
2563 -- 2573Aiping Yang, Yan Liu, Simeng Cheng, Jiale Cao, Zhong Ji, Yanwei Pang. Spatial attention-guided deformable fusion network for salient object detection
2575 -- 2589Xin Yang 0002, Xiangchen Wang, Xiaohui Ye, Tao Li 0011. VMSG: a video caption network based on multimodal semantic grouping and semantic attention
2591 -- 2601Weihao Gao, Yongjun Zhang, Wei Long, Zhongwei Cui. A deraining with detail-recovery network via context aggregation
2603 -- 2614Asha Rani, Pankaj Yadav, Yashaswi Verma. Early-stage autism diagnosis using action videos and contrastive feature learning
2615 -- 2631Yunfei Zheng, Meng Sun 0001, Xiaobing Wang, Tieyong Cao, Xiongwei Zhang, Lixing Xing, Zheng Fang. Self-distillation object segmentation via pyramid knowledge representation and transfer
2633 -- 2650Jian-Wei Zhang, Yifan Sun, Wei Chen. Pull and concentrate: improving unsupervised semantic segmentation adaptation with cross- and intra-domain consistencies
2651 -- 2668Longfeng Shen, Fenglan Qin, Hongying Zhu, Dengdi Sun, Hai Min. EGARNet: adjacent residual lightweight super-resolution network based on extended group-enhanced convolution
2669 -- 2687Mahsa Soleimani, Ali Nazari, Mohsen Ebrahimi Moghaddam. Deepfake detection of occluded images using a patch-based approach
2689 -- 2703Chaithanyadas K. V., G. R. Gnana King. Computer-aided diagnosis for early detection and staging of human pancreatic tumors using an optimized 3D CNN on computed tomography
2705 -- 2714Xiuxia Cai, Pin Zhang, Shuaibin Du. Imitation camouflage synthesis based on shallow neural network
2715 -- 2728Yan Li, Min Xia 0002, Dongmei Jiang. Cross-view adaptive graph attention network for dynamic facial expression recognition
2729 -- 2746Hongwei Zhao, Siquan Wu, Zhen Tian, Yidong Li, Yi Jin 0001, Shengchun Wang. Context-guided coarse-to-fine detection model for bird nest detection on high-speed railway catenary
2747 -- 2760Weiyi Wei, Jian Wang, Mengyu Xu, Futong Zhang. Multimodal heterogeneous graph convolutional network for image recommendation
2761 -- 2777Jiachang Li, Haitao Zhang, Huadong Ma. DRL-based transmission control for QoE guaranteed transmission efficiency optimization in tile-based panoramic video streaming
2779 -- 2790Si Chen 0002, Bolun Xu, Miaohui Zhang, Yan Yan 0001, Xia Du, Weiwei Zhuang, Yun Wu. HC-GCN: hierarchical contrastive graph convolutional network for unsupervised domain adaptation on person re-identification
2791 -- 2807Zhangyu Liu, Zhi Li, Guomei Wang, Youliang Tian, Long Zheng. Robust zero-watermarking algorithm for diffusion-weighted images based on multiscale feature fusion
2809 -- 2823Xianhua Duan, Chaoqiang Jin, Xin Shu. HCPSNet: heterogeneous cross-pseudo-supervision network with confidence evaluation for semi-supervised medical image segmentation
2825 -- 2839Guangtao Wang, Jun Li 0033, Zhijian Wu, Jianhua Xu, Jifeng Shen, Wankou Yang. EfficientFace: an efficient deep network with feature enhancement for accurate face detection
2841 -- 0. Editorial note for few-shot learning for intelligent multimedia systems
2843 -- 2851Xuewei Chao, Lixin Zhang. Few-shot imbalanced classification based on data augmentation
2853 -- 2863Shan Liu, Yichao Tang, Ying Tian, Hansong Su. Visual driving assistance system based on few-shot learning
2865 -- 2875Yue Yang, Zhuo Zhang, Wei Mao, Yang Li 0111, Chengang Lv. Radar target recognition based on few-shot learning
2877 -- 2886You Zhou, Changlin Chen, Shukun Ma. Few-shot ship classification based on metric learning
2887 -- 2898Changlin Chen, Xuewei Chao. Conversion of infrared ocean target images to visible images driven by energy information
2899 -- 2912Rajdeep Chatterjee, Ankita Chatterjee, SK Hafizul Islam, Muhammad Khurram Khan. An object detection-based few-shot learning approach for multimedia quality assessment
2913 -- 2922Xiaolei Li. Few-shot wind turbine blade damage early warning system based on sound signal fusion
2923 -- 2933Wei Ren, Li Zhou, Jie Chen. Unsupervised single image dehazing with generative adversarial network
2935 -- 2950Abdelkader Tayeb Herouala, Benameur Ziani, Chaker Abdelaziz Kerrache, Abdou El Karim Tahari, Nasreddine Lagraa, Spyridon Mastorakis. CaDaCa: a new caching strategy in NDN using data categorization
2951 -- 2959M. Poongodi, Mounir Hamdi, Huihui Wang 0001. Image and audio caps: automated captioning of background sounds and images using deep learning
2961 -- 2977Neha Sharma, Chinmay Chakraborty, Rajeev Kumar. Optimized multimedia data through computationally intelligent algorithms
2979 -- 2989Jiandong Lv, Xingang Wang, Cuiling Shao. TMIF: transformer-based multi-modal interactive fusion for automatic rumor detection
2991 -- 3000Wei Chen, Jing Nie. A MADDPG-based multi-agent antagonistic algorithm for sea battlefield confrontation
3001 -- 3013Zhengjian Li, Jingyi He, Tianlei Ni, Jiaming Huo. Numerical computation based few-shot learning for intelligent sea surface temperature prediction
3015 -- 0. Editorial note for trustworthy multimedia big data computing
3017 -- 3026Zijie Song, Zhenzhen Hu, Richang Hong. Efficient and self-adaptive rationale knowledge base for visual commonsense reasoning
3027 -- 3040Wenzhe Zhai, Qilei Li, Ying Zhou, Xuesong Li, Jinfeng Pan, Guofeng Zou, Mingliang Gao 0001. DA²Net: a dual attention-aware network for robust crowd counting
3041 -- 3054Na Ta 0009, Haipeng Chen 0002, Yingda Lyu, Taosuo Wu. BLE-Net: boundary learning and enhancement network for polyp segmentation
3055 -- 3067Dengyun Xu, Xuanjing Shen, Yongping Huang, Zenan Shi. RB-Net: integrating region and boundary features for image manipulation localization
3069 -- 3079Chunxiao Fan 0002, Zhenxing Wang, Jia Li, Shanshan Wang, Xiao Sun. Robust facial expression recognition with global-local joint representation learning
3081 -- 3093Jing Ge, Qianxiang Wang, Guangyu Gao. Hardest and semi-hard negative pairs mining for text-based person search with visual-textual attention
3095 -- 3103Yi Wang, Shixin Zheng, Xiao Sun 0003, Dan Guo, Junjie Lang. Micro-expression recognition with attention mechanism and region enhancement
3105 -- 3114Wenyi Hu, Xiao Wang, Zheng Wang, Xin Xu, Ruimin Hu. Dual-focus: person search from Coarse-Grained Focus to Fine-Grained Focus
3115 -- 3138Haoming Chen, Runyang Feng, Sifan Wu, Hao Xu, Fengcheng Zhou, Zhenguang Liu. 2D Human pose estimation: a survey
3139 -- 3150Jian Wang, Xiaoyu Du, Yu Cheng, Yunlian Sun, Jinhui Tang 0001. SI-Net: spatial interaction network for deepfake detection

Volume 29, Issue 4

1853 -- 1863Saifullah Tumrani, Wazir Ali, Rajesh Kumar 0014, Abdullah Aman Khan, Fayaz Ali Dharejo. View-aware attribute-guided network for vehicle re-identification
1865 -- 1895Palash Ray, Asish Bera, Debasis Giri, Debotosh Bhattacharjee. Style matching CAPTCHA: match neural transferred styles to thwart intelligent attacks
1897 -- 1915He Zhang, Lu Yin, Hanling Zhang. A review of micro-expression spotting: methods and challenges
1917 -- 1940Carlos Vilchis, Carmina Pérez-Guerrero, Mauricio Mendez-Ruiz, Miguel González-Mendoza 0001. A survey on the pipeline evolution of facial capture and tracking for digital humans
1941 -- 1954Kai Hu 0006, Junlan Jin, Chaowen Shen, Min Xia 0002, Liguo Weng. Attentional weighting strategy-based dynamic GCN for skeleton-based action recognition
1955 -- 1966Anqi Zheng, Shiqi Zheng, Cong Bai, Deng Chen. Triple-level relationship enhanced transformer for image captioning
1967 -- 1980Gang Wang, Shucheng Huang, Zhe Tao. Shallow multi-branch attention convolutional neural network for micro-expression recognition
1981 -- 1994Lei Yang, Yong Feng 0002, Mingliang Zhou, Xiancai Xiong, Yongheng Wang, Baohua Qiang. Multi-level network based on transformer encoder for fine-grained image-text matching
1995 -- 2007An-An Liu, Yuwei Zhang, Chenyu Zhang, Wenhui Li 0001, Bo Lv, Lei Lei, Xuanya Li. Prototype-based semantic consistency learning for unsupervised 2D image-based 3D shape retrieval
2009 -- 2035B. Bhaskar Reddy, M. Venkata Sudhakar, P. Rahul Reddy, P. Raghava Reddy. Ensemble deep honey architecture for COVID-19 prediction using CT scan and chest X-ray images
2037 -- 2048Yizhong Yang, Ce Hou, Haixia Huang, Zhang Zhang, Guangjun Xie. Cascaded deep residual learning network for single image dehazing
2049 -- 2057Elena Battini Sönmez, Sefer Memis, Berker Arslan, Okan Zafer Batur. The segmented UEC Food-100 dataset with benchmark experiment on food detection
2059 -- 2072Furong Ma, Guiyu Xia, Qingshan Liu 0001. Human pose transfer via shape-aware partial flow prediction network
2073 -- 2083Xin Xu, Gang Lv, Yining Sun, Yuxia Hu, Fudong Nian. Hierarchical cross-modal contextual attention network for visual grounding
2085 -- 2097Honghong Yang, Hongxi Liu, Yumei Zhang, Xiaojun Wu 0002. HSGNet: hierarchically stacked graph network with attention mechanism for 3D human pose estimation
2099 -- 2110Awais Ahmed, She Kun 0001, Junaid Ahmed, Shaukat Hayat, Abdullah Aman Khan. Multimodal image enhancement using convolutional sparse coding
2111 -- 2124Tarun Agrawal, Prakash Choudhary. COVID-SegNet: encoder-decoder-based architecture for COVID-19 lesion segmentation in chest X-ray
2125 -- 2135Kangkang Wei, Weiqi Luo 0001, Minglin Liu, Miaoxin Ye. Residual guided coordinate attention for selection channel aware image steganalysis
2137 -- 2152Jian Shi, Geng Sun 0001, Jinyu Zhang, Zhihui Wang, Haojie Li. Face attribute recognition via end-to-end weakly supervised regional location
2153 -- 2164Mengting Liu, Xinrui Li, Yongge Liu, Yahong Han. Weakly supervised anomaly detection with multi-level contextual modeling
2165 -- 2180Hafsa Ilyas, Ali Javed, Khalid Mahmood Malik, Aun Irtaza. E-Cap Net: an efficient-capsule network for shallow and deepfakes forgery detection
2181 -- 2191Yingyuan Zhao, Zhiyi Tan 0002, Bing-Kun Bao, Zhengzheng Tu. Centralized sub-critic based hierarchical-structured reinforcement learning for temporal sentence grounding
2193 -- 2203Zepeng Li, Wenchuan Cheng, Jiawei Zhou, Zhengyi An, Bin Hu 0001. Deep learning model with multi-feature fusion and label association for suicide detection
2205 -- 2216Jing Sun, Rui Yan, Bing Zhang, Bing Zhu, Fuming Sun. A cross-view geo-localization method guided by relation-aware global attention
2217 -- 2238Mei-Ting Su, Mei-Ling Chiang, Chia-Hsuan Tsai, Chi-Wei Lin, Rong-Xuan Liu, Yong-Ting Juang, Hsin-Hao Chen. An acupoint health care system with real-time acupoint localization and visualization in augmented reality
2239 -- 2252Tobias Mühling, Isabelle Späth, Joy Backhaus, Nathalie Milke, Sebastian Oberdörfer, Alexander Meining, Marc Erich Latoschik, Sarah Koenig. Virtual reality in medical emergencies training: benefits, perceived stress, and learning success
2253 -- 2262Shulin Cheng, Huimin Jiang, Wanyan Wang, Wei Jiang. Research on multi-context aware recommendation methods based on tensor factorization
2263 -- 2279Yulin Deng, Liju Yin, Xiaoning Gao, Hui Zhou, ZhenZhou Wang, Guofeng Zou. EA-EDNet: encapsulated attention encoder-decoder network for 3D reconstruction in low-light-level environment
2281 -- 2292Fangzheng Xu, Yu Bao, Bingye Li, Zhining Hou, Lekang Wang. Entropy minimization and domain adversarial training guided by label distribution similarity for domain adaptation
2293 -- 2322Khouloud Salameh, Farah El Akoum, Joe Tekli. Unsupervised knowledge representation of panoramic dental X-ray images using SVG image-and-object clustering
2323 -- 2335Dailiang Wei, Juanli Li, Bo Li, Xin Wang, Siyuan Chen, Xuewen Wang, Luyao Wang. A fast recognition method for coal gangue image processing
2337 -- 2349Suchi Jain, Geeta Sikka, Renu Dhir. An automatic cascaded approach for pancreas segmentation via an unsupervised localization using 3D CT volumes
2351 -- 2362Changshui Yang, Yan Liu, Qiang Liu, Riaz Ullah Khan, Bin Chen, Wenyong Wang. Dual semantic-aligned clustering for cross-domain person re-identification
2363 -- 2373Bolin Wang, Yuanyuan Sun, Yonghe Chu, Changrong Min, Zhihao Yang, Hongfei Lin. Local discriminative graph convolutional networks for text classification
2375 -- 2388Israr-Ur-Rehman, Muhammad Shehzad Hanif, Zulfiqar Ali 0002, Zahoor Jan, Cobbinah Bernard Mawuli, Waqar Ali. Empowering neural collaborative filtering with contextual features for multimedia recommendation
2389 -- 2398Zekun Yang, Yuta Nakashima, Haruo Takemura. Multi-modal humor segment prediction in video
2399 -- 2413Gaoming Yang, Anxing Wei, Xianjin Fang, Ji Zhang. FDS_2D: rethinking magnitude-phase features for DeepFake detection
2415 -- 2427Hong Lin, Xi Wang, Chun Liu, Dewei Peng. HRCutBlur Augment: effectively enhancing data diversity for image super-resolution
2429 -- 2437Hongbo Xing, Guanqun Zhou, Shusen Yuan, Youjun Jiang, Pinyong Geng, Yewen Cao, Yujun Li, Lei Chen. Micro-expression spotting network based on attention and one-dimensional convolutional sliding window
2439 -- 2454Hitesh D. Panchal, Hitesh B. Shah. Multiple forgery detection in digital video based on inconsistency in video quality assessment attributes

Volume 29, Issue 3

887 -- 895Shanqing Zhang, Yujie Chen, Yiheng Meng, Jianfeng Lu 0005, Li Li 0014, Rui Bai. A multi-level feature weight fusion model for salient object detection
897 -- 915Sara Akan, Songül Varli. Use of deep learning in soccer videos analysis: survey
917 -- 943Anjali Gautam. Recent advancements of deep learning in detecting breast cancer: a survey
945 -- 959Linfeng Liu, Tong Chen 0004, Haojie Liu, Shiliang Pu, Li Wang, Qiu Shen. 2C-Net: integrate image compression and classification via deep neural network
961 -- 979M. Kavitha. MDP-HML: an efficient detection method for multiple human disease using retinal fundus images based on hybrid learning techniques
981 -- 1000Shuying Zhang, Jing Zhang, Yizhou Wang, Li Zhuo. Short video fingerprint extraction: from audio-visual fingerprint fusion to multi-index hashing
1001 -- 1010Qingtian Zeng, Liangwei Niu, Shansong Wang, Weijian Ni. SEViT: a large-scale and fine-grained plant disease classification model based on transformer and attention convolution
1011 -- 1023Yanxue Wang, Shansong Wang, Weijian Ni, Qingtian Zeng. PAST-net: a swin transformer and path aggregation model for anthracnose instance segmentation
1025 -- 1041Deepak Dhillon, Rajlaxmi Chouhan. Edge-preserving image denoising using noise-enhanced patch-based non-local means
1043 -- 1056Jingdan Li, Yi Wang, Dexin Zhao. Layer-wise enhanced transformer with multi-modal fusion for image caption
1057 -- 1071Hao Sun, Xiaolin Qin, Xiaojing Liu. Image-text matching using multi-subspace joint representation
1073 -- 1087Wenying Wen, Rongxin Tu, Yushu Zhang, Yuming Fang, Yong Yang 0001. A multi-level approach with visual information for encrypted H.265/HEVC videos
1089 -- 1101Heling Cao, Lei Li, Yonghe Chu, Miaolei Deng, Panpan Wang, Chenyang Zhao. A coincidental correctness test case identification framework with fuzzy C-means clustering
1103 -- 1116Jasvinder Pal Singh, Uday Pratap Singh, Sanjeev Jain. Model-based person identification in multi-gait scenario using hybrid classifier
1117 -- 1130Thae Song Kim, Su Hyon Kim. An improved contrast enhancement for dark images with non-uniform illumination based on edge preservation
1131 -- 1144Fuming Sun, Tingting Zhao, Bing Zhu, Xu Jia, Fasheng Wang. Deblurring transformer tracking with conditional cross-attention
1145 -- 1159Xingjian Gu, Yongjie Zhu, Shougang Ren, Xiangbo Shu. BCMask: a finer leaf instance segmentation with bilayer convolution mask
1161 -- 1172Chenquan Gan, Xiaopeng Cao, Qingyi Zhu. Microblog sentiment analysis via user representative relationship under multi-interaction hybrid neural networks
1173 -- 1185Neetu Singla, Sushama Nagpal, Jyotsna Singh. A two-stage forgery detection and localization framework based on feature classification and similarity metric
1187 -- 1202Dong Xie 0005, Bin Wu, Fulong Chen 0002, Taochun Wang, Zebang Hu, Yibo Zhang. A low-overhead compressed sensing-driven multi-party secret image sharing scheme
1203 -- 1230Anusha Chhabra, Dinesh Kumar Vishwakarma. A literature survey on multimodal and multilingual automatic hate speech identification
1231 -- 1244Youyu Liu, Yi Li, Dezhang Xu, Qingyan Yang, Wanbao Tao. Adaptive Kalman Filter with power transformation for online multi-object tracking
1245 -- 1276Min-Jen Tsai, Hung-Yu Wu, Di-Ting Lin. Auto ROI & mask R-CNN model for QR code beautification (ARM-QR)
1277 -- 1290Adithya Sineesh, Mahesh Raveendranatha Panicker. Edge preserved universal pooling: novel strategies for pooling in convolutional neural networks
1291 -- 1300Fangmei Chen, Yuying Wang, Sheng Xu, Fasheng Wang, Fuming Sun, Xu Jia. Style transfer network for complex multi-stroke text
1301 -- 1314Xin Chao, Zhenjie Hou, Yujian Mo, Haiyong Shi, Wenjing Yao. Structural feature representation and fusion of human spatial cooperative motion for action recognition
1315 -- 1334Joel Dickson, Arul Linsely, R. J. Alice Nineta. An integrated 3D-sparse deep belief network with enriched seagull optimization algorithm for liver segmentation
1335 -- 1345Sicheng Zhang, Jin Liu, Bo Hu, Zhendong Mao. GH-DDM: the generalized hybrid denoising diffusion model for medical image generation
1347 -- 1360Huanjie Tao, Minghao Lu, Zhenwu Hu, Jianfeng An. A gated multi-hierarchical feature fusion network for recognizing steel plate surface defects
1361 -- 1376Aashania Antil, Chhavi Dhiman. A two stream face anti-spoofing framework using multi-level deep features and ELBP features
1377 -- 1389N. Venugopal. SCMACDnet: multilevel fusion-based deep twin capsule network for change detection
1391 -- 1403Jiajun Ding, Beili Liu, Jun Yu 0002, Huanlei Guo, Ming Shen, Kenong Shen. An efficient multi-path structure with staged connection and multi-scale mechanism for text-to-image synthesis
1405 -- 1416Wei Li, Xiwei Yang, Zhixin Li. MLCB-Net: a multi-level class balancing network for domain adaptive semantic segmentation
1417 -- 1429Yuzhe He, Ning He, Haigang Yu, Ren Zhang, Kang Yan. From macro to micro: rethinking multi-scale pedestrian detection
1431 -- 1451Ehsan Jafari, Ardeshir Dolati, Kamran Layeghi. Object tracking using fuzzy-based improved graph, interesting patches and multi-label MRF optimization
1453 -- 1462Zhenfeng Zhang, Chuhua Huang, RenJing Huang, Yanan Li, Yifan Chen. Illu-NASNet: unsupervised illumination estimation based on dense spatio-temporal smoothness
1463 -- 1479Shisong Huang, Danyang Li, Zhuhong Zhang, Yating Wu, Yumei Tang, Xing Chen, Yiqing Wu. CSLSEP: an ensemble pruning algorithm based on clustering soft label and sorting for facial expression recognition
1481 -- 1498Pengqing Li, Hongjuan Zhang, Yansong Chen. Structural local sparse and low-rank tracker using deep features
1499 -- 1512Lei Li 0043, Tingting Liu, Chengyu Wang 0001, Minghui Qiu, Cen Chen, Ming Gao 0001, Aoying Zhou. Resizing codebook of vector quantization without retraining
1513 -- 1526Seyma Derdiyok, Fatma Patlar Akbulut. Biosignal based emotion-oriented video summarization
1527 -- 1577Deepika Sharma, Arvind Selwal. A survey on face presentation attack detection mechanisms: hitherto and future perspectives
1579 -- 1592Leyuan Liu, Yunqi Gao, Jianchi Sun, Jingying Chen. Single-image clothed 3D human reconstruction guided by a well-aligned parametric body model
1593 -- 1601Xin Shu, Jia Li, Liang Shi, Shucheng Huang. RES-CapsNet: an improved capsule network for micro-expression recognition
1603 -- 1627Ercan Gürsoy, Yasin Kaya. An overview of deep learning techniques for COVID-19 detection: methods, challenges, and future works
1629 -- 1650A. Mary Dayana, W. R. Sam Emmanuel, C. Harriet Linda. Feature fusion and optimization integrated refined deep residual network for diabetic retinopathy severity classification using fundus image
1651 -- 1661Birkan Buyukarikan, Erkan Ülker. Convolutional neural network-based apple images classification and image quality measurement by light colors using the color-balancing approach
1663 -- 1664Nawab Muhammad Faseeh Qureshi, Varun G. Menon, Ali Kashif Bashir, Shahid Mumtaz, Irfan Mehmood. Role of deep learning models and analytics in industrial multimedia environment
1665 -- 1681Tiago do Carmo Nogueira, Cássio Dener Noronha Vinhal, Gélson da Cruz Júnior, Matheus Rudolfo Diedrich Ullmann, Thyago Carvalho Marques. A reference-based model using deep learning for image captioning
1683 -- 1697Ahmed Barnawi, Prateek Chhikara, Rajkumar Tekchandani, Neeraj Kumar 0001, Mehrez Boulares. A CNN-based scheme for COVID-19 detection with emergency services provisions using an optimal path planning
1699 -- 1715Faria Nazir, Muhammad Nadeem Majeed, Mustansar Ali Ghazanfar, Muazzam Maqsood. A computer-aided speech analytics approach for pronunciation feedback using deep feature clustering
1717 -- 1727Linbo Wang, Li Tan, Xianyong Fang, Yanwen Guo 0001, Shaohua Wan 0001. Adaptively feature matching via joint transformational-spatial clustering
1729 -- 1738Loveleen Gaur, Ujwal Bhatia, N. Z. Jhanjhi, Ghulam Muhammad, Mehedi Masud. Medical image-based detection of COVID-19 using Deep Convolution Neural Networks
1739 -- 1749Asma Kausar, Imran Razzak, Mohd Ibrahim Shapiai, Amin Beheshti. 3D shallow deep neural network for fast and precise segmentation of left atrium
1751 -- 1770Jimmy Ming-Tai Wu, Zhongcui Li, Norbert Herencsar, Bay Vo, Jerry Chun-Wei Lin. A graph-based CNN-LSTM stock price prediction algorithm with leading indicators
1771 -- 1783Gengsheng Xie, Xianbin Wen, Liming Yuan, Jianchen Wang, Changlun Guo, Yansong Jia, Minghao Li. Pose-guided feature region-based fusion network for occluded person re-identification
1785 -- 1797Sumit Pundir, Mohammad S. Obaidat, Mohammad Wazid, Ashok Kumar Das, Devesh Pratap Singh, Joel J. P. C. Rodrigues. MADP-IIME: malware attack detection protocol in IoT-enabled industrial multimedia environment using machine learning approach
1799 -- 1813Akshi Kumar 0001. Leveraging crowd knowledge to curate documentation for agile software industry using deep learning and expert ranking
1815 -- 1824Ranran Lou, Zhihan Lv, Shuping Dang, Tianyun Su, Xinfang Li. Application of machine learning in ocean data
1825 -- 1838Mohib Ullah Khan, Abdul Rehman Javed, Mansoor Ihsan, Usman Tariq. A novel category detection of social media reviews in the restaurant industry
1839 -- 1852Celestine Iwendi, Gautam Srivastava 0001, Suleman Khan 0003, Praveen Kumar Reddy Maddikunta. Cyberbullying detection solutions based on deep learning architectures

Volume 29, Issue 2

457 -- 458Zhenguang Liu, Roger Zimmermann, Li Cheng 0001. Special issue on human-centric intelligent multimedia understanding
459 -- 468Xiena Dong, Jun Yu, Jian Zhang. Position constrained network for 3D human pose estimation
469 -- 485Xiaofeng Qu, Li Liu 0031, Lei Zhu, Huaxiang Zhang. Attribute-aware style adaptation for person re-identification
487 -- 498Aihua Zhou, Yujun Ma, Wanting Ji, Ming Zong, Pei Yang, Min Wu, Mingzhe Liu. Multi-head attention-based two-stream EfficientNet for action recognition
499 -- 510Liqiang Peng, Qiang Li, Fei Wang. Context-aware and ethics-first crowd mobility portraits over massive smart card data
511 -- 523Yulin Wu, Chang Liu, Lei Chen, Dong Zhao, Qinghe Zheng, Hongchao Zhou. Perturbation consistency and mutual information regularization for semi-supervised semantic segmentation
525 -- 538Haipeng Chen 0002, Yunjie Liu, Zenan Shi. FPF-Net: feature propagation and fusion based on attention mechanism for pancreas segmentation
539 -- 552Fan Liu 0003, Junfeng Wang, Delong Chen, Chunmei Shen, Feng Xu. Asymmetric exponential loss function for crack segmentation
553 -- 568Tao Liu, Mingjun Li, Haibin Zheng, Zhaoyan Ming, Jinyin Chen. Evil vs evil: using adversarial examples to against backdoor attack in federated learning
569 -- 575Chumeng Zhang, Yue Yang, Junbo Guo, Guoqing Jin, Dan Song 0006, Anan Liu. Improving text-image cross-modal retrieval with contrastive loss
577 -- 586An-An Liu, Xiaowen Wang, Ning Xu 0003, Jing Liu, Yuting Su 0001, Quan Zhang, Shenyuan Zhang, Yejun Tang, Junbo Guo, Guoqing Jin, Xuanya Li. SMPC: boosting social media popularity prediction with caption
587 -- 603Xiao Li, Shexiang Ma, Liqing Shan, Xiao Li 0001. Multi-window Transformer parallel fusion feature pyramid network for pedestrian orientation detection
605 -- 614Yifan Jiao, Sisi You. Rescue decision via Earthquake Disaster Knowledge Graph reasoning
615 -- 626Xiaoyan Tian, Ye Jin, Xianglong Tang. Local-Global Transformer Neural Network for temporal action segmentation
627 -- 640Zupeng Ai, Chengwei Peng, Jun Jiang, Zekun Li, Bing Li 0001. Face swapping detection based on identity spatial constraints with weighted frequency division
641 -- 652Zhong Qu, Lili Wang. Gating attention convolutional networks with dense connection for pixel-level crack detection
653 -- 668Yutong Shi, Xiujuan Wang, Kangfeng Zheng, Siwei Cao. User authentication method based on keystroke dynamics and mouse dynamics using HDA
669 -- 691Mohammad Javad Parseh, Mohammad Rahmanimanesh, Parviz Keshavarzi, Zohreh Azimifar. Semantic embedding: scene image classification using scene-specific objects
693 -- 724Ping Feng, Zhenjun Tang. A survey of visual neural networks: current trends, challenges and opportunities
725 -- 738Lei Li, Fan Tang, Juan Cao, Xirong Li 0001, Danding Wang. Bias oriented unbiased data augmentation for cross-bias representation learning
739 -- 751Sree Ganesh T. N, Rishi Satish, Rajeswari Sridhar. Learning effective embedding for automated COVID-19 prediction from chest X-ray images
753 -- 762Jianhui He, Chunlong Hu, Lijuan Wang. Facial age estimation based on asymmetrical label distribution
763 -- 770Jin Che, Yuxia Zhang, Qi Yang, Yuting He. Research on person re-identification based on posture guidance and feature alignment
771 -- 786Rudrika Kalsotra, Sakshi Arora. Performance analysis of U-Net with hybrid loss for foreground detection
787 -- 796Gan Hu, Yanli Ji, Xingzhu Liang, Yuexing Han. Layer-fusion for online mutual knowledge distillation
797 -- 809Xuyang Lu, Yang Gao. Guide and interact: scene-graph based generation and control of video captions
811 -- 829Zhenhua Tang, Jiemei Yao, Qian Zhang, Yuanting Luo. Multi-operator image retargeting with visual quality preservation of salient regions
831 -- 845Wenying Wen, Yunpeng Jian, Yuming Fang, Yushu Zhang, Baolin Qiu. Authenticable medical image-sharing scheme based on embedded small shadow QR code and blockchain framework
847 -- 869Luis Rei, Dunja Mladenic, Mareike Dorozynski, Franz Rottensteiner, Thomas Schleider, Raphaël Troncy, Jorge Sebastián Lozano, Mar Gaitán Salvatella. Multimodal metadata assignment for cultural heritage artifacts
871 -- 886Cheng-Jian Qiu, Yuqing Song 0001, Zhe Liu 0004, Jing Yin, Kai Han, Yi Liu. CMFCUNet: cascaded multi-scale feature calibration UNet for pancreas segmentation

Volume 29, Issue 1

1 -- 13Menghao Hu, Mingxuan Luo, Menghua Huang, Wenhua Meng, Baochen Xiong, Xiaoshan Yang, Jitao Sang. Towards a multimodal human activity dataset for healthcare
15 -- 31Santosh Kumar Tripathy, Harsh Kostha, Rajeev Srivastava. TS-MDA: two-stream multiscale deep architecture for crowd behavior prediction
33 -- 48Zijie Yang, Lingxi Xie, Wei Zhou, Xinyue Huo, Longhui Wei, Jian Lu, Qi Tian 0001, Sheng Tang. VoxSeP: semi-positive voxels assist self-supervised 3D medical segmentation
49 -- 58Hengyou Wang, Yanfei Song, Lianzhi Huo, Linlin Chen, Qiang He. Multiscale object detection based on channel and data enhancement at construction sites
59 -- 71Weijia Liu, Jiuxin Cao, Yilin Zhu, Bo Liu 0004, Xuelin Zhu. Real-time anomaly detection on surveillance video with two-stream spatio-temporal generative model
73 -- 103R. Rashmi Adyapady, B. Annappa. A comprehensive review of facial expression recognition techniques
105 -- 115Zhexin Zhang, Jiajun Ding, Jun Yu 0002, Yiming Yuan, Jianping Fan 0001. Import vertical characteristic of rain streak for single image deraining
117 -- 128Kunhong Wu, Liang Li, Yahong Han. Weighted progressive alignment for multi-source domain adaptation
129 -- 138Zhongyue Chen, Jiangqi Chen, Guangliu Ding, He Huang. A lightweight CNN-based algorithm and implementation on embedded system for real-time face recognition
139 -- 151Zehao Lin, Jiahui She, Qiu Shen. Real emotion seeker: recalibrating annotation for facial expression recognition
153 -- 166Letian Wang, Quan Zhou, Yuling Ma, Jie Guo, Xiushan Nie, Yilong Yin. Deep regional detail-aware hashing
167 -- 195Shradha Dubey, Manish Dixit. A comprehensive survey on human pose estimation approaches
197 -- 210Shengjie Liu, Ning He, Cheng Wang, Haigang Yu, Wenjing Han. Lightweight human pose estimation algorithm based on polarized self-attention
211 -- 222Ye Li, Kangning Yin, Jie Liang, Zhuofu Tan, Xinzhong Wang, Guangqiang Yin, Zhiguo Wang. A multitask joint framework for real-time person search
223 -- 234Weidong Zhu, Jun Sun 0019, Simin Wang, Kaifeng Yang, Jifeng Shen, Xin Zhou. Segmentation and recognition of filed sweet pepper based on improved self-attention convolutional neural networks
235 -- 246Pengyi Hao, Yali Li, Cong Bai. Meta-relationship for course recommendation in MOOCs
247 -- 259Dicong Wang, Qinghua Hu, Kaijun Wu. Dual-branch network with memory for video anomaly detection
261 -- 273Zhiling Cai, Ruijia Li, Hong Wu. Learning unified anchor graph based on affinity relationships with strong consensus for multi-view spectral clustering
275 -- 287Lu Zhao, Liming Yuan, Kun Hao, Xianbin Wen. Generalized attention-based deep multi-instance learning
289 -- 303Xiang Gao, Lijuan Xu, Fan Wang, Xiaopeng Hu. Multi-branch aware module with channel shuffle pixel-wise attention for lightweight image super-resolution
305 -- 321Xikun Liang, Limin Tao, Bin Hu. Image bit planes approximate reconstruction and encryption based on Gaussian function and multiple parameters chaos
323 -- 332Hanguang Xiao, Yuewei Li, Yu Xiu, Qingling Xia. Development of outdoor swimmers detection system with small object detection method based on deep learning
333 -- 346Wen Guo, Dong Li, Bowen Liang, Bin Shan. Multi-view region proposal network predictive learning for tracking
347 -- 359Hufei Wang, Kaiqiang Zhao, Dexin Zhao. A triple fusion model for cross-modal deep hashing retrieval
361 -- 375Nesrine Tarhouni, Masmoudi Salma, Maha Charfeddine, Chokri Ben Amar. Fake COVID-19 videos detector based on frames and audio watermarking
377 -- 387Wanjun Liu, Junkai Wang, Haicheng Qu, Lei Shen. Hierarchical MVSNet with cost volume separation and fusion based on U-shape feature extraction
389 -- 399Qiming Yan, Yubao Sun, Shaojing Fan, Liling Zhao. Polarity-aware attention network for image sentiment analysis
401 -- 420Srishti Yadav, Shahram Payandeh. DATaR: Depth Augmented Target Redetection using Kernelized Correlation Filter
421 -- 433Sahar Dammak, Hazar Mliki, Emna Fendri. Gender estimation based on deep learned and handcrafted features in an uncontrolled environment
435 -- 446Yang Yang 0046, Yiwen Xiong, Yanqing Cao, Lanling Zeng, Yan Zhao, Yongzhao Zhan. Fast bilateral filter with spatial subsampling
449 Honggui Li, Dimitri Galayko. Correction to: Deep reconstruction of 1D ISOMAP representations
451 Muhammad Pervez Akhter, Jiangbin Zheng, Irfan Raza Naqvi, Mohammed Abdelmajeed, Tehseen Zia. Correction to: Abusive language detection from social media comments using conventional machine learning and deep learning approaches
453 Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay. Correction to: Attention based video captioning framework for Hindi
455 Hanyun Zhang, Dongliang Guo, Wei Liu, Junlan Nie, Shuo Li. Correction to: An improved algorithm of video quality assessment by danmaku analysis