Multimedia Syst. - researchr journal

researchr

You are not signed in
Sign in
Sign up

3151	--	3168	Chhavi Dixit, Shashank Mouli Satapathy. A customizable framework for multimodal emotion recognition using ensemble of deep neural network models
3169	--	3177	Ce Zhang, Xiao Yao, Changfeng Shi, Min Gu. Kronecker-factored Approximate Curvature with adaptive learning rate for optimizing model-agnostic meta-learning
3179	--	3191	Humaira Shafiq, Ghulam Gilanie, Muhammad Sajid, Muhammad Ahsan. Dental radiology: a convolutional neural network-based approach to detect dental disorders from dental images in a real-time environment
3193	--	3207	Yuhan Huang, Jiacheng Lu, Nianzhe Chen, Hui Ding, Yuanyuan Shang. A deep learning image inpainting method based on stationary wavelet transform
3209	--	3221	Chuanwang Wen, Shucheng Huang. A LiDAR point cloud registration method combining linear feature extraction and TrICP algorithm
3223	--	3243	Baoying Zheng, Fang Liu 0002, Mohan Zhang, Tongqing Zhou, Shenglan Cui, Yunfan Ye, Yeting Guo. Image captioning for cultural artworks: a case study on ceramics
3245	--	3258	Huimin Qian, Wenyu Shen, Zhengqi Wang, Shuwei Xu. Hotspot defect detection for photovoltaic modules under complex backgrounds
3259	--	3276	Liyan Xiong, Zhida Li, Xiaohui Huang, Yijuan Zeng, Peng Huang. TFA-CNN: an efficient method for dealing with crowding and noise problems in crowd counting
3277	--	3290	Xiaohui Guan, Qiqi Shao, Yaguan Qian, Tengteng Yao, Bin Wang 0062. Adversarial training in logit space against tiny perturbations
3291	--	3303	Zekang Wang, Li Liu 0031, Huaxiang Zhang 0001, Dongmei Liu, Yu Song. Generative adversarial text-to-image generation with style image constraint
3305	--	3328	Mamta Gehlot, Rakesh Kumar Saxena, Geeta Chhabra Gandhi. "Tomato-Village": a dataset for end-to-end tomato disease detection in a real-world environment
3329	--	3339	Xin Wang, Ning He, Chen Hong, Fengxi Sun, Wenjing Han, Qi Wang. YOLO-ERF: lightweight object detector for UAV aerial images
3341	--	3356	Yongwei Gai, Jinglei Liu. Clustering by sparse orthogonal NMF and interpretable neural network
3357	--	3367	Mingju Shao, Guodong Wang. Class-agnostic counting with feature augmentation and similarity comparison
3369	--	3384	Ugur Berk Sahin, Fatih Kamisli. Image compression with learned lifting-based DWT and learned tree-based entropy models
3385	--	3402	Amal Bouatrous, Abdelkrim Meziane, Nadia Zenati, Chafiaâ Hamitouche. A new adaptive VR-based exergame for hand rehabilitation after stroke
3403	--	3419	V. Praveena, L. R. Sujithra, S. Karthik, Muthu Subash Kavitha. Bio-Inspired ensemble feature selection and deep auto-encoder approach for rapid diagnosis of breast cancer
3421	--	3430	Dayong Tian, Yiqin Cao, Yiwen Wei, Deyun Zhou. Narrowing the variance of variational cross-encoder for cross-modal hashing
3431	--	3446	Xin Zhang, Xiaotian Cao, Jun Wang, Lei Wan. G-UNeXt: a lightweight MLP-based network for reducing semantic gap in medical image segmentation
3447	--	3466	Jennil Thiyam, Sanasam Ranbir Singh, Prabin Kumar Bora. Integrated document segmentation and region identification: textual, equation and graphical
3467	--	3480	Tuo Li, Yahong Han. Improving transferable adversarial attack for vision transformers via global attention and local drop
3481	--	3504	Jakub Lokoc, Stelios Andreadis, Werner Bailer, Aaron Duane, Cathal Gurrin, Zhixin Ma, Nicola Messina, Thao-Nhu Nguyen, Ladislav Peska, Luca Rossetto, Loris Sauter, Konstantin Schall, Klaus Schoeffmann, Omar Shahbaz Khan, Florian Spiess, Lucia Vadicamo, Stefanos Vrochidis. Interactive video retrieval in the age of effective joint embedding deep models: lessons from the 11th VBS
3505	--	3520	Shijie Jia, Yan Cui, Xiaoyan Su, Zongzheng Liang. A social-aware video sharing solution using demand prediction of epidemic-based propagation in wireless networks
3521	--	3530	Quan Lin Gu, Sai Yang, TianXing Yu. Lite general network and MagFace CNN for micro-expression spotting in long videos
3531	--	3547	Li Han, Jinhai He, Feng Dou, Huiwen Ma, Xinyang Xie, Wanwen Yang. A viewpoint-guided prototype network for 3D shape classification
3549	--	3557	Zhiwei Ma, Guilin Yao. Deep portrait matting via double-grained segmentation
3559	--	3577	Nan Xie, Zhaojie Liu, Zhengxu Li, Wei Pang, Beier Lu. Student engagement detection in online environment using computer vision and multi-dimensional feature fusion
3579	--	3597	Fanqiang Kong, Jiahui Tang, Yunsong Li, Dan Li 0014, Kedi Hu. Dual-branch spectral-spatial feature extraction network for multispectral image compression
3599	--	3608	Jun Wu, Tianliang Zhu, Jiahui Zhu, Tianyi Li, Chunzhi Wang. Hierarchical multiples self-attention mechanism for multi-modal analysis
3609	--	3623	Yizhong Yang, Tingting Xia, Dajin Li, Zhang Zhang, Guangjun Xie. A multi-scale feature fusion spatial-channel attention model for background subtraction
3625	--	3638	Tao Hu, Xuyu Xiang, Jiaohua Qin, Yun Tan. Audio-text retrieval based on contrastive learning and collaborative attention mechanism
3639	--	3653	Kaisi Yang, Lianyu Zhao, Chenglin Wang. Workpiece tracking based on improved SiamFC++ and virtual dataset
3655	--	3668	Xinglin Pan, Mingxin Gan. Multi-behavior recommendation based on intent learning
3669	--	3684	Bin Liu, Siyan Fang. Multi-aggregation network based on non-separable lifting wavelet for single image deraining
3685	--	3701	Deeksha Gupta, Akashdeep Sharma. A two-stage attention augmented fully convolutional network-based dynamic video summarization
3703	--	3720	Honglin Li, Qinghua Huang. MAF-Net: multidimensional attention fusion network for multichannel speech separation
3721	--	3744	Reza Khodadadi, Gholamreza Ardeshir, Hadi Grailu. Compression of face images using meta-heuristic algorithms based on curvelet transform with variable bit allocation
3745	--	3755	Haiyan Zhang, Quan Wang, Guorui Feng. Artistic image adversarial attack via style perturbation
3757	--	3770	Xin Zheng, Xin He, Yimo Ren, Jinfa Wang, Junyang Yu. Owner named entity recognition in website based on multidimensional text guidance and space alignment co-attention
3771	--	3780	Jiangpeng Zheng, Fan Shi, Meng Zhao, Chen Jia, Congcong Wang. Learning intra-inter-modality complementary for brain tumor segmentation
3781	--	3804	Bowen Xin, Ning Xu 0003, Yingchen Zhai, Tingting Zhang, Zimu Lu, Jing Liu, Weizhi Nie, Xuanya Li, An-An Liu. A comprehensive survey on deep-learning-based visual captioning
3805	--	3818	Wei Xiong, Haoliang Liu, Siya Mi, Yu Zhang 0004. Asymmetric bi-encoder for image-text retrieval
3819	--	3832	Fengjun Xiao, Zhuxi Zhang, Ye Yao. CTNet: hybrid architecture based on CNN and transformer for image inpainting detection
3833	--	3845	Emrah Dönmez, Serhat Kiliçarslan, Cemil Közkurt, Aykut Diker, Fahrettin Burak Demir, Abdullah Elen. Identification of haploid and diploid maize seeds using hybrid transformer model
3847	--	3861	Na Ta, Haipeng Chen, Xianzhu Liu, Nuo Jin. LET-Net: locally enhanced transformer network for medical image segmentation
3863	--	3876	Haoliang Zhou, Shucheng Huang, Yuqiao Xu. Inceptr: micro-expression recognition integrating inception-CBAM and vision transformer
3877	--	3890	Noor Ahmed, Rozina, Ahmad Ali, Abdul Raziq. Images denoising for COVID-19 chest X-ray based on multi-scale parallel convolutional neural network
3891	--	3901	Jiacheng Chang, Lanyong Zhang, Zhuang Shao. View-target relation-guided unsupervised 2D image-based 3D model retrieval via transformer
3903	--	3930	Reza Khodadadi, Gholamreza Ardeshir, Hadi Grailu. Variable bit allocation method based on meta-heuristic algorithms for facial image compression
3931	--	3949	Hüseyin Yasar, Murat Ceylan. A novel study for automatic two-class COVID-19 diagnosis (between COVID-19 and Healthy, Pneumonia) on X-ray images using texture analysis and 2-D/3-D convolutional neural networks
3951	--	3969	Alison Reboud, Ismail Harrando, Pasquale Lisena, Raphaël Troncy. Stories of love and violence: zero-shot interesting events' classification for unsupervised TV series summarization
3971	--	0	Akash Tayal, Jivansha Gupta, Arun Solanki, Khyati Bisht, Anand Nayyar, Mehedi Masud. Correction to: DL‑CNN‑based approach with image processing techniques for diagnosis of retinal diseases
3973	--	0	Hwei Teeng Chong, Chen Kim Lim, Ahmad Rafi, Kian Lam Tan, Mazlin Mokhtar. Correction: Comprehensive systematic review on virtual reality for cultural heritage practices: coherent taxonomy and motivations

2455	--	2467	Ajay Sharma, Bhavana P. Shrivastava, Aayushi Priya. Multilevel progressive recursive dilated networks with correlation filter (MPRDNCF) for image super-resolution
2469	--	2482	Maosheng Zhong, Youde Chen, Hao Zhang, Hao Xiong, Zhixiang Wang. Multimodal-enhanced hierarchical attention network for video captioning
2483	--	2494	Yongzhen Ke, Yin Wang, Kai Wang, Fan Qin, Jing Guo, Shuai Yang. Image aesthetics assessment using composite features from transformer and CNN
2495	--	2509	Susmi Jacob, P. Vinod 0001, Arjun Subramanian, Varun G. Menon. Affect sensing from smartphones through touch and motion contexts
2511	--	2526	Yuqiang Li, Xinyi Shangguan, Chun Liu, Haochen Meng. I2I translation model based on CondConv and spectral domain realness measurement: BCS-StarGAN
2527	--	2543	Chuan Liu, Ying-Ying Tan, Tian-Tian Xia, Jiajing Zhang, Ming Zhu. Co-attention graph convolutional network for visual question answering
2545	--	2562	Zhenying Fang, Jianping Fan 0001, Jun Yu 0002. LPR: learning point-level temporal action localization through re-training
2563	--	2573	Aiping Yang, Yan Liu, Simeng Cheng, Jiale Cao, Zhong Ji, Yanwei Pang. Spatial attention-guided deformable fusion network for salient object detection
2575	--	2589	Xin Yang 0002, Xiangchen Wang, Xiaohui Ye, Tao Li 0011. VMSG: a video caption network based on multimodal semantic grouping and semantic attention
2591	--	2601	Weihao Gao, Yongjun Zhang, Wei Long, Zhongwei Cui. A deraining with detail-recovery network via context aggregation
2603	--	2614	Asha Rani, Pankaj Yadav, Yashaswi Verma. Early-stage autism diagnosis using action videos and contrastive feature learning
2615	--	2631	Yunfei Zheng, Meng Sun 0001, Xiaobing Wang, Tieyong Cao, Xiongwei Zhang, Lixing Xing, Zheng Fang. Self-distillation object segmentation via pyramid knowledge representation and transfer
2633	--	2650	Jian-Wei Zhang, Yifan Sun, Wei Chen. Pull and concentrate: improving unsupervised semantic segmentation adaptation with cross- and intra-domain consistencies
2651	--	2668	Longfeng Shen, Fenglan Qin, Hongying Zhu, Dengdi Sun, Hai Min. EGARNet: adjacent residual lightweight super-resolution network based on extended group-enhanced convolution
2669	--	2687	Mahsa Soleimani, Ali Nazari, Mohsen Ebrahimi Moghaddam. Deepfake detection of occluded images using a patch-based approach
2689	--	2703	Chaithanyadas K. V., G. R. Gnana King. Computer-aided diagnosis for early detection and staging of human pancreatic tumors using an optimized 3D CNN on computed tomography
2705	--	2714	Xiuxia Cai, Pin Zhang, Shuaibin Du. Imitation camouflage synthesis based on shallow neural network
2715	--	2728	Yan Li, Min Xia 0002, Dongmei Jiang. Cross-view adaptive graph attention network for dynamic facial expression recognition
2729	--	2746	Hongwei Zhao, Siquan Wu, Zhen Tian, Yidong Li, Yi Jin 0001, Shengchun Wang. Context-guided coarse-to-fine detection model for bird nest detection on high-speed railway catenary
2747	--	2760	Weiyi Wei, Jian Wang, Mengyu Xu, Futong Zhang. Multimodal heterogeneous graph convolutional network for image recommendation
2761	--	2777	Jiachang Li, Haitao Zhang, Huadong Ma. DRL-based transmission control for QoE guaranteed transmission efficiency optimization in tile-based panoramic video streaming
2779	--	2790	Si Chen 0002, Bolun Xu, Miaohui Zhang, Yan Yan 0001, Xia Du, Weiwei Zhuang, Yun Wu. HC-GCN: hierarchical contrastive graph convolutional network for unsupervised domain adaptation on person re-identification
2791	--	2807	Zhangyu Liu, Zhi Li, Guomei Wang, Youliang Tian, Long Zheng. Robust zero-watermarking algorithm for diffusion-weighted images based on multiscale feature fusion
2809	--	2823	Xianhua Duan, Chaoqiang Jin, Xin Shu. HCPSNet: heterogeneous cross-pseudo-supervision network with confidence evaluation for semi-supervised medical image segmentation
2825	--	2839	Guangtao Wang, Jun Li 0033, Zhijian Wu, Jianhua Xu, Jifeng Shen, Wankou Yang. EfficientFace: an efficient deep network with feature enhancement for accurate face detection
2841	--	0	. Editorial note for few-shot learning for intelligent multimedia systems
2843	--	2851	Xuewei Chao, Lixin Zhang. Few-shot imbalanced classification based on data augmentation
2853	--	2863	Shan Liu, Yichao Tang, Ying Tian, Hansong Su. Visual driving assistance system based on few-shot learning
2865	--	2875	Yue Yang, Zhuo Zhang, Wei Mao, Yang Li 0111, Chengang Lv. Radar target recognition based on few-shot learning
2877	--	2886	You Zhou, Changlin Chen, Shukun Ma. Few-shot ship classification based on metric learning
2887	--	2898	Changlin Chen, Xuewei Chao. Conversion of infrared ocean target images to visible images driven by energy information
2899	--	2912	Rajdeep Chatterjee, Ankita Chatterjee, SK Hafizul Islam, Muhammad Khurram Khan. An object detection-based few-shot learning approach for multimedia quality assessment
2913	--	2922	Xiaolei Li. Few-shot wind turbine blade damage early warning system based on sound signal fusion
2923	--	2933	Wei Ren, Li Zhou, Jie Chen. Unsupervised single image dehazing with generative adversarial network
2935	--	2950	Abdelkader Tayeb Herouala, Benameur Ziani, Chaker Abdelaziz Kerrache, Abdou El Karim Tahari, Nasreddine Lagraa, Spyridon Mastorakis. CaDaCa: a new caching strategy in NDN using data categorization
2951	--	2959	M. Poongodi, Mounir Hamdi, Huihui Wang 0001. Image and audio caps: automated captioning of background sounds and images using deep learning
2961	--	2977	Neha Sharma, Chinmay Chakraborty, Rajeev Kumar. Optimized multimedia data through computationally intelligent algorithms
2979	--	2989	Jiandong Lv, Xingang Wang, Cuiling Shao. TMIF: transformer-based multi-modal interactive fusion for automatic rumor detection
2991	--	3000	Wei Chen, Jing Nie. A MADDPG-based multi-agent antagonistic algorithm for sea battlefield confrontation
3001	--	3013	Zhengjian Li, Jingyi He, Tianlei Ni, Jiaming Huo. Numerical computation based few-shot learning for intelligent sea surface temperature prediction
3015	--	0	. Editorial note for trustworthy multimedia big data computing
3017	--	3026	Zijie Song, Zhenzhen Hu, Richang Hong. Efficient and self-adaptive rationale knowledge base for visual commonsense reasoning
3027	--	3040	Wenzhe Zhai, Qilei Li, Ying Zhou, Xuesong Li, Jinfeng Pan, Guofeng Zou, Mingliang Gao 0001. $\hbox {DA}^2$Net: a dual attention-aware network for robust crowd counting
3041	--	3054	Na Ta 0009, Haipeng Chen 0002, Yingda Lyu, Taosuo Wu. BLE-Net: boundary learning and enhancement network for polyp segmentation
3055	--	3067	Dengyun Xu, Xuanjing Shen, Yongping Huang, Zenan Shi. RB-Net: integrating region and boundary features for image manipulation localization
3069	--	3079	Chunxiao Fan 0002, Zhenxing Wang, Jia Li, Shanshan Wang, Xiao Sun. Robust facial expression recognition with global-local joint representation learning
3081	--	3093	Jing Ge, Qianxiang Wang, Guangyu Gao. Hardest and semi-hard negative pairs mining for text-based person search with visual-textual attention
3095	--	3103	Yi Wang, Shixin Zheng, Xiao Sun 0003, Dan Guo, Junjie Lang. Micro-expression recognition with attention mechanism and region enhancement
3105	--	3114	Wenyi Hu, Xiao Wang, Zheng Wang, Xin Xu, Ruimin Hu. Dual-focus: person search from Coarse-Grained Focus to Fine-Grained Focus
3115	--	3138	Haoming Chen, Runyang Feng, Sifan Wu, Hao Xu, Fengcheng Zhou, Zhenguang Liu. 2D Human pose estimation: a survey
3139	--	3150	Jian Wang, Xiaoyu Du, Yu Cheng, Yunlian Sun, Jinhui Tang 0001. SI-Net: spatial interaction network for deepfake detection

1853	--	1863	Saifullah Tumrani, Wazir Ali, Rajesh Kumar 0014, Abdullah Aman Khan, Fayaz Ali Dharejo. View-aware attribute-guided network for vehicle re-identification
1865	--	1895	Palash Ray, Asish Bera, Debasis Giri, Debotosh Bhattacharjee. Style matching CAPTCHA: match neural transferred styles to thwart intelligent attacks
1897	--	1915	He Zhang, Lu Yin, Hanling Zhang. A review of micro-expression spotting: methods and challenges
1917	--	1940	Carlos Vilchis, Carmina Pérez-Guerrero, Mauricio Mendez-Ruiz, Miguel González-Mendoza 0001. A survey on the pipeline evolution of facial capture and tracking for digital humans
1941	--	1954	Kai Hu 0006, Junlan Jin, Chaowen Shen, Min Xia 0002, Liguo Weng. Attentional weighting strategy-based dynamic GCN for skeleton-based action recognition
1955	--	1966	Anqi Zheng, Shiqi Zheng, Cong Bai, Deng Chen. Triple-level relationship enhanced transformer for image captioning
1967	--	1980	Gang Wang, Shucheng Huang, Zhe Tao. Shallow multi-branch attention convolutional neural network for micro-expression recognition
1981	--	1994	Lei Yang, Yong Feng 0002, Mingliang Zhou, Xiancai Xiong, Yongheng Wang, Baohua Qiang. Multi-level network based on transformer encoder for fine-grained image-text matching
1995	--	2007	An-An Liu, Yuwei Zhang, Chenyu Zhang, Wenhui Li 0001, Bo Lv, Lei Lei, Xuanya Li. Prototype-based semantic consistency learning for unsupervised 2D image-based 3D shape retrieval
2009	--	2035	B. Bhaskar Reddy, M. Venkata Sudhakar, P. Rahul Reddy, P. Raghava Reddy. Ensemble deep honey architecture for COVID-19 prediction using CT scan and chest X-ray images
2037	--	2048	Yizhong Yang, Ce Hou, Haixia Huang, Zhang Zhang, Guangjun Xie. Cascaded deep residual learning network for single image dehazing
2049	--	2057	Elena Battini Sönmez, Sefer Memis, Berker Arslan, Okan Zafer Batur. The segmented UEC Food-100 dataset with benchmark experiment on food detection
2059	--	2072	Furong Ma, Guiyu Xia, Qingshan Liu 0001. Human pose transfer via shape-aware partial flow prediction network
2073	--	2083	Xin Xu, Gang Lv, Yining Sun, Yuxia Hu, Fudong Nian. Hierarchical cross-modal contextual attention network for visual grounding
2085	--	2097	Honghong Yang, Hongxi Liu, Yumei Zhang, Xiaojun Wu 0002. HSGNet: hierarchically stacked graph network with attention mechanism for 3D human pose estimation
2099	--	2110	Awais Ahmed, She Kun 0001, Junaid Ahmed, Shaukat Hayat, Abdullah Aman Khan. Multimodal image enhancement using convolutional sparse coding
2111	--	2124	Tarun Agrawal, Prakash Choudhary. COVID-SegNet: encoder-decoder-based architecture for COVID-19 lesion segmentation in chest X-ray
2125	--	2135	Kangkang Wei, Weiqi Luo 0001, Minglin Liu, Miaoxin Ye. Residual guided coordinate attention for selection channel aware image steganalysis
2137	--	2152	Jian Shi, Geng Sun 0001, Jinyu Zhang, Zhihui Wang, Haojie Li. Face attribute recognition via end-to-end weakly supervised regional location
2153	--	2164	Mengting Liu, Xinrui Li, Yongge Liu, Yahong Han. Weakly supervised anomaly detection with multi-level contextual modeling
2165	--	2180	Hafsa Ilyas, Ali Javed, Khalid Mahmood Malik, Aun Irtaza. E-Cap Net: an efficient-capsule network for shallow and deepfakes forgery detection
2181	--	2191	Yingyuan Zhao, Zhiyi Tan 0002, Bing-Kun Bao, Zhengzheng Tu. Centralized sub-critic based hierarchical-structured reinforcement learning for temporal sentence grounding
2193	--	2203	Zepeng Li, Wenchuan Cheng, Jiawei Zhou, Zhengyi An, Bin Hu 0001. Deep learning model with multi-feature fusion and label association for suicide detection
2205	--	2216	Jing Sun, Rui Yan, Bing Zhang, Bing Zhu, Fuming Sun. A cross-view geo-localization method guided by relation-aware global attention
2217	--	2238	Mei-Ting Su, Mei-Ling Chiang, Chia-Hsuan Tsai, Chi-Wei Lin, Rong-Xuan Liu, Yong-Ting Juang, Hsin-Hao Chen. An acupoint health care system with real-time acupoint localization and visualization in augmented reality
2239	--	2252	Tobias Mühling, Isabelle Späth, Joy Backhaus, Nathalie Milke, Sebastian Oberdörfer, Alexander Meining, Marc Erich Latoschik, Sarah Koenig. Virtual reality in medical emergencies training: benefits, perceived stress, and learning success
2253	--	2262	Shulin Cheng, Huimin Jiang, Wanyan Wang, Wei Jiang. Research on multi-context aware recommendation methods based on tensor factorization
2263	--	2279	Yulin Deng, Liju Yin, Xiaoning Gao, Hui Zhou, ZhenZhou Wang, Guofeng Zou. EA-EDNet: encapsulated attention encoder-decoder network for 3D reconstruction in low-light-level environment
2281	--	2292	Fangzheng Xu, Yu Bao, Bingye Li, Zhining Hou, Lekang Wang. Entropy minimization and domain adversarial training guided by label distribution similarity for domain adaptation
2293	--	2322	Khouloud Salameh, Farah El Akoum, Joe Tekli. Unsupervised knowledge representation of panoramic dental X-ray images using SVG image-and-object clustering
2323	--	2335	Dailiang Wei, Juanli Li, Bo Li, Xin Wang, Siyuan Chen, Xuewen Wang, Luyao Wang. A fast recognition method for coal gangue image processing
2337	--	2349	Suchi Jain, Geeta Sikka, Renu Dhir. An automatic cascaded approach for pancreas segmentation via an unsupervised localization using 3D CT volumes
2351	--	2362	Changshui Yang, Yan Liu, Qiang Liu, Riaz Ullah Khan, Bin Chen, Wenyong Wang. Dual semantic-aligned clustering for cross-domain person re-identification
2363	--	2373	Bolin Wang, Yuanyuan Sun, Yonghe Chu, Changrong Min, Zhihao Yang, Hongfei Lin. Local discriminative graph convolutional networks for text classification
2375	--	2388	Israr-Ur-Rehman, Muhammad Shehzad Hanif, Zulfiqar Ali 0002, Zahoor Jan, Cobbinah Bernard Mawuli, Waqar Ali. Empowering neural collaborative filtering with contextual features for multimedia recommendation
2389	--	2398	Zekun Yang, Yuta Nakashima, Haruo Takemura. Multi-modal humor segment prediction in video
2399	--	2413	Gaoming Yang, Anxing Wei, Xianjin Fang, Ji Zhang. FDS_2D: rethinking magnitude-phase features for DeepFake detection
2415	--	2427	Hong Lin, Xi Wang, Chun Liu, Dewei Peng. HRCutBlur Augment: effectively enhancing data diversity for image super-resolution
2429	--	2437	Hongbo Xing, Guanqun Zhou, Shusen Yuan, Youjun Jiang, Pinyong Geng, Yewen Cao, Yujun Li, Lei Chen. Micro-expression spotting network based on attention and one-dimensional convolutional sliding window
2439	--	2454	Hitesh D. Panchal, Hitesh B. Shah. Multiple forgery detection in digital video based on inconsistency in video quality assessment attributes

887	--	895	Shanqing Zhang, Yujie Chen, Yiheng Meng, Jianfeng Lu 0005, Li Li 0014, Rui Bai. A multi-level feature weight fusion model for salient object detection
897	--	915	Sara Akan, Songül Varli. Use of deep learning in soccer videos analysis: survey
917	--	943	Anjali Gautam. Recent advancements of deep learning in detecting breast cancer: a survey
945	--	959	Linfeng Liu, Tong Chen 0004, Haojie Liu, Shiliang Pu, Li Wang, Qiu Shen. 2C-Net: integrate image compression and classification via deep neural network
961	--	979	M. Kavitha. MDP-HML: an efficient detection method for multiple human disease using retinal fundus images based on hybrid learning techniques
981	--	1000	Shuying Zhang, Jing Zhang, Yizhou Wang, Li Zhuo. Short video fingerprint extraction: from audio-visual fingerprint fusion to multi-index hashing
1001	--	1010	Qingtian Zeng, Liangwei Niu, Shansong Wang, Weijian Ni. SEViT: a large-scale and fine-grained plant disease classification model based on transformer and attention convolution
1011	--	1023	Yanxue Wang, Shansong Wang, Weijian Ni, Qingtian Zeng. PAST-net: a swin transformer and path aggregation model for anthracnose instance segmentation
1025	--	1041	Deepak Dhillon, Rajlaxmi Chouhan. Edge-preserving image denoising using noise-enhanced patch-based non-local means
1043	--	1056	Jingdan Li, Yi Wang, Dexin Zhao. Layer-wise enhanced transformer with multi-modal fusion for image caption
1057	--	1071	Hao Sun, Xiaolin Qin, Xiaojing Liu. Image-text matching using multi-subspace joint representation
1073	--	1087	Wenying Wen, Rongxin Tu, Yushu Zhang, Yuming Fang, Yong Yang 0001. A multi-level approach with visual information for encrypted H.265/HEVC videos
1089	--	1101	Heling Cao, Lei Li, Yonghe Chu, Miaolei Deng, Panpan Wang, Chenyang Zhao. A coincidental correctness test case identification framework with fuzzy C-means clustering
1103	--	1116	Jasvinder Pal Singh, Uday Pratap Singh, Sanjeev Jain. Model-based person identification in multi-gait scenario using hybrid classifier
1117	--	1130	Thae Song Kim, Su Hyon Kim. An improved contrast enhancement for dark images with non-uniform illumination based on edge preservation
1131	--	1144	Fuming Sun, Tingting Zhao, Bing Zhu, Xu Jia, Fasheng Wang. Deblurring transformer tracking with conditional cross-attention
1145	--	1159	Xingjian Gu, Yongjie Zhu, Shougang Ren, Xiangbo Shu. BCMask: a finer leaf instance segmentation with bilayer convolution mask
1161	--	1172	Chenquan Gan, Xiaopeng Cao, Qingyi Zhu. Microblog sentiment analysis via user representative relationship under multi-interaction hybrid neural networks
1173	--	1185	Neetu Singla, Sushama Nagpal, Jyotsna Singh. A two-stage forgery detection and localization framework based on feature classification and similarity metric
1187	--	1202	Dong Xie 0005, Bin Wu, Fulong Chen 0002, Taochun Wang, Zebang Hu, Yibo Zhang. A low-overhead compressed sensing-driven multi-party secret image sharing scheme
1203	--	1230	Anusha Chhabra, Dinesh Kumar Vishwakarma. A literature survey on multimodal and multilingual automatic hate speech identification
1231	--	1244	Youyu Liu, Yi Li, Dezhang Xu, Qingyan Yang, Wanbao Tao. Adaptive Kalman Filter with power transformation for online multi-object tracking
1245	--	1276	Min-Jen Tsai, Hung-Yu Wu, Di-Ting Lin. Auto ROI & mask R-CNN model for QR code beautification (ARM-QR)
1277	--	1290	Adithya Sineesh, Mahesh Raveendranatha Panicker. Edge preserved universal pooling: novel strategies for pooling in convolutional neural networks
1291	--	1300	Fangmei Chen, Yuying Wang, Sheng Xu, Fasheng Wang, Fuming Sun, Xu Jia. Style transfer network for complex multi-stroke text
1301	--	1314	Xin Chao, Zhenjie Hou, Yujian Mo, Haiyong Shi, Wenjing Yao. Structural feature representation and fusion of human spatial cooperative motion for action recognition
1315	--	1334	Joel Dickson, Arul Linsely, R. J. Alice Nineta. An integrated 3D-sparse deep belief network with enriched seagull optimization algorithm for liver segmentation
1335	--	1345	Sicheng Zhang, Jin Liu, Bo Hu, Zhendong Mao. GH-DDM: the generalized hybrid denoising diffusion model for medical image generation
1347	--	1360	Huanjie Tao, Minghao Lu, Zhenwu Hu, Jianfeng An. A gated multi-hierarchical feature fusion network for recognizing steel plate surface defects
1361	--	1376	Aashania Antil, Chhavi Dhiman. A two stream face anti-spoofing framework using multi-level deep features and ELBP features
1377	--	1389	N. Venugopal. SCMACDnet: multilevel fusion-based deep twin capsule network for change detection
1391	--	1403	Jiajun Ding, Beili Liu, Jun Yu 0002, Huanlei Guo, Ming Shen, Kenong Shen. An efficient multi-path structure with staged connection and multi-scale mechanism for text-to-image synthesis
1405	--	1416	Wei Li, Xiwei Yang, Zhixin Li. MLCB-Net: a multi-level class balancing network for domain adaptive semantic segmentation
1417	--	1429	Yuzhe He, Ning He, Haigang Yu, Ren Zhang, Kang Yan. From macro to micro: rethinking multi-scale pedestrian detection
1431	--	1451	Ehsan Jafari, Ardeshir Dolati, Kamran Layeghi. Object tracking using fuzzy-based improved graph, interesting patches and multi-label MRF optimization
1453	--	1462	Zhenfeng Zhang, Chuhua Huang, RenJing Huang, Yanan Li, Yifan Chen. Illu-NASNet: unsupervised illumination estimation based on dense spatio-temporal smoothness
1463	--	1479	Shisong Huang, Danyang Li, Zhuhong Zhang, Yating Wu, Yumei Tang, Xing Chen, Yiqing Wu. CSLSEP: an ensemble pruning algorithm based on clustering soft label and sorting for facial expression recognition
1481	--	1498	Pengqing Li, Hongjuan Zhang, Yansong Chen. Structural local sparse and low-rank tracker using deep features
1499	--	1512	Lei Li 0043, Tingting Liu, Chengyu Wang 0001, Minghui Qiu, Cen Chen, Ming Gao 0001, Aoying Zhou. Resizing codebook of vector quantization without retraining
1513	--	1526	Seyma Derdiyok, Fatma Patlar Akbulut. Biosignal based emotion-oriented video summarization
1527	--	1577	Deepika Sharma, Arvind Selwal. A survey on face presentation attack detection mechanisms: hitherto and future perspectives
1579	--	1592	Leyuan Liu, Yunqi Gao, Jianchi Sun, Jingying Chen. Single-image clothed 3D human reconstruction guided by a well-aligned parametric body model
1593	--	1601	Xin Shu, Jia Li, Liang Shi, Shucheng Huang. RES-CapsNet: an improved capsule network for micro-expression recognition
1603	--	1627	Ercan Gürsoy, Yasin Kaya. An overview of deep learning techniques for COVID-19 detection: methods, challenges, and future works
1629	--	1650	A. Mary Dayana, W. R. Sam Emmanuel, C. Harriet Linda. Feature fusion and optimization integrated refined deep residual network for diabetic retinopathy severity classification using fundus image
1651	--	1661	Birkan Buyukarikan, Erkan Ülker. Convolutional neural network-based apple images classification and image quality measurement by light colors using the color-balancing approach
1663	--	1664	Nawab Muhammad Faseeh Qureshi, Varun G. Menon, Ali Kashif Bashir, Shahid Mumtaz, Irfan Mehmood. Role of deep learning models and analytics in industrial multimedia environment
1665	--	1681	Tiago do Carmo Nogueira, Cássio Dener Noronha Vinhal, Gélson da Cruz Júnior, Matheus Rudolfo Diedrich Ullmann, Thyago Carvalho Marques. A reference-based model using deep learning for image captioning
1683	--	1697	Ahmed Barnawi, Prateek Chhikara, Rajkumar Tekchandani, Neeraj Kumar 0001, Mehrez Boulares. A CNN-based scheme for COVID-19 detection with emergency services provisions using an optimal path planning
1699	--	1715	Faria Nazir, Muhammad Nadeem Majeed, Mustansar Ali Ghazanfar, Muazzam Maqsood. A computer-aided speech analytics approach for pronunciation feedback using deep feature clustering
1717	--	1727	Linbo Wang, Li Tan, Xianyong Fang, Yanwen Guo 0001, Shaohua Wan 0001. Adaptively feature matching via joint transformational-spatial clustering
1729	--	1738	Loveleen Gaur, Ujwal Bhatia, N. Z. Jhanjhi, Ghulam Muhammad, Mehedi Masud. Medical image-based detection of COVID-19 using Deep Convolution Neural Networks
1739	--	1749	Asma Kausar, Imran Razzak, Mohd Ibrahim Shapiai, Amin Beheshti. 3D shallow deep neural network for fast and precise segmentation of left atrium
1751	--	1770	Jimmy Ming-Tai Wu, Zhongcui Li, Norbert Herencsar, Bay Vo, Jerry Chun-Wei Lin. A graph-based CNN-LSTM stock price prediction algorithm with leading indicators
1771	--	1783	Gengsheng Xie, Xianbin Wen, Liming Yuan, Jianchen Wang, Changlun Guo, Yansong Jia, Minghao Li. Pose-guided feature region-based fusion network for occluded person re-identification
1785	--	1797	Sumit Pundir, Mohammad S. Obaidat, Mohammad Wazid, Ashok Kumar Das, Devesh Pratap Singh, Joel J. P. C. Rodrigues. MADP-IIME: malware attack detection protocol in IoT-enabled industrial multimedia environment using machine learning approach
1799	--	1813	Akshi Kumar 0001. Leveraging crowd knowledge to curate documentation for agile software industry using deep learning and expert ranking
1815	--	1824	Ranran Lou, Zhihan Lv, Shuping Dang, Tianyun Su, Xinfang Li. Application of machine learning in ocean data
1825	--	1838	Mohib Ullah Khan, Abdul Rehman Javed, Mansoor Ihsan, Usman Tariq. A novel category detection of social media reviews in the restaurant industry
1839	--	1852	Celestine Iwendi, Gautam Srivastava 0001, Suleman Khan 0003, Praveen Kumar Reddy Maddikunta. Cyberbullying detection solutions based on deep learning architectures

457	--	458	Zhenguang Liu, Roger Zimmermann, Li Cheng 0001. Special issue on human-centric intelligent multimedia understanding
459	--	468	Xiena Dong, Jun Yu, Jian Zhang. Position constrained network for 3D human pose estimation
469	--	485	Xiaofeng Qu, Li Liu 0031, Lei Zhu, Huaxiang Zhang. Attribute-aware style adaptation for person re-identification
487	--	498	Aihua Zhou, Yujun Ma, Wanting Ji, Ming Zong, Pei Yang, Min Wu, Mingzhe Liu. Multi-head attention-based two-stream EfficientNet for action recognition
499	--	510	Liqiang Peng, Qiang Li, Fei Wang. Context-aware and ethics-first crowd mobility portraits over massive smart card data
511	--	523	Yulin Wu, Chang Liu, Lei Chen, Dong Zhao, Qinghe Zheng, Hongchao Zhou. Perturbation consistency and mutual information regularization for semi-supervised semantic segmentation
525	--	538	Haipeng Chen 0002, Yunjie Liu, Zenan Shi. FPF-Net: feature propagation and fusion based on attention mechanism for pancreas segmentation
539	--	552	Fan Liu 0003, Junfeng Wang, Delong Chen, Chunmei Shen, Feng Xu. Asymmetric exponential loss function for crack segmentation
553	--	568	Tao Liu, Mingjun Li, Haibin Zheng, Zhaoyan Ming, Jinyin Chen. Evil vs evil: using adversarial examples to against backdoor attack in federated learning
569	--	575	Chumeng Zhang, Yue Yang, Junbo Guo, Guoqing Jin, Dan Song 0006, Anan Liu. Improving text-image cross-modal retrieval with contrastive loss
577	--	586	An-An Liu, Xiaowen Wang, Ning Xu 0003, Jing Liu, Yuting Su 0001, Quan Zhang, Shenyuan Zhang, Yejun Tang, Junbo Guo, Guoqing Jin, Xuanya Li. SMPC: boosting social media popularity prediction with caption
587	--	603	Xiao Li, Shexiang Ma, Liqing Shan, Xiao Li 0001. Multi-window Transformer parallel fusion feature pyramid network for pedestrian orientation detection
605	--	614	Yifan Jiao, Sisi You. Rescue decision via Earthquake Disaster Knowledge Graph reasoning
615	--	626	Xiaoyan Tian, Ye Jin, Xianglong Tang. Local-Global Transformer Neural Network for temporal action segmentation
627	--	640	Zupeng Ai, Chengwei Peng, Jun Jiang, Zekun Li, Bing Li 0001. Face swapping detection based on identity spatial constraints with weighted frequency division
641	--	652	Zhong Qu, Lili Wang. Gating attention convolutional networks with dense connection for pixel-level crack detection
653	--	668	Yutong Shi, Xiujuan Wang, Kangfeng Zheng, Siwei Cao. User authentication method based on keystroke dynamics and mouse dynamics using HDA
669	--	691	Mohammad Javad Parseh, Mohammad Rahmanimanesh, Parviz Keshavarzi, Zohreh Azimifar. Semantic embedding: scene image classification using scene-specific objects
693	--	724	Ping Feng, Zhenjun Tang. A survey of visual neural networks: current trends, challenges and opportunities
725	--	738	Lei Li, Fan Tang, Juan Cao, Xirong Li 0001, Danding Wang. Bias oriented unbiased data augmentation for cross-bias representation learning
739	--	751	Sree Ganesh T. N, Rishi Satish, Rajeswari Sridhar. Learning effective embedding for automated COVID-19 prediction from chest X-ray images
753	--	762	Jianhui He, Chunlong Hu, Lijuan Wang. Facial age estimation based on asymmetrical label distribution
763	--	770	Jin Che, Yuxia Zhang, Qi Yang, Yuting He. Research on person re-identification based on posture guidance and feature alignment
771	--	786	Rudrika Kalsotra, Sakshi Arora. Performance analysis of U-Net with hybrid loss for foreground detection
787	--	796	Gan Hu, Yanli Ji, Xingzhu Liang, Yuexing Han. Layer-fusion for online mutual knowledge distillation
797	--	809	Xuyang Lu, Yang Gao. Guide and interact: scene-graph based generation and control of video captions
811	--	829	Zhenhua Tang, Jiemei Yao, Qian Zhang, Yuanting Luo. Multi-operator image retargeting with visual quality preservation of salient regions
831	--	845	Wenying Wen, Yunpeng Jian, Yuming Fang, Yushu Zhang, Baolin Qiu. Authenticable medical image-sharing scheme based on embedded small shadow QR code and blockchain framework
847	--	869	Luis Rei, Dunja Mladenic, Mareike Dorozynski, Franz Rottensteiner, Thomas Schleider, Raphaël Troncy, Jorge Sebastián Lozano, Mar Gaitán Salvatella. Multimodal metadata assignment for cultural heritage artifacts
871	--	886	Cheng-Jian Qiu, Yuqing Song 0001, Zhe Liu 0004, Jing Yin, Kai Han, Yi Liu. CMFCUNet: cascaded multi-scale feature calibration UNet for pancreas segmentation

1	--	13	Menghao Hu, Mingxuan Luo, Menghua Huang, Wenhua Meng, Baochen Xiong, Xiaoshan Yang, Jitao Sang. Towards a multimodal human activity dataset for healthcare
15	--	31	Santosh Kumar Tripathy, Harsh Kostha, Rajeev Srivastava. TS-MDA: two-stream multiscale deep architecture for crowd behavior prediction
33	--	48	Zijie Yang, Lingxi Xie, Wei Zhou, Xinyue Huo, Longhui Wei, Jian Lu, Qi Tian 0001, Sheng Tang. VoxSeP: semi-positive voxels assist self-supervised 3D medical segmentation
49	--	58	Hengyou Wang, Yanfei Song, Lianzhi Huo, Linlin Chen, Qiang He. Multiscale object detection based on channel and data enhancement at construction sites
59	--	71	Weijia Liu, Jiuxin Cao, Yilin Zhu, Bo Liu 0004, Xuelin Zhu. Real-time anomaly detection on surveillance video with two-stream spatio-temporal generative model
73	--	103	R. Rashmi Adyapady, B. Annappa. A comprehensive review of facial expression recognition techniques
105	--	115	Zhexin Zhang, Jiajun Ding, Jun Yu 0002, Yiming Yuan, Jianping Fan 0001. Import vertical characteristic of rain streak for single image deraining
117	--	128	Kunhong Wu, Liang Li, Yahong Han. Weighted progressive alignment for multi-source domain adaptation
129	--	138	Zhongyue Chen, Jiangqi Chen, Guangliu Ding, He Huang. A lightweight CNN-based algorithm and implementation on embedded system for real-time face recognition
139	--	151	Zehao Lin, Jiahui She, Qiu Shen. Real emotion seeker: recalibrating annotation for facial expression recognition
153	--	166	Letian Wang, Quan Zhou, Yuling Ma, Jie Guo, Xiushan Nie, Yilong Yin. Deep regional detail-aware hashing
167	--	195	Shradha Dubey, Manish Dixit. A comprehensive survey on human pose estimation approaches
197	--	210	Shengjie Liu, Ning He, Cheng Wang, Haigang Yu, Wenjing Han. Lightweight human pose estimation algorithm based on polarized self-attention
211	--	222	Ye Li, Kangning Yin, Jie Liang, Zhuofu Tan, Xinzhong Wang, Guangqiang Yin, Zhiguo Wang. A multitask joint framework for real-time person search
223	--	234	Weidong Zhu, Jun Sun 0019, Simin Wang, Kaifeng Yang, Jifeng Shen, Xin Zhou. Segmentation and recognition of filed sweet pepper based on improved self-attention convolutional neural networks
235	--	246	Pengyi Hao, Yali Li, Cong Bai. Meta-relationship for course recommendation in MOOCs
247	--	259	Dicong Wang, Qinghua Hu, Kaijun Wu. Dual-branch network with memory for video anomaly detection
261	--	273	Zhiling Cai, Ruijia Li, Hong Wu. Learning unified anchor graph based on affinity relationships with strong consensus for multi-view spectral clustering
275	--	287	Lu Zhao, Liming Yuan, Kun Hao, Xianbin Wen. Generalized attention-based deep multi-instance learning
289	--	303	Xiang Gao, Lijuan Xu, Fan Wang, Xiaopeng Hu. Multi-branch aware module with channel shuffle pixel-wise attention for lightweight image super-resolution
305	--	321	Xikun Liang, Limin Tao, Bin Hu. Image bit planes approximate reconstruction and encryption based on Gaussian function and multiple parameters chaos
323	--	332	Hanguang Xiao, Yuewei Li, Yu Xiu, Qingling Xia. Development of outdoor swimmers detection system with small object detection method based on deep learning
333	--	346	Wen Guo, Dong Li, Bowen Liang, Bin Shan. Multi-view region proposal network predictive learning for tracking
347	--	359	Hufei Wang, Kaiqiang Zhao, Dexin Zhao. A triple fusion model for cross-modal deep hashing retrieval
361	--	375	Nesrine Tarhouni, Masmoudi Salma, Maha Charfeddine, Chokri Ben Amar. Fake COVID-19 videos detector based on frames and audio watermarking
377	--	387	Wanjun Liu, Junkai Wang, Haicheng Qu, Lei Shen. Hierarchical MVSNet with cost volume separation and fusion based on U-shape feature extraction
389	--	399	Qiming Yan, Yubao Sun, Shaojing Fan, Liling Zhao. Polarity-aware attention network for image sentiment analysis
401	--	420	Srishti Yadav, Shahram Payandeh. DATaR: Depth Augmented Target Redetection using Kernelized Correlation Filter
421	--	433	Sahar Dammak, Hazar Mliki, Emna Fendri. Gender estimation based on deep learned and handcrafted features in an uncontrolled environment
435	--	446	Yang Yang 0046, Yiwen Xiong, Yanqing Cao, Lanling Zeng, Yan Zhao, Yongzhao Zhan. Fast bilateral filter with spatial subsampling
449	--	0	Honggui Li, Dimitri Galayko. Correction to: Deep reconstruction of 1D ISOMAP representations
451	--	0	Muhammad Pervez Akhter, Jiangbin Zheng, Irfan Raza Naqvi, Mohammed Abdelmajeed, Tehseen Zia. Correction to: Abusive language detection from social media comments using conventional machine learning and deep learning approaches
453	--	0	Alok Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay. Correction to: Attention based video captioning framework for Hindi
455	--	0	Hanyun Zhang, Dongliang Guo, Wei Liu, Junlan Nie, Shuo Li. Correction to: An improved algorithm of video quality assessment by danmaku analysis

External Links

Journal: Multimedia Syst.

Volume 29, Issue 6

Volume 29, Issue 5

Volume 29, Issue 4

Volume 29, Issue 3

Volume 29, Issue 2

Volume 29, Issue 1