- Coarse-Fine Spectral-Aware Deformable Convolution for Hyperspectral Image ReconstructionJincheng Yang, Lishun Wang, Miao Cao, Huan Wang, Yinping Zhao, Xin Yuan 0002. 1-7 [doi]
- HoloGesture: A Multimodal Dataset For Hand Gesture Recognition Robust To Hand Textures On Head-Mounted Mixed-Reality DevicesJeongwoo Park, Je Hyeong Hong. 1-7 [doi]
- A New People-Object Interaction Dataset and NVS BenchmarksShuai Guo 0002, Houqiang Zhong, Qiuwen Wang, Ziyu Chen, Yijie Gao, Jiajing Yuan, Chenyu Zhang, Rong Xie, Li Song 0001. 8-14 [doi]
- Thqa: A Perceptual Quality Assessment Database for Talking HeadsYingjie Zhou, Zicheng Zhang, Wei Sun, Xiaohong Liu 0001, Xiongkuo Min, Zhihua Wang, Xiao-Ping Zhang 0002, Guangtao Zhai. 15-21 [doi]
- VR-Based Generation of Photorealistic Synthetic Data for Training Hand-Object Tracking ModelsChengyan Zhang, Rahul Chaudhari. 22-28 [doi]
- Removing Reflective Flare in Real-World ConditionsFengbo Lan, Chang Wen Chen. 29-33 [doi]
- PVDN-Urban - A Dataset for Provident Vehicle Detection at Night in Urban ScenariosLukas Ewecker, Florian Schiffel, Robin Schwager, Tim Brühl, Tin Stribor Sohn, Thomas Villmann. 34-40 [doi]
- Towards Unifying Anatomy Segmentation: Automated Generation of a Full-Body CT DatasetAlexander Jaus, Constantin Seibold, Kelsey Hermann, Negar Shahamiri, Alexandra Walter, Kristina Giske, Johannes Haubold, Jens Kleesiek, Rainer Stiefelhagen. 41-47 [doi]
- Subjective Portrait Region Cropping On Landscape Video StudyCheng-Han Lee, Maniratnam Mandal, Neil Birkbeck, Yilin Wang 0001, Balu Adsumilli, Alan C. Bovik. 48-54 [doi]
- EarthquakeNet: A High-Resolution UAV-Based Dataset for Earthquake Damage AssessmentShenlu Jiang, Yuxin Bian, Yiran Wang, Xufeng Li, Zhankeng Liu, Yi Ren, Yunxuan Zhao. 55-61 [doi]
- Bri3L: A Brightness Illusion Image Dataset for Identification and Localization of Regions of Illusory PerceptionAniket Roy, Anirban Roy, Soma Mitra, Kuntal Ghosh. 62-68 [doi]
- Co2Wounds-V2: Extended Chronic Wounds Dataset from Leprosy PatientsKaren Sanchez, Carlos Hinojosa, Olinto Mieles, Chen Zhao 0002, Bernard Ghanem, Henry Arguello. 69-75 [doi]
- CAPTIV8: A Comprehensive Large Scale Capsule Endoscopy Dataset For Integrated DiagnosisAnuja Vats, Bilal Ahmad, Pål Anders Floor, Ahmed Kedir Mohammed, Marius Pedersen, Øistein Hovde. 76-82 [doi]
- A Real-World Satellite Video Subjective QOE DatabaseBowen Chen, Zaixi Shang, Alan C. Bovik, Jae-won Chung, David Lerner. 83-88 [doi]
- SE3D: A Framework for Saliency Method Evaluation in 3D ImagingMariusz Wisniewski, Loris Giulivi, Giacomo Boracchi. 89-95 [doi]
- Youtube SFV+HDR Quality DatasetYilin Wang 0001, Joong Gon Yim, Neil Birkbeck, Balu Adsumilli. 96-102 [doi]
- Bmt-Bench: A Benchmark Sports Dataset For Video GenerationZiang Shi, Yang Xiao, Da Yan 0001, Min-Te Sun, Wei-Shinn Ku, Bo Hui 0001. 103-109 [doi]
- OpenAnimalTracks: A Dataset for Animal Track RecognitionRisa Shinoda, Kaede Shiohara. 110-116 [doi]
- A Toolkit to Benchmark Point Cloud Quality Metrics with Multi-Track Evaluation CriteriaAli Ak, Emin Zerman, Maurice Quach, Aladine Chetouani, Giuseppe Valenzise, Patrick Le Callet. 117-123 [doi]
- Long-Term Geo-Positioned Re-Identification Dataset of Urban ElementsPaula Moral, Álvaro García-Martín, José M. Martínez. 124-130 [doi]
- ODVISTA: An Omnidirectional Video Dataset for Super-Resolution and Quality Enhancement TasksAhmed Telili, Ibrahim Farhat, Wassim Hamidouche, Hadi Amirpour. 131-136 [doi]
- Synthmanticlidar: A Synthetic Dataset For Semantic Segmentation On Lidar ImagingJavier Montalvo, Pablo Carballeira, Álvaro García-Martín. 137-143 [doi]
- On the Cloud Detection from Backscattered Images Generated from a Lidar-Based Ceilometer: Current State and OpportunitiesAlessio Barbaro Chisari, Alessandro Ortis, Luca Guarnera, Wladimiro Carlo Patatu, Rosaria Ausilia Giandolfo, Emanuele Spampinato, Sebastiano Battiato, Mario Valerio Giuffrida. 144-150 [doi]
- SODA: A Dataset for Small Object Detection in UAV Captured ImageryDaniel Pisani, Dylan Seychell, Carl James Debono, Michael Schembri. 151-157 [doi]
- DAPlankton: Benchmark Dataset For Multi-Instrument Plankton Recognition Via Fine-Grained Domain AdaptationDaniel Batrakhanov, Tuomas Eerola, Kaisa Kraft, Lumi Haraguchi, Lasse Lensu, Sanna Suikkanen, María Teresa Camarena-Gómez, Jukka Seppälä, Heikki Kälviäinen. 158-164 [doi]
- A Dataset for Understanding Open UGC Video DatasetsPierre R. Lebreton, Patrick Le Callet, Neil Birkbeck, Yilin Wang 0001, Balu Adsumilli. 165-171 [doi]
- 3D-COCO: Extension of MS-COCO Dataset for Scene Understanding and 3D ReconstructionBideaux Maxence, Phe Alice, Mohamed Chaouch, Luvison Bertrand, Quoc-Cuong Pham. 172-178 [doi]
- MWIRSTD: A MWIR Small Target Detection DatasetNikhil Kumar, Avinash Upadhyay, Shreya Sharma, Manoj Sharma, Pravendra Singh. 179-185 [doi]
- LWIRPOSE: A Novel Long Wave Infrared Thermal Image Pose Dataset and BenchmarkAvinash Upadhyay, Bhipanshu Dhupar, Manoj Sharma, Ankit Shukla, Ajith Abraham. 186-192 [doi]
- Unicrowd Simulator: Visual and Behavioral Fidelity For The Generation of Crowd DatasetsNiccoló Bisagno, Antonio Luigi Stefani, Nicola Garau, Francesco G. B. De Natale, Nicola Conci. 193-199 [doi]
- Multi-View Multi-Focus Image Fusion: A Novel Benchmark Dataset and MethodZhilong Li, Kejun Wu, Junhao Liu 0001, Qiong Liu 0001, You Yang. 200-206 [doi]
- Paon: A New Neuron Model Using Padé ApproximantsOnur Keles, A. Murat Tekalp. 207-213 [doi]
- Non-Separable Wavelet Transform Using Learnable Convolutional Lifting StepsJoao O. Parracho, Eduardo A. B. da Silva, Lucas A. Thomaz, Luis M. N. Tavora, Sérgio M. M. Faria. 214-220 [doi]
- Robustness of Tensor Decomposition-Based Neural Network CompressionThéo Rudkiewicz, Mohamed Ouerfelli, Riccardo Finotello, Zakariya Chaouai, Mohamed Tamaazousti. 221-227 [doi]
- Explaining Representation Learning With Perceptual ComponentsYavuz Yarici, Kiran Kokilepersaud, Mohit Prabhushankar, Ghassan Alregib. 228-234 [doi]
- ET: Explain to Train: Leveraging Explanations to Enhance the Training of A Multimodal TransformerMeghna P. Ayyar, Jenny Benois-Pineau, Akka Zemmari. 235-241 [doi]
- Saliency As A Schedule: Intuitive Image AttributionAniket Singh, Anoop M. Namboodiri. 242-248 [doi]
- ATAC-NET: Zoomed View Works Better for Anomaly DetectionShaurya Gupta, Neil Gautam, Anurag Malyala. 249-255 [doi]
- Rotated R-CNN: A Two-Stage Object Detection Method Adapted To Oriented Bounding BoxesChengdao Pu, Jun Yu 0001, Wen Su, Tianyu Liu. 256-262 [doi]
- Masked Momentum Contrastive Learning for Semantic Understanding by ObservationJiantao Wu, Shentong Mo, Sara Atito 0001, Zhenhua Feng 0001, Josef Kittler, Syed Sameed Husain, Muhammad Awais 0001. 263-269 [doi]
- Taxes are All You Need: Integration Of Taxonomical Hierarchy Relationships Into the Contrastive LossKiran Kokilepersaud, Yavuz Yarici, Mohit Prabhushankar, Ghassan Alregib. 270-276 [doi]
- Imbalanced Data Robust Online Continual Learning Based on Evolving Class Aware Memory Selection and Built-In Contrastive Representation LearningRui Yang, Emmanuel Dellandréa, Matthieu Grard, Liming Chen 0002. 277-283 [doi]
- Conditional Past Experience Generation for Dark Continual LearningCheng Feng, Chaoliang Zhong, Jie Wang, Jun Sun 0004, Yasuto Yokota. 284-290 [doi]
- Unsupervised Domain Adaptive Semantic Segmentation Based on Clip-Guided Prototypical Contrastive LearningKebin Liu, Chuang Zhu. 291-297 [doi]
- Instance-Aware Uncertainty for Active Learning in Object DetectionZhipeng Zhang, Wenting Ma, Xiaohang Yuan, Yuan-Hao, Meng Guo, Hongyi Tang, Zhiheng Zhou, Zhenjie Yao. 298-304 [doi]
- Spatiality-Aware Prompt Tuning for Few-Shot Small Object DetectionTakumi Karasawa, Nakamasa Inoue, Rei Kawakami. 305-311 [doi]
- Disentangled Knowledge Distillation for Unified Multi-Class Anomaly DetectionJiyong Jang, Hayeon Lee, Younkwan Lee. 312-318 [doi]
- MMAQ: A Multi-Modal Self-Supervised Approach For Estimating Air Quality From Remote Sensing DataGeorgios-Fotios Angelis, Alexandros Emvoliadis, Anastasios Drosou, Dimitrios Tzovaras. 319-325 [doi]
- Crowdassign: A Label Assignment Scheme for Pedestrian Detection in Crowded ScenesZihao Li, Ning Luo, Xiwen Zhang, Ziliang Guo, Xingqi Fang, Yu Qiao 0003. 326-331 [doi]
- MVAFormer: RGB-Based Multi-View Spatio-Temporal Action Recognition with TransformerTaiga Yamane, Satoshi Suzuki, Ryo Masumura, Shotaro Tora. 332-338 [doi]
- Prune Channel And Distill: Discriminative Knowledge Distillation For Semantic SegmentationBokyeung Lee, Kyungdeuk Ko, Jonghwan Hong, Hanseok Ko. 339-345 [doi]
- TDAD: Trident Distillations for Anomaly DetectionWenrui Hu, Yuan Xie, Wei Yu. 346-352 [doi]
- Neural Mesh Fusion: Unsupervised 3D Planar Surface UnderstandingFarhad G. Zanjani, Hong Cai, Yinhao Zhu, Leyla Mirvakhabova, Fatih Porikli. 353-359 [doi]
- Contrast-Guided Wireframe ParsingXueyuan Chen, Baojiang Zhong. 360-366 [doi]
- Adversarial Robustness for Deep Metric LearningEzgi Paket, Inci M. Baytas. 367-373 [doi]
- Adversarial Detection Transformer For Kuzushiji RecognitionPengfeng Lu, Sei-ichiro Kamata, Mengyunqiu Zhang, Weilian Zhou. 374-380 [doi]
- Improving Automatic Target Recognition With Infrared Imagery Using Vision Transformers and Focused Data AugmentationNada Baili, Hichem Frigui. 381-387 [doi]
- Graph Convolutional Networks With Minimal Appearance Information For Action RecognitionHiroaki Tani. 388-394 [doi]
- Knowledge-Infused Learning for Fine-Grained Plant Disease RecognitionJamil Ahmad, Wail Gueaieb, Abdulmotaleb El-Saddik, Giulia De Masi, Fakhri Karray. 395-401 [doi]
- Adaptxray: Vision Transformer And Adapter In X-Ray Images For Prohibited Items DetectionYaobin Huang, Hongxia Gao, Xiaomeng Li. 402-408 [doi]
- Fusion of Independent and Interactive Features for Human-Object Interaction DetectionZehai Wu, Lijie Sheng, Songnian Zhang, Qiguang Miao. 409-415 [doi]
- Source-Free Continual Adaptive Learning With Limited Labels on Evolving Data DriftsAmrutha Machireddy, Ranganath Krishnan, Athmanarayanan Lakshmi Narayanan, Omesh Tickoo. 416-422 [doi]
- SLNL: Soft Label Regularization For Semi-Supervised Facial Expression Recognition With Negative Label LearningYouwei Zhang, Jing Jiang, Yuying Zhao, Kongming Liang. 423-429 [doi]
- Pyramid Coder: Hierarchical Code Generator for Compositional Visual Question AnsweringRuoyue Shen, Nakamasa Inoue, Koichi Shinoda. 430-436 [doi]
- Diversifying Deep Ensembles: A Saliency Map Approach for Enhanced OOD Detection, Calibration, and AccuracyStanislav Dereka, Ivan Karpukhin, Maksim Zhdanov, Sergey Kolesnikov. 437-443 [doi]
- Crocos-V1: Enhancing Mask Leakage and Bounding Box Localization for Real-Time Crop/Weed Instance SegmentationJesus Franco-Robles, Jorge E. Avilés-Mejia, Ouiddad Labbani-Igbida. 444-450 [doi]
- FedAWA: Aggregation Weight Adjustment in Federated Domain GeneralizationYiming Chen, Nan He, Lifeng Sun. 451-457 [doi]
- Memsvd: Long-Range Temporal Structure Capturing Using Incremental SVDIoanna Ntinou, Enrique Sanchez, Georgios Tzimiropoulos. 458-464 [doi]
- A Self-Supervised Diffusion Framework For Facial Emotion RecognitionSaif Hassan, Mohib Ullah, Ali Shariq Imran, Ghulam Mujtaba 0001, Muhammad Mudassar Yamin, Ehtesham Hashmi, Faouzi Alaya Cheikh, Azeddine Beghdadi. 465-471 [doi]
- Learning Orthonormal Features in Self-Supervised Learning using Functional Maximal CorrelationBo Hu, Yuheng Bu, José C. Príncipe. 472-478 [doi]
- Adaptive Adversarial Cross-Entropy Loss for Sharpness-Aware MinimizationTanapat Ratchatorn, Masayuki Tanaka. 479-485 [doi]
- Reinforcing Pre-Trained Models Using Counterfactual ImagesXiang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa 0001, Miki Haseyama. 486-492 [doi]
- Vito: Vision Transformer Optimization Via Knowledge Distillation On DecodersGiovanni Bellitto, Renato Sortino, Paolo Spadaro, Simone Palazzo, Federica Proietto Salanitri, Giuseppe Fiameni, Efstratios Gavves, Concetto Spampinato. 493-499 [doi]
- Video Class-Incremental Learning With Clip Based TransformerShuyun Lu, Jian Jiao, Lanxiao Wang, Heqian Qiu, Xingtao Lin, Hefei Mei, Hongliang Li 0001. 500-506 [doi]
- Explaining 3D Object Detection Through Shapley Value-Based Attribution MapMichihiro Kuroki, Toshihiko Yamasaki. 507-513 [doi]
- Features Disentanglement For Explainable Convolutional Neural NetworksPasquale Coscia, Angelo Genovese, Fabio Scotti, Vincenzo Piuri. 514-520 [doi]
- Embedding Attention Blocks For Answer GroundingSeyedalireza Khoshsirat, Chandra Kambhamettu. 521-527 [doi]
- Enhanced Prototypical Part Network (EPPNet) For Explainable Image Classification Via PrototypesBhushan Atote, Victor Sanchez. 528-534 [doi]
- Interactive Teaching For Fine-Granular Few-Shot Object Recognition Using Vision TransformersPhilip Keller, Daniel Jost, Arne Roennau, Rüdiger Dillmann. 535-541 [doi]
- Joint Image Restoration For Domain Adaptive Object Detection In Foggy Weather ConditionJing Ma, Meng Lin, Gang Zhou, Zhenhong Jia. 542-548 [doi]
- Weather-Aware Drone-View Object Detection Via Environmental Context UnderstandingHyunjun Kim, Dahye Lee, Sungjune Park, Yong Man Ro. 549-555 [doi]
- Sparse Transformer Refinement Similarity Map for Aerial TrackingXi Tao, Ke Qi, Peijia Chen, Wenhao Xu, Yutao Qi. 556-562 [doi]
- LFGN: Low-Level Feature-Guided Network For Adversarial DefenseChih-Chung Hsu, Ming-Hsuan Wu, En-Chao Liu. 563-567 [doi]
- Towards Robust Person Re-Identification Via Efficient and Generalized Adversarial TrainingHuiwang Liu, Yan Huang, Linlin Zeng, Ya Li. 568-574 [doi]
- Semantic Enhanced Few-Shot Object DetectionZheng Wang, Yingjie Gao, Qingjie Liu, Yunhong Wang 0001. 575-581 [doi]
- Segment Any Object Model (SAOM): Real-To-Simulation Fine-Tuning Strategy For Multi-Class Multi-Instance SegmentationMariia Khan, Yue Qiu 0001, Yuren Cong, Bodo Rosenhahn, Jumana Abu-Khalaf, David Suter. 582-588 [doi]
- Accelerating Cascade Classifier Training with Genetic Algorithms for Edge ML ApplicationsAbhishek Saini, Sajjad Moazeni. 589-595 [doi]
- Scene Generalized Multi-View Pedestrian Detection with Rotation-Based Augmentation and RegularizationSatoshi Suzuki, Shotaro Tora, Ryo Masumura. 596-602 [doi]
- Lercpose: Learned Ranking and Contrastive Loss for Robust Head Pose EstimationAratrik Chattopadhyay, Harshita Soni, Shuaib Ahmed. 603-609 [doi]
- ECAP: Extensive Cut-and-Paste Augmentation for Unsupervised Domain Adaptive Semantic SegmentationErik Brorsson, Knut Åkesson, Lennart Svensson, Kristofer Bengtsson. 610-616 [doi]
- Intelligent Multi-View Test Time AugmentationEfe Ozturk, Mohit Prabhushankar, Ghassan Alregib. 617-623 [doi]
- Superpixel Mixing: A Data Augmentation Technique For Robust Deep Visual Recognition ModelsDanyang Sun, Fadi Dornaika, Vinh Truong Hoang, Nagore Barrena. 624-630 [doi]
- Diversified Task Augmentation with Redundancy Reduction for Cross-Domain Few-Shot LearningLing Yue, Lin Feng, Qiuping Shuai, Lingxiao Xu, Zihao Li. 631-637 [doi]
- Semi-Supervised 3D Object Detection With Channel Augmentation Using Transformation EquivarianceMinju Kang, Taehun Kong, Tae-Kyun Kim 0001. 638-644 [doi]
- Decoupling Domain Invariance and Variance With Tailored Prompts for Open-Set Domain AdaptationShihao Zeng, Xinghong Liu, Yi Zhou. 645-651 [doi]
- Cascading Unknown Detection With Known Classification For Open Set RecognitionDaniel Brignac, Abhijit Mahalanobis. 652-658 [doi]
- 3Dlaneformer: Rethinking Learning Views for 3D Lane DetectionKun Dong, Jian Xue, Xing Lan, Ke Lu 0002. 659-665 [doi]
- AdvART: Adversarial Art for Camouflaged Object Detection AttacksAmira Guesmi, Ioan Marius Bilasco, Muhammad Shafique 0001, Ihsen Alouani. 666-672 [doi]
- LSDM-PCB: A Lightweight Small Defect Detection Model for Printed Circuit BoardQi Zeng, Chongren Zhao, Pengfei He, Hongchao Gao. 673-679 [doi]
- Set-Nas: Sample-Efficient Training For Neural Architecture Search With Strong Predictor And Stratified SamplingYu-ming Zhang, Jun-Wei Hsieh, Yu-Hsiu Chang, Xin Li 0005, Ming-Ching Chang, Chun-Chieh Lee, Kuo-Chin Fan. 680-686 [doi]
- Contextuality Helps Representation Learning for Generalized Category DiscoveryTingzhang Luo, Mingxuan Du, Jiatao Shi, Xinxiang Chen, Bingchen Zhao, Shaoguang Huang. 687-693 [doi]
- Uimt: A Framework for Improving Unimodal Inference via Multimodal TrainingKateryna Chumachenko, Alexandros Iosifidis, Moncef Gabbouj. 694-700 [doi]
- Box-Level Class-Balanced Sampling For Active Object DetectionJingyi Liao, Xun Xu 0002, Chuan-Sheng Foo, Lile Cai. 701-707 [doi]
- Rsud20K: a Dataset for Road Scene Understanding in Autonomous DrivingHasib Zunair, Md Shakib Khan, A. Ben Hamza. 708-714 [doi]
- Multimodal-Enhanced Objectness Learner For Corner Case Detection In Autonomous DrivingLixing Xiao, Ruixiao Shi, Xiaoyang Tang, Yi Zhou. 715-721 [doi]
- Continual Road-Scene Semantic Segmentation Via Feature-Aligned Symmetric Multi-Modal NetworkFrancesco Barbato, Elena Camuffo, Simone Milani, Pietro Zanuttigh. 722-728 [doi]
- Open World Object Detection Via Cooperative Foundation Models for Driving ScenesSheng Luo, Yi Zhou. 729-735 [doi]
- Pose-Invariant Learning for Efficient Person Identification from Hyperspectral Hand ImagesKeigo Kunikata, Amane Kashino, Yota Yamamoto, Yukinobu Taniguchi, Yoko Sogabe, Ayumi Matsumoto, Masaki Kitahara, Go Irie. 736-740 [doi]
- RFNET: Refined Fusion Three-Branch RGB-D Salient Object Detection NetworkKexuan Wang, Chenhua Liu, Huiguang Wei, Li Jing, Rongfu Zhang. 741-746 [doi]
- Integrating Vision-Language Supervision for Uniform Appearance TrackingMohamad Alansari, Ahmed Abughali, Obadah Habash, Khaled Alnuaimi, Sajid Javed, Naoufel Werghi. 747-752 [doi]
- CLIFS: Clip-Driven Few-Shot Learning for Baggage Threat ClassificationAbdelfatah Hassan Ahmed, Divya Velayudhan, Mahmoud Elmezain, Muaz Al Radi, Abderrahmene Boudiaf, Taimur Hassan, Mohamed Deriche, Mohammed Bennamoun, Naoufel Werghi. 753-759 [doi]
- SMO-CLIP: Enhancing Anomalous Smoke Density Assessment Using A Hybrid LLM-VLM ApproachPengfei Li, Muaz Al Radi, Mahmoud Said Elmezain, Abdelfatah Hassan Ahmed, Abderrahmene Boudiaf, Said Boumaraf, Jorge Dias 0001, Hamad Karki, Sajid Javed, Khalid Yousef Al Awadhi, Naoufel Werghi. 760-765 [doi]
- Multi-Attribute Vision Transformers are Efficient and Robust LearnersHanan Gani, Nada Saadi, Noor Hussein, Karthik Nandakumar. 766-772 [doi]
- Meta-DM: Applications of Diffusion Models on Few-Shot LearningWentao Hu, Jiarun Liu, Jiawei Wang, Hui Tian 0003. 773-779 [doi]
- Universal Black-Box Adversarial Patch Attack with Optimized Genetic AlgorithmQun Zhao, Yuan-Gen Wang. 780-786 [doi]
- Deepfake Detection With Combined Unsupervised-Supervised Contrastive LearningJunshuai Zheng, Yichao Zhou, Xiyuan Hu, Zhenmin Tang. 787-793 [doi]
- SegGuard: Defending Scene Segmentation Against Adversarial Patch AttackThomas Gittings, Steve Schneider, John P. Collomosse. 794-800 [doi]
- Face Morphing Detection in Social Media ContentAkshay Agarwal 0001, Nalini K. Ratha. 801-806 [doi]
- Ensemble of Deep Variational Mixture Models for Unsupervised ClusteringXu Tan 0004, Junqi Chen 0001, Jiawei Yang 0001, Sylwan Rahardja, Mou Wang, Susanto Rahardja. 807-813 [doi]
- Towards Robust Visual Localization Using Multi-View Images and HD Vector MapLili Zhao 0001, Zhili Liu, Qian Yin 0002, Lei Yang 0063, Meng Guo. 814-820 [doi]
- An α-Divergence Approach To Robust Canonical Correlation AnalysisWenjing Yang, Abd-Krim Seghouane, Pavel Krupskiy. 821-827 [doi]
- VF-Net: Robustness Via Understanding Distortions and TransformationsFatemeh Amerehi, Patrick Healy. 828-834 [doi]
- Factorized Embedding Graph Matching Network For Learning Lawler's Quadratic Assignment ProblemYirui Yang, Xubin Lin, Li He, Yisheng Guan, Hong Zhang. 835-841 [doi]
- PUAD: Frustratingly Simple Method for Robust Anomaly DetectionShota Sugawara, Ryuji Imamura. 842-848 [doi]
- A Single Graph Convolution is All You Need: Efficient Grayscale Image ClassificationJacob Fein-Ashley, Sachini Wickramasinghe, Bingyi Zhang, Rajgopal Kannan, Viktor K. Prasanna. 849-855 [doi]
- Light-Weight Self-Supervised Contrastive Learning Network For Small Sample Hyperspectral Image ClassificationGan Yang, Zhaohui Wang. 856-861 [doi]
- MSSPG-AL: Few-Shot Hyperspectral Image Classification with Active Learning Updated Multi-Scale Superpixel Graph FusionLong Yu, Jun Li, Li Zhuo. 862-867 [doi]
- Graphic - Graph-Based Representation for Analyzing People's High-Level Interactions in CrowdsFrancesco Longobardi, Daniel Riccio. 868-874 [doi]
- Deep Optical Flow Learning With Deformable Large-Kernel Cross-AttentionXuezhi Xiang, Yiming Chen, Denis Ombati, Lei Zhang 0093, Xiantong Zhen. 875-879 [doi]
- Koopcon: A new approach towards smarter and less complex learningVahid Jebraeeli, Bo Jiang, Derya Cansever, Hamid Krim. 880-886 [doi]
- Medical Knowledge-Guided Semi-Supervised Bi-Ventricular SegmentationBehnam Rahmati, Shahram Shirani, Zahra Keshavarz-Motamed. 887-893 [doi]
- Latent Enhancing Autoencoder for Occluded Image ClassificationKetan Kotwal, Tanay Deshmukh, Preeti Gopal. 894-900 [doi]
- Learning With Instance-Dependent Noisy Labels By Anchor Hallucination And Hard Sample Label CorrectionPo-Hsuan Huang, Chia-Ching Lin, Chih-Fan Hsu, Ming-Ching Chang, Wei-Chao Chen. 901-907 [doi]
- Driving Through Graphs: a Bipartite Graph for Traffic Scene AnalysisAditya Humnabadkar, Arindam Sikdar, Huaizhong Zhang, Tanveer Hussain, Ardhendu Behera. 908-914 [doi]
- A Text Detector Based on the Specific Text PromptXingtao Lin, Chuanyang Gong, Lanxiao Wang, Heqian Qiu, Shengyu Tong, Hongliang Li 0001. 915-921 [doi]
- Deep Spectral Siamese Network For Heterogeneous Object Verification In Amazon Robotic WarehouseMaryam Rahnemoonfar. 922-928 [doi]
- Anomaly Unveiled: Securing Image Classification against Adversarial Patch AttacksNandish Chattopadhyay, Amira Guesmi, Muhammad Shafique 0001. 929-935 [doi]
- Similarity-Weighted IoU (sIOU): A Comprehensive Metric for Evaluating Model Performance Through Similarity-Weighted Class OverlapsUmamaheswaran Raman Kumar, Patrick Vandewalle. 936-942 [doi]
- Reading is Believing: Revisiting Language Bottleneck Models for Image ClassificationHonori Udo, Takafumi Koshinaka. 943-949 [doi]
- Norm-Integrated Softmax Loss For Deep Face RecognitionJun Chen, Yiwei Wang, Haiyan Zhang. 950-956 [doi]
- A Multi-Scale Feature Fusion Network for Chip Surface Defect DetectionHaoang Ren, Mengke Tian, Guanwen Zhang, Wei Zhou. 957-962 [doi]
- Power-Llava: Large Language and Vision Assistant for Power Transmission Line InspectionJiahao Wang, Mingxuan Li, Haichen Luo, Jinguo Zhu, Aijun Yang, Mingzhe Rong, Xiaohua Wang. 963-969 [doi]
- Enhanced Detection of Small Objects in Aerial Imagery: A High-Resolution Neural Network Approach With Amplified Feature Pyramid and Sigmoid Re-WeightingChanyeong Park, Junbo Jang, Heegwang Kim, Joonki Paik. 970-976 [doi]
- Decompl: Decompositional Learning with Attention Pooling for Group Activity Recognition from a Single Volleyball ImageBerker Demirel, Huseyin Ozkan. 977-983 [doi]
- Aerial View River Landform Video Segmentation: A Weakly Supervised Context-Aware Temporal Consistency Distillation ApproachChi Han Chen, Chieh-Ming Chen, Wen-Huang Cheng, Ching-Chun Huang. 984-990 [doi]
- Salient Guided Text Detection in E-Commerce ImagesBoon Yin Yin, Nurul Japar. 991-997 [doi]
- CenterRadarNet: Joint 3D Object Detection and Tracking Framework Using 4D FMCW RadarJen-Hao Cheng, Sheng-Yao Kuan, Hou-I Liu, Hugo Latapie, Gaowen Liu, Jenq-Neng Hwang. 998-1004 [doi]
- Exploring the Potential of Synthetic Data to Replace Real DataHyungtae Lee, Yan Zhang, Heesung Kwon, Shuvra S. Bhattacharyya. 1005-1011 [doi]
- Class-Specific Channel Attention For Few Shot LearningYi-Kuan Hsieh, Jun-Wei Hsieh, Ying-Yu Chen. 1012-1018 [doi]
- Mdbfusion: A Visible And Infrared Image Fusion Framework Capable For Motion DeblurringJun Chen, Wei Yu, Xin Tian, Jun Huang, Jiayi Ma. 1019-1025 [doi]
- Advanced Object Detection in Multibeam Forward-Looking Sonar Images Using Linear Cross-Attention TechniquesGangqi Chen, Zhaoyong Mao, Junge Shen. 1026-1031 [doi]
- Attention Enhancement With Parallel Groups for Remote Sensing Object DetectionZhigang Yang 0003, Yiming Liu, Zehao Gao, Jiayue He, Tao Chen 0002, Wei Emma Zhang. 1032-1036 [doi]
- Adaptative Context Normalization: A Boost for Deep Learning in Image ProcessingBilal Faye, Hanane Azzag, Mustapha Lebbah, Djamel Bouchaffra. 1037-1043 [doi]
- Efficient Black-Box Adversarial Attack on Deep Clustering ModelsNan Yang, Zihan Li, Zhen Long, Xiaolin Huang, Ce Zhu, Yipeng Liu 0001. 1044-1049 [doi]
- Mask-Based Invisible Backdoor Attacks on Object DetectionJeongjin Shin. 1050-1056 [doi]
- U-Tell: Unsupervised Task Expert Lifelong LearningIndu Solomon, Aye Phyu Phyu Aung, Uttam Kumar 0001, Senthilnath Jayavelu. 1057-1063 [doi]
- AAGF: An Efficient Transformer With Mix-Features For Visual Place RecognitionKuan Zhou, Zhenyu Xu, Qieshi Zhang, Jun Cheng, Ziliang Ren, Xiangyang Gao. 1064-1070 [doi]
- A Decoding Scheme With Successive Aggregation of Multi-Level Features For Light-Weight Semantic SegmentationJiwon Yoo, Jangwon Lee, Gyeonghwan Kim. 1071-1077 [doi]
- Gengmm: Generalized Gaussian-Mixture-Based Domain Adaptation Model for Semantic SegmentationNazanin Moradinasab, Hassan Jafarzadeh, Donald E. Brown. 1078-1084 [doi]
- Adversarially Robust Continual Learning with Anti-Forgetting LossKoki Mukai, Soichiro Kumano, Nicolas Michel, Ling Xiao 0001, Toshihiko Yamasaki. 1085-1091 [doi]
- Density-Guided Dense Pseudo Label Selection for Semi-Supervised Oriented Object DetectionTong Zhao, Qiang Fang, Shuohao Shi, Xin Xu 0001. 1092-1098 [doi]
- Online Anchor-Based Training For Image Classification TasksMaria Tzelepi, Vasileios Mezaris. 1099-1105 [doi]
- MAVAD: Audio-Visual Dataset and Method for Anomaly Detection in Traffic VideosBlazej Leporowski, Arian Bakhtiarnia, Nicole Bonnici, Adrian Muscat, Luca Zanella, Yiming Wang 0002, Alexandros Iosifidis. 1106-1112 [doi]
- Unleashing Fine-Coarse Curve Perception Via Trunk-Branch PerturbationYunxiang Cao, Li Chen, Yubo Wang, Zhida Feng, Xiaoming Liu. 1113-1119 [doi]
- FlexAE: A Self-Conditioned Detector To Prevent Model Overfitting For Unsupervised Video Anomaly DetectionJunqi Chen, Xu Tan 0004, Jiawei Yang 0001, Sylwan Rahardja, Susanto Rahardja. 1120-1125 [doi]
- Dynamic Activation Function Based on the Branching Process and its Application in Image ClassificationWanting Zhang, Libao Zhang. 1126-1132 [doi]
- Adaprompt: Prompt Tuning with Adaptive Neighbours for Generalized Category DiscoveryLiyana Sahir, Anwesha Banerjee, Soma Biswas. 1133-1138 [doi]
- SG-JND: Semantic-Guided Just Noticeable Distortion Predictor for Image CompressionLinhan Cao, Wei Sun, Xiongkuo Min, Jun Jia, Zicheng Zhang, Zijian Chen 0001, Yucheng Zhu, Lizhou Liu, Qiubo Chen, Jing Chen, Guangtao Zhai. 1139-1145 [doi]
- Perceptual Learned Image Compression via End-to-End JND-Based OptimizationFarhad Pakdaman, Sanaz Nami, Moncef Gabbouj. 1146-1151 [doi]
- Legit: Text Legibility For User-Generated MediaManiratnam Mandal, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik. 1152-1158 [doi]
- Comparison of Crowdsourcing And Laboratory Settings for Subjective Assessment of Video Quality and Acceptability & AnnoyanceAli Ak, Abhishek Gera, Denise Noyes, Hassene Tmar, Ioannis Katsavounidis, Patrick Le Callet. 1159-1164 [doi]
- A Fusion-Based Approach for Blind Contrast-Enhanced Image RankingWael Suliman, Mohamed Deriche 0001, Naoufel Werghi, Azeddine Beghdadi. 1165-1171 [doi]
- Simsam: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image SegmentationChanda Grover Kamra, Indra Deep Mastan, Nitin Kumar 0003, Debayan Gupta. 1172-1178 [doi]
- Robust Representation Learning With Self-Distillation For Domain GeneralizationAnkur Singh, Senthilnath Jayavelu. 1179-1185 [doi]
- An Anchor-Free Contour-Based Method For Instance SegmentationTzu-Han Huang, Wen-Jiin Tsai. 1186-1192 [doi]
- Investigating Self-Supervised Methods for Label-Efficient LearningSrinivasa Nandam, Sara Atito, Zhenhua Feng, Josef Kittler, Muhammad Awais. 1193-1199 [doi]
- Masked Signal Modeling for Plastic Waste Resin ClassificationS. Ebrahimkhani, J. Zheng, A. C. Y. Ngo, N. M. Cheung. 1200-1206 [doi]
- DTSN: No-Reference Image Quality Assessment via Deformable Transformer and Semantic NetworkLong Tang, Liang Yuan, Guoquan Zheng, Zesheng Wang 0004, Guangtao Zhai. 1207-1211 [doi]
- Subjective Quality Assessment of Thermal Infrared ImagesGuanghui Yue 0001, Lixin Zhang, Jinxia Zhang, Zhaofei Xu, Shuigen Wang, Tianwei Zhou, Yuanhao Gong, Wei Zhou. 1212-1217 [doi]
- A Comparative Study of Perceptual Quality Metrics For Audio-Driven Talking Head VideosWeixia Zhang, Chengguang Zhu, Jingnan Gao, Yichao Yan, Guangtao Zhai, Xiaokang Yang 0001. 1218-1224 [doi]
- A Subjective Quality Evaluation of 3D Mesh With Dynamic Level of Detail in Virtual RealityDuc V. Nguyen 0001, Tran Thuy Hien, Truong Thu Huong. 1225-1231 [doi]
- A Benchmark of Variance of Opinion Scores in Image Quality AssessmentJianxun Lou, Xinbo Wu, Yingying Wu, Padraig Corcoran, Gualtiero Colombo 0001, Roger M. Whitaker, Hantao Liu. 1232-1238 [doi]
- AIGCOIQA2024: Perceptual Quality Assessment of AI Generated Omnidirectional ImagesLiu Yang, Huiyu Duan, Long Teng, Yucheng Zhu, Xiaohong Liu, Menghan Hu, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet. 1239-1245 [doi]
- Priorformer: A UGC-VQA Method With Content and Distortion PriorsYajing Pei, Shiyu Huang 0002, Yiting Lu, Xin Li, Zhibo Chen 0001. 1246-1252 [doi]
- Assessing Video Shakiness: A Novel Data And Protocols FrameworkBorhen-Eddine Dakkar, Azeddine Beghdadi, Stefania Colonnese, Naveed Iqbal 0001, Azzedine Zerguine. 1253-1259 [doi]
- Attention Down-Sampling Transformer, Relative Ranking and Self-Consistency For Blind Image Quality AssessmentMohammed Alsaafin, Musab Alsheikh, Saeed Anwar, Muhammad Usman. 1260-1266 [doi]
- Enhancing Perceptual Quality Assessment for 360-Degree Images Based on Adaptive Patch Labeling and Multi-Label LearningAbderrezzaq Sendjasni, Mohamed-Chaker Larabi. 1267-1273 [doi]
- SANERV: Scene-Adaptive Neural Representation for VideosHochang Rhee, Haesoo Chung, Junho Jo, Eunji Lee, Nam Ik Cho. 1274-1280 [doi]
- Estate: Expert-Guided State Text Enhancement for Zero-Shot Industrial Anomaly DetectionBingke Zhu, Hao Li, Changlin Chen, Liujie Hua, Jinqiao Wang. 1281-1287 [doi]
- Scalable Hypersphere Embedding For Semantic Metric LearningLovre Antonio Budimir, Marko Subasic, Zoran Kalafatic, Sven Loncaric. 1288-1294 [doi]
- Temporal Transformer Encoder for Video Class Incremental LearningNattapong Kurpukdee, Adrian G. Bors. 1295-1301 [doi]
- Improving Self-Supervised Vision Transformers for Visual ControlWonil Song, Kwanghoon Sohn, Dongbo Min. 1302-1308 [doi]
- Multi-Task Affinity Propagation Based Natural Image MattingRenkai Zhang, Nong Sang. 1309-1315 [doi]
- AdaViPro: Region-Based Adaptive Visual Prompt For Large-Scale Models AdaptingMengyu Yang, Ye Tian, Lanshan Zhang, Xiao Liang, Xuming Ran, Wendong Wang. 1316-1322 [doi]
- Localization of Image Splicing Under Segment Anything Model With Integrated Compression and Edge ArtifactsRuhao Zhao, Xian Zhong, Liang Liao, Wenxuan Liu, Wenxin Huang, Zheng Wang 0007. 1323-1329 [doi]
- Collaborative Intelligence For Vision Transformers: A Token Sparsity-Driven Edge-Cloud FrameworkMonikka Roslianna Busto, Shohei Enomoto, Takeharu Eda. 1330-1335 [doi]
- Dtpose: Learning Disentangled Token Representation For Effective Human Pose EstimationShiyang Ye, Yuan Fang, Hong Liu, Hu Chen 0002, Wenchao Du, Hongyu Yang. 1336-1342 [doi]
- A Context-Oriented Multi-Scale Neural Network for Fire SegmentationTony Zhang, Robert P. Dick. 1343-1349 [doi]
- VCDSet: A New Vehicle Collision Dataset In Asia Countries For Anticipating AccidentsChih-Chung Hsu, Yun-Zhong Jiang, Wei-Hao Huang. 1350-1356 [doi]
- Detectability of Defects in the Presence of Linear Nuisance Parameters and Images Signal-Dependent NoiseRémi Cogranne. 1357-1363 [doi]
- 3GCN: Sport Scoring Siamese Graph Convolution NetworkYuxi Lu, Zhuming Zhang, Shiming Lin, Dengpan Zhang, Haibin Ma, Zengchang Qin. 1364-1370 [doi]
- U-Convnext Network for Infrared Small Target DetectionJian Ma, Xiuhong Li, Yuye Zhang, Boyuan Li, Dangxuan Wu, Zhenhong Jia. 1371-1376 [doi]
- Surface Anomaly Detection With Anomalous Feature Restriction And Difference-Aware EnhancementJinhui Zhao, Hongxia Gao, Tongtong Liu 0003. 1377-1383 [doi]
- Temporal-Spatial SPDAGG Network For Skeleton-Based Human Action Recognition From Aerial PerspectivesMohamed Sanim Akremi, Najett Neji, Hedi Tabia. 1384-1390 [doi]
- A Statistical Image Realism Score For Deepfake DetectionYunzhuo Chen, Naveed Akhtar, Nur Al Hasan Haldar, Jordan Vice, Ajmal Mian. 1391-1396 [doi]
- Low-Rank Matrix and Tensor Decomposition Using Randomized Two-Sided Subspace Iteration With Application to Video ReconstructionMaboud F. Kaloorazi, Salman Ahmadi-Asl, Susanto Rahardja. 1397-1402 [doi]
- Correlation-Aware Joint Pruning-Quantization using Graph Neural NetworksMuhammad Nor Azzafri Nor-Azman, Usman Ullah Sheikh, Mohammed Sultan Mohammed, Jeevan Sirkunan, Muhammad Nadzir Marsono. 1403-1409 [doi]
- A Sparse Graph Formulation for Efficient Spectral Image SegmentationRahul Palnitkar, Jeová Farias Sales Rocha Neto. 1410-1416 [doi]
- When Self-Supervised Pre-Training Meets Single Image DenoisingHamadi Chihaoui, Paolo Favaro. 1417-1423 [doi]
- SN-NET: Semismooth Newton Driven Lightweight Network for Real-World Image DenoisingChenxiao Zhang, Xin Deng 0002, Hongpeng Sun, Jingyi Xu, Mai Xu. 1424-1430 [doi]
- Constructing an Interpretable Deep Denoiser by Unrolling Graph Laplacian RegularizerSeyed Alireza Hosseini, Tam Thuc Do, Gene Cheung, Yuichi Tanaka 0001. 1431-1437 [doi]
- Unsupervised Coordinate-Based Video DenoisingMary Damilola Aiyetigbo, Dineshchandar Ravichandran, Reda Chalhoub, Peter Kalivas, Feng Luo, Nianyi Li. 1438-1444 [doi]
- B-Walk: Bernoulli Principle Guided Biased Random Walk for Curve ConnectionZhuang Sun, Li Chen, Zhida Feng, Xiaoming Liu. 1445-1451 [doi]
- Dual-Path Coupled Image Deraining Network Via Spatial-Frequency InteractionYuhong He, Aiwen Jiang, Lingfang Jiang, Long Peng, Zhifeng Wang 0006, Lu Wang. 1452-1458 [doi]
- Declouding of Satellite Images for Crop Growth Monitoring Via Unrolling of Gradient Graph Laplacian RegularizerParham Eftekhar, Gene Cheung, Tim Eadie. 1459-1465 [doi]
- Real-World Atmospheric Turbulence Correction Via Domain AdaptationXijun Wang 0003, Santiago López-Tapia, Aggelos K. Katsaggelos. 1466-1472 [doi]
- UTrCGAN: Uncertainty-Driven Cycle-Consistent Generative Adversarial Network for Low-Light Image EnhancementJingShuo Guan, Na Qi, Qing Zhu, Liang Chen. 1473-1479 [doi]
- A Spatio-Temporal Aligned SUNet Model For Low-Light Video EnhancementRuirui Lin, Nantheera Anantrasirichai, Alexandra Malyugina, David Bull 0001. 1480-1486 [doi]
- Dual Attention Enhanced Transformer for Image Defocus DeblurringYuhang He, Senmao Tian, Jian Zhang, Shunli Zhang. 1487-1493 [doi]
- A Dictionary Based Approach for Removing Out-of-Focus BlurUditangshu Aurangabadkar, Anil C. Kokaram. 1494-1499 [doi]
- Bayesian Blind Image Deconvolution using an Hyperbolic-Secant priorFrancisco M. Castro-Macías, Fernando Pérez-Bueno, Miguel Vega, Javier Mateos, Rafael Molina 0001, Aggelos K. Katsaggelos. 1500-1506 [doi]
- DCCM: Dual Data Consistency Guided Consistency Model for Inverse ProblemsJiahao Tian, Ziyang Zheng, Xinyu Peng, Yong Li, Wenrui Dai, Hongkai Xiong. 1507-1513 [doi]
- Two Heads Better Than One: Dual Degradation Representation for Blind Super-ResolutionHsuan Yuan, Shao-Yu Weng, I-Hsuan Lo, Wei-chen Chiu, Yu-Syuan Xu, Hao-Chien Hsueh, Jen-Hui Chuang, Ching-Chun Huang. 1514-1520 [doi]
- RFG-HDR: Representative Feature-Guided Transformer For Multi-Exposure High Dynamic Range ImagingKeuntek Lee, Jaehyun Park, Gu Yong Park, Nam Ik Cho. 1521-1527 [doi]
- Object-Aware Adaptive Image Retargeting Via Importance Map FusionZiyad Alswaidan, M. Hashem Shullar, Khalil Chikhaoui, Motaz Alfarraj. 1528-1533 [doi]
- Intrinsic Image Decomposition Based on Quantized Prior CodebookFangzheng Yuan, Xiaoyue Jiang, Xiaoyi Feng, Moncef Gabbouj. 1534-1539 [doi]
- Coarse-To-Fine Spatio-Temporal Luminance-Aware Reconstruction For High-Speed Motion SceneZhangke Wang, Na Qi, Xiyuan Zhao, Wei Xu, Jingzhong Qi, Qing Zhu. 1540-1546 [doi]
- Draft - Distilled Recurrent All-Pairs Field Transforms For Optical FlowYanick Christian Tchenko, Hicham Hadj-Abdelkader, Hedi Tabia. 1547-1553 [doi]
- Start-Tv: A Closed-Form Initialization For Total Variation ModelsYuanhao Gong, Guanghui Yue 0001. 1554-1559 [doi]
- A Novel Architecture for Image Vectorization with Increasing GranularityJunhao Huang, Fang Zhang, Meiliang Liu, Zhengye Si, Zhiwen Zhao. 1560-1566 [doi]
- Lightweight Recurrent Neural Network for Image Super-ResolutionMir Sazzat Hossain, AKM Mahbubur Rahman, Md. Ashraful Amin, Amin Ahsan Ali. 1567-1573 [doi]
- An Image Decomposition-Guided Network for Image InterpolationJiahuan Ji, Baojiang Zhong, Kai-Kuang Ma, Fuhui Zhou, QiHui Wu. 1574-1580 [doi]
- Streamlined Hybrid Annotation Framework Using Scalable Codestream for Bandwidth-Restricted UAV Object DetectionKarim El Khoury, Tiffanie Godelaine, Simon Delvaux, Sébastien Lugan, Benoît Macq. 1581-1587 [doi]
- Face Drawing GAN by Channel Attention and Matrix Product AttentionHideyuki Ogura, Shinya Ezumi, Masaaki Ikehara. 1588-1594 [doi]
- Reconstruct Dynamic Scene for Spike Camera Based on 3D Space Time SimilarityYuanlin Wang, Ruiqin Xiong, Jing Zhao 0011, Tiejun Huang 0001. 1595-1601 [doi]
- A Practical Calibration Method for Cameras and Multiple Line-Lasers in Light Sectioning Systems for Underwater EnvironmentsTakaki Ikeda, Takafumi Iwaguchi, Diego Thomas, Hiroshi Kawasaki. 1602-1608 [doi]
- Convolutional Neural Network With Learnable Masks For EIT Based Tactile SensingIbrar Amin, Ruiyuan Kang, Hasan Al-Marzouqi, Zeyar Aung, Panos Liatsis. 1609-1615 [doi]
- Remote Sensing Image Uneven Haze Removal Based On Haze Density Estimation and Saliency-Driven Dual Channel FusionYanmeng Liu, Libao Zhang. 1616-1622 [doi]
- A Hue-Preserving Contrast Enhancement Method Using Histogram Specification for Each RGB ComponentRyushiro Matsumoto, Mashiho Mukaida, Takanori Koga, Noriaki Suetake. 1623-1628 [doi]
- Improving Image De-Raining Using Reference-Guided TransformersZihao Ye, JaeHoon Cho, Changjae Oh. 1629-1634 [doi]
- Clouds and Haze Co-Removal Based on Weight-Tuned Overlap Refinement Diffusion Model for Remote Sensing ImagesJingxuan Zhang, Libao Zhang. 1635-1641 [doi]
- FC3DNET: A Fully Connected Encoder-Decoder for Efficient DemoiréingZhibo Du, Long Peng, Yang Wang, Yang Cao 0010, Zheng-Jun Zha. 1642-1648 [doi]
- A Cnn-Transformer Network Based Snr Guided High Frequency Reconstruction for Low Light Image EnhancementJin Zhang, Haiyan Jin, Haonan Su, Yuanlin Zhang 0003, Zhaolin Xiao, Bin Wang. 1649-1655 [doi]
- Fast Unsupervised Tensor Restoration via Low-Rank DeconvolutionDavid Reixach, Josep Ramon Morros. 1656-1662 [doi]
- Project, Skate, and Refresh: Improved Schrödinger Bridge Sampler for Image RestorationZiqiang Shi, Rujie Liu. 1663-1669 [doi]
- Counting Repetitive Actions in Event StreamYuelong Zhuo, Weiling Li, Beibei Yang, Yan Fang, Huaqiang Yuan. 1670-1675 [doi]
- E2GS: Event Enhanced Gaussian SplattingHiroyuki Deguchi, Mana Masuda, Takuya Nakabayashi, Hideo Saito. 1676-1682 [doi]
- Content-Aware Supervision For Diffusion-Based Restoration of Extremely Compressed Background For VCMLe Thi Hue Dao, An Gia Vien, Jooyoung Lee 0004, Seyoon Jeong, Naeun Yang, Chul Lee. 1683-1689 [doi]
- Semantic-Region Specific Lookup Tables for Image Enhancement Via Unpaired LearningZheng-Hui Huang, Tse-Yan Lee, Li-Jen Chang, Yong-Wei Chen, Ping-Jui Chiang, Jo-Fan Wu, Yung-Yu Chuang. 1690-1696 [doi]
- Lightweight Underwater Image Enhancement via Impulse Response of Low-Pass Filter Based Attention NetworkMay Thet Tun, Yosuke Sugiura, Tetsuya Shimamura. 1697-1703 [doi]
- Super: Selfie Undistortion and Head Pose Editing with Identity PreservationPolina Karpikova, Andrei Spiridonov, Anna Vorontsova, Anastasia Yaschenko, Ekaterina Radionova, Igor Medvedev, Alexander Limonov. 1704-1710 [doi]
- SFNet - A Spatial-Frequency Domain Neural Network For Image Lens Flare RemovalFlorin-Alexandru Vasluianu, Zongwei Wu, Radu Timofte. 1711-1717 [doi]
- Frequency-Spatial Domain Information Fusion Network for Pan-SharpeningMengjiao Zhao, Mengting Ma, Ao Gao, Wei Zhang. 1718-1724 [doi]
- Toward Efficient Deep Blind Raw Image RestorationMarcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte. 1725-1731 [doi]
- A Dual-Domain Collaboration Network for VCS ReconstructionJiahui Liu, Chunling Yang. 1732-1738 [doi]
- Computationally Efficient Kalman Filter Framework for Intra-Frame Image Reconstruction with a Rolling Shutter CameraSabeethan Kanagasingham, Andrew R. Mills, Visakan Kadirkamanathan. 1739-1745 [doi]
- Enhanced Facial Restoration with Misinformation-Filtered Guide-Denoising Diffusion Probabilistic ModelsWendi Liang, Yihan Wen, Zewei Wang, Jianuo Jiang, Tat-Ming Lok, Guanchong Niu. 1746-1752 [doi]
- Efficient Learned Wavelet Image and Video CodingAnna Meyer, Srivatsa Prativadibhayankaram, André Kaup. 1753-1759 [doi]
- Motion-Adaptive Inference for Flexible Learned B-Frame CompressionMustafa Akin Yilmaz, O. Ugur Ulas, Ahmet Bilican, A. Murat Tekalp. 1760-1766 [doi]
- Progressive Learning with Visual Prompt Tuning for Variable-Rate Image CompressionShiyu Qin, Yimin Zhou, Jin-Peng Wang, Bin Chen, Baoyi An, Tao Dai 0001, Shu-Tao Xia. 1767-1773 [doi]
- Adapting Learned Image Codecs To Screen Content Via Adjustable TransformationsH. Burak Dogaroglu, Ahmet Burakhan Koyuncu, Atanas Boev, Elena Alshina, Eckehard G. Steinbach. 1774-1780 [doi]
- End-to-End Learned Scalable Multilayer Feature Compression For Machine Vision TasksQiaoxi Chen, Changsheng Gao, Dong Liu. 1781-1787 [doi]
- Learned Image Compression for Both Humans and Machines via Dynamic AdaptationLingyu Zhu 0006, Binzhe Li, Riyu Lu, Peilin Chen 0001, Qi Mao 0002, Zhao Wang 0004, Wenhan Yang, Shiqi Wang 0001. 1788-1794 [doi]
- Saliency-Aware End-to-End Learned Variable-Bitrate 360-Degree Image CompressionOguzhan Güngördü, A. Murat Tekalp. 1795-1801 [doi]
- Gabic: Graph-Based Attention Block for Image CompressionGabriele Spadaro, Alberto Presta, Enzo Tartaglione, Jhony H. Giraldo, Marco Grangetto, Attilio Fiandrotti. 1802-1808 [doi]
- Learned Image Compression With Text Quality EnhancementChih-Yu Lai, Dung N. Tran, Kazuhito Koishida. 1809-1815 [doi]
- Neural Radiance Field-Assisted Static-Scene Video CodingRunyu Yang, Dong Liu, Feng Wu, Wen Gao. 1816-1822 [doi]
- Fast Constant-Quality Video Encoding Using VVENC With Rate Capping Based On Pre-Analysis StatisticsChristian R. Helmrich, Valeri George, Vignesh V. Menon, Adam Wieckowski, Benjamin Bross, Detlev Marpe. 1823-1828 [doi]
- Convex-Hull Estimation using Xpsnr for Versatile Video CodingVignesh V. Menon, Christian R. Helmrich, Adam Wieckowski, Benjamin Bross, Detlev Marpe. 1829-1835 [doi]
- In-Loop Filter for Object Mask Coding in Versatile Video CodingSebastian Schwarz, Miska M. Hannuksela, Döne Bugdayci Sansli. 1836-1842 [doi]
- Extended Multiple Cross-Component Linear Models With Adaptive Thresholding and Overlapped Averaging Beyond VVCHaruhisa Kato, Yoshitaka Kidani, Kei Kawamura. 1843-1849 [doi]
- Subblock-Based Combined Inter and Intra Prediction Beyond VVCLei Zhao, Kai Zhang, Li Zhang. 1850-1856 [doi]
- On Annotation-Free Optimization of Video Coding for MachinesMarc Windsheimer, Fabian Brand, André Kaup. 1857-1863 [doi]
- MFLFC: Multi-Frame Fusion Based Low-Resolution Feature Compression For Object TrackingYi Peng, Zixiang Zhang, Li Yu 0003. 1864-1869 [doi]
- Hybrid Single Input and Multiple Output Method For Compressing Features Towards Machine Vision TasksZifu Zhang, Shengxi Li, Tie Liu, Mai Xu, Tao Xu, Zhenyu Guan, Zhuoyi Lv. 1870-1876 [doi]
- Competitive Learning For Achieving Content-Specific Filters In Video Coding For MachinesHonglei Zhang, Jukka I. Ahonen, Nam Le 0003, Ruiying Yang, Francesco Cricri. 1877-1882 [doi]
- Image Coding For Machine Via Analytics-Driven Appearance Redundancy ReductionXuelin Shen, Haoqiao Ou, Wenhan Yang. 1883-1889 [doi]
- Lidar Depth Map Guided Image Compression ModelAlessandro Gnutti, Stefano Della Fiore, Mattia Savardi, Yi-Hsin Chen, Riccardo Leonardi, Wen-Hsiao Peng. 1890-1896 [doi]
- Fast Template Matching-Based Reference Picture Padding for Video CodingNicolas Neumann, Priyanka Das 0005, Tim Classen, Mathias Wien. 1897-1902 [doi]
- Fast Coding Mode Prediction for Intra Prediction in VVC SCCDayong Wang, Junyi Yu, Xin Lu 0001, Frédéric Dufaux, Hongwei Guo 0001, Hui Guo, Ce Zhu. 1903-1909 [doi]
- Sample Domain Prediction and Transform Skip for Region Adaptive Hierarchical Transform in Geometric Point Cloud CompressionBharath Vishwanath, Wenyi Wang, Yingzhan Xu, Kai Zhang, Li Zhang. 1910-1915 [doi]
- Real-Time Semantic Video Communication of General ScenesCem Eteke, Alexander Griessel, Wolfgang Kellerer, Eckehard G. Steinbach. 1916-1921 [doi]
- Standard Compliant Video Coding Using Low Complexity, Switchable Neural WrappersYueyu Hu, Chenhao Zhang, Onur G. Guleryuz, Debargha Mukherjee, Yao Wang 0001. 1922-1928 [doi]
- Picture Partitioning Design of Neural Network-Based Intra Coding For Video Coding For MachinesKeiichi Chono, Naoya Niwa, Hiroe Iwasaki. 1929-1934 [doi]
- Feature Enhanced Learning Image Compression With Recurrent Criss-Cross AttentionXue Wu, Tong Tang, Zhiyuan Zhu, Hong Zou. 1935-1939 [doi]
- Learned Image Compression Using A Long and Short Attention ModuleZenghui Duan, Cheolkon Jung, Yang Liu, Ming Li. 1940-1946 [doi]
- Generalized Nested Latent Variable Models For Lossy Coding Applied To Wind Turbine ScenariosRaül Pérez-Gonzalo, Andreas Espersen, Antonio Agudo. 1947-1953 [doi]
- Rate-Complexity Optimization in Lossless Neural-Based Image CompressionLucas S. Lopes, Ricardo L. de Queiroz, Philip A. Chou. 1954-1959 [doi]
- Omra: Online Motion Resolution Adaptation To Remedy Domain Shift in Learned Hierarchical B-Frame CodingZong-Lin Gao, Sang NguyenQuang, Wen-Hsiao Peng, Xiem HoangVan. 1960-1966 [doi]
- ROI-DVC: A Region-of-Interest Based Deep Video Coding FrameworkXiaojie Wu, Ping Wang, Xinhong Wang. 1967-1972 [doi]
- Redefining Visual Quality: The Impact of Loss Functions on INR-Based Image CompressionLorenzo Catania, Dario Allegra. 1973-1979 [doi]
- Talking-Head Video Compression With Motion Semantic Enhancement ModelHaobo Lei, Zhisong Bie, Zhao Jing, Hongxia Bie. 1980-1986 [doi]
- Rage for the Machine: Image Compression with Low-Cost Random Access for Embedded ApplicationsChristian D. Rask, Daniel E. Lucani. 1987-1993 [doi]
- Bi-Predictive Intra Block Copy for Enhanced Video Coding Beyond VVCYoshitaka Kidani, Haruhisa Kato, Kei Kawamura. 1994-2000 [doi]
- MSD-CRFS: Multi-Scale Dual Aggregation Conditional Random Fields for Monocular Depth EstimationXidan Zhang, Jianing Wei, Atsunori Moteki, Yoshie Kobayashi, Genta Suzuki, Zhiming Tan. 2001-2007 [doi]
- Scene Text Recognition Using Progressive Rectification Network And Spelling Error Correction Language ModelMing-Zheng Peng, Hao-Chung Cheng, Phuong Thi Le, Cheng-Chun Wang, Chien-Yao Wang, Jia-Ching Wang. 2008-2014 [doi]
- Real-Time Video Prediction With Fast Video Interpolation Model and Prediction TrainingShota Hirose, Kazuki Kotoyori, Kasidis Arunruangsirilert, Fangzheng Lin, Heming Sun, Jiro Katto. 2015-2021 [doi]
- Some Can Be Better than All: Multimodal Star Transformer for Visual DialogQiangqiang He, Jie Zhang, Shuwei Qian, Chongjun Wang. 2022-2026 [doi]
- Fast Inter Mode Decision with Resolution Sampling For VVC 360-Degree Video CodingYifan Qiang, Naian Liu. 2027-2033 [doi]
- Streaming Neural ImagesMarcos V. Conde, Andy Bigos, Radu Timofte. 2034-2040 [doi]
- Cross-Modal Alignment of Local and Global Features for Zero-Shot Chinese Character RecognitionHongyi Cai, Anna Zhu. 2041-2047 [doi]
- Lrdif: Diffusion Models For Under-Display Camera Emotion RecognitionZhifeng Wang 0004, Kaihao Zhang, Ramesh S. Sankaranarayana. 2048-2054 [doi]
- Hdplifter: Hierarchical Dynamics Perception For 2D-to-3D Human Pose LiftingYe Lu, Jianjun Gao 0005, Chen Cai, Ruoyu Wang, Duc Tri Phan, Kim-Hui Yap. 2055-2061 [doi]
- TCA-NET: Triplet Concatenated-Attentional Network for Multimodal Engagement EstimationHongyuan He, Daming Wang, Md. Rakibul Hasan, Tom Gedeon, Md. Zakir Hossain. 2062-2068 [doi]
- Estimating Indoor Scene Depth Maps From Ultrasonic EchoesJunpei Honma, Akisato Kimura, Go Irie. 2069-2073 [doi]
- A Hard Convex-Shape Constraint In Dnns For Object SegmentationJimut B. Pal, Suyash P. Awate. 2074-2080 [doi]
- SovaSeg-Net: Scale Invariant Ovarian Tumors Segmentation from Ultrasound ImagesHuu-Phong Luong, Hoang-Son Bui, Nam-Khanh Nguyen, Thi-Loan Pham, Gia-Minh Pham, Sy-Hoang Tran, Thanh-Hai Tran 0001, Thi-Lan Le. 2081-2087 [doi]
- Deep Convolutional Neural Network Prediction For Glaucoma Detection Using OCT and OCT-Angiography Disc-and Macula-Centered Images and Their Combined PowerGouverneur François, Pourjavan Sayeh, Macq Benoit. 2088-2094 [doi]
- Blend & Predict: Domain-Adaptable Few-Shot Learning for Microscopy ImagingAyush Somani, Anshul Gupta, Arif Ahmed Sekh, Krishna Agarwal, Dilip K. Prasad. 2095-2100 [doi]
- Quadruple-Consistency Vision Transformer for Medical Image Segmentation with Limited Number of Sparse AnnotationsYufan Liu, Ziyang Wang, Tianxiang Chen, Zi Ye. 2101-2107 [doi]
- Semi-Supervised Graphical Deep Dictionary Learning for Hyperspectral Image Classification From Limited SamplesAnurag Goel, Angshul Majumdar. 2108-2114 [doi]
- Fast Edge-Aware Occlusion Detection In The Context of Multispectral Camera ArraysFrank Sippel, Jürgen Seiler, André Kaup. 2115-2120 [doi]
- Conditional Optimal Filter Selection For Multispectral Object ClassificationKatja Kossira, David Schön, Jürgen Seiler, André Kaup. 2121-2127 [doi]
- Cross-Fusion of Band-Specific Spectral Features For Multi-Band NIR ColorizationGyeong-Eun Youm, Tae Sung Park, Jong-Ok Kim. 2128-2134 [doi]
- ClearDepth: Addressing Depth Distortions Caused By Eyelashes For Accurate Geometric Gaze Estimation On Mobile DevicesJamie Koerner, Vivienne Sze. 2135-2141 [doi]
- Efficient Visual Question Answering on Embedded Devices: Cross-Modality Attention With Evolutionary QuantizationAakansha Mishra, Aditya Agarwala, Utsav Tiwari, Vikram Nelvoy Rajendiran, Srinivas Soumitri Miriyala. 2142-2148 [doi]
- Early Prediction Of The Transferability Of Bovine Embryos From VideomicroscopyYasmine Hachani, Patrick Bouthemy, Elisa Fromont, Sylvie Ruffini, Ludivine Laffont, Alline de Paula Reis. 2149-2155 [doi]
- Metaheuristic Camera Calibration for Optical Tomographic Imaging in Industrial EnvironmentsAndreas Unterberger, Cheau Tyan Foo, Zachary Adrian Emuang, Fabio J. W. A. Martins, Khadijeh Mohri. 2163-2169 [doi]
- Temporal Regularization for Robust Motion Compensation in Reduced Dose Cardiac-Gated Spect ImagesXirang Zhang, Yongyi Yang, Jovan G. Brankov, P. Hendrik Pretorius, Michael A. King. 2170-2174 [doi]
- Self-Supervised Anomaly Detection and a New Benchmark for X-Ray Cargo ImagesBipin Gaikwad, Abani Patra, Carl R. Crawford, Eric L. Miller 0001. 2175-2181 [doi]
- Cross-Action Cross-Subject Skeleton Action Recognition Via Simultaneous Action-Subject Learning With Two-Step Feature RemovalYu Mitsuzumi, Akisato Kimura, Go Irie, Atsushi Nakazawa. 2182-2186 [doi]
- Rethinking Temporal Self-Similarity For Repetitive Action CountingYanan Luo, Jinhui Yi, Yazan Abu Farha, Moritz Wolter, Juergen Gall. 2187-2193 [doi]
- Subgroups For Detection TransformerTharsan Senthivel, Ngoc-Son Vu. 2194-2200 [doi]
- Caseg: Clip-Based Action Segmentation With Learnable Text PromptSuyuan Huang, Haoxin Zhang, Yanyu Xu, Yan Gao 0017, Yao Hu 0002, Zengchang Qin. 2201-2207 [doi]
- Prompt Performance Prediction For Image GenerationNicolas Bizzozzero, Ihab Bendidi, Olivier Risser-Maroix. 2208-2214 [doi]
- FAWN: Floor-and-Walls Normal Regularization for Direct Neural TSDF ReconstructionAnna Sokolova, Anna Vorontsova, Bulat Gabdullin, Alexander Limonov. 2215-2221 [doi]
- Adaptrack: Adaptive Thresholding-Based Matching for Multi-Object TrackingKyuJin Shim, Kangwook Ko, Jubi Hwang, Changick Kim. 2222-2228 [doi]
- Camouflaged Object Detection Via Style Transfer-Based Data AugmentationDongni Lu, Jiaxuan Chen 0006, Haiyan Chen 0001, Ziyi Peng, Rong-Quan, Jie Qin. 2229-2235 [doi]
- 2-Net: Continual Cross-Modal Mapping Network For Driver Action RecognitionRuoyu Wang, Chen Cai, Wenqian Wang, Jianjun Gao, Dan Lin, Wenyang Liu, Kim-Hui Yap. 2236-2242 [doi]
- Medea: Multi-View Efficient Depth AdjustmentMikhail Artemyev, Anna Vorontsova, Anna Sokolova, Alexander Limonov. 2243-2249 [doi]
- Compression-Aware Tuning for Compressing Volumetric Radiance FieldsLuyang Tang, Yongqi Zhai, Ronggang Wang. 2250-2256 [doi]
- Personatalk: Preserving Personalized Dynamic Speech Style In Talking Face GenerationQianxi Lu, Yi He, Shilin Wang. 2257-2263 [doi]
- Toward Low Artifact Virtual Try-On Via Pre-Warping Partitioned Clothing AlignmentWei-Chian Liang, Chieh-Yun Chen, Hong-Han Shuai. 2264-2270 [doi]
- Shadow-Aware Makeup Transfer with Lighting AdaptationHao-Yun Chang, Wen-Jiin Tsai. 2271-2277 [doi]
- Controllable Unsupervised Event-Based Video GenerationYaping Zhao, Pei Zhang, Chutian Wang, Edmund Y. Lam. 2278-2284 [doi]
- Hyperspectral Image Classification With Fuzzy Spatial-Spectral Class Discriminate InformationMuhammad Ahmad, Muhammad Usama, Salvatore Distefano, Manuel Mazzara. 2285-2291 [doi]
- Efficient Semantic Segmentation For Aerial Imagery Using Query Points and Superpixel SupervisionSantiago Rivier, Carlos Hinojosa, Silvio Giancola, Bernard Ghanem. 2292-2298 [doi]
- YOLO-Feder Fusionnet: A Novel Deep Learning Architecture for Drone DetectionTamara R. Lenhard, Andreas Weinmann, Stefan Jäger 0005, Tobias Koch 0004. 2299-2305 [doi]
- MSGAT: Multi-Stage Graph Attention Network For Human Motion PredictionZiyang Zheng, Ziliang Ren, Zhanhao Liang, Gulin Wang, Qieshi Zhang. 2306-2312 [doi]
- Footbots: A Transformer-Based Architecture for Motion Prediction in SoccerGuillem Capellera, Luis Ferraz, Antonio Rubio 0001, Antonio Agudo, Francesc Moreno-Noguer. 2313-2319 [doi]
- Agent-Guided Gaze Estimation Network by Two-Eye Asymmetry ExplorationYichen Shi, Feifei Zhang, Wenming Yang, Guijin Wang, Nan Su. 2320-2326 [doi]
- Anomaly Detection for the Identification of Volcanic Unrest in Satellite ImageryRobert Gabriel Popescu, Nantheera Anantrasirichai, Juliet Biggs. 2327-2333 [doi]
- Semantic-Enhanced Point-Box Joint Prompting for Video Object SegmentationQuan Zhao, Siying Wu, Yueyi Zhang, Xiaoyan Sun 0001. 2334-2340 [doi]
- Two-Stage Tripletnet: Light Weight Remote Sensing Scene ClassificationXianbin Hu, Wei Wu 0019, Zhu Li 0001, Xueliang Luo, Zhengfeng Chen. 2341-2346 [doi]
- Semi-Supervised Action Recognition From Newborn Resuscitation VideosSyed Tahir Hussain Rizvi, Øyvind Meinich-Bache, Vilde Kolstad, Siren Rettedal, Sara Brunner, Kjersti Engan. 2347-2353 [doi]
- Gradtrans: Transformer-Based Gradient Guidance for Image GenerationYiwei Chen, Jiaqian Yu, Siyang Pan, Sangil Jung, Wu Bi, Seung In Park, Qiang Wang 0023, ByungIn Yoo. 2354-2360 [doi]
- Transformer-Based Clipped Contrastive Quantization Learning For Unsupervised Image RetrievalAyush Dubey, Shiv Ram Dubey, Satish Kumar Singh, Wei-Ta Chu. 2361-2367 [doi]
- Improving Real-Time Near-Infrared Face Alignment With a Paired VIS-NIR Dataset and Data Augmentation Through Image-to-Image TranslationLangning Miao, Ryo Kakimoto, Kaoru Ohishi, Yoshihiro Watanabe. 2368-2374 [doi]
- Lipface: Lipschitz-Conditioned For Resolution Robust Face RecognitionYu-Wei Chen, Huu-Phu Do, Chia-Wei Kuo, Hsuan-Tung Liu, Ching-Chun Huang. 2375-2381 [doi]
- Gumbel-NeRF: Representing Unseen Objects as Part-Compositional Neural Radiance FieldsYusuke Sekikawa, Chingwei Hsu, Satoshi Ikehata, Rei Kawakami, Ikuro Sato. 2382-2388 [doi]
- SKETCH2MANGA: Shaded Manga Screening from Sketch with Diffusion ModelsJian Lin, Xueting Liu 0001, Chengze Li, Minshan Xie, Tien-Tsin Wong. 2389-2395 [doi]
- ACML: Attention-Based Cross-Modality Learning For Cloth-Changing and Occluded Person Re-IdentificationVuong D. Nguyen, Pranav Mantini, Shishir K. Shah. 2396-2402 [doi]
- SFD: Similar Frame Dataset for Content-Based Video RetrievalChaowei Han, Gaofeng Meng, Chunlei Huo. 2403-2409 [doi]
- Gaitgs: Temporal Feature Learning in Granularity And Span Dimension for Gait RecognitionHaijun Xiong, Yunze Deng, Bin Feng 0001, Xinggang Wang, Wenyu Liu 0001. 2410-2416 [doi]
- Adaptively Hierarchical Quantization Variational Autoencoder Based on Feature Decoupling and Semantic Consistency for Image GenerationYing Zhang, Hyunhee Park, Hanchao Jia, Fan Wang, Jianxing Zhang, Xiangyu Kong. 2417-2423 [doi]
- Licaf: Lidar-Camera Asymmetric Fusion For Gait RecognitionYunze Deng, Haijun Xiong, Bin Feng 0001. 2424-2430 [doi]
- Zero-Shot Composed Image Retrieval Considering Query-Target Relationship Leveraging Masked Image-Text PairsHuaying Zhang, Rintaro Yanagi, Ren Togo, Takahiro Ogawa 0001, Miki Haseyama. 2431-2437 [doi]
- Thermal Videodiff (TVD): A Diffusion Architecture For Thermal Video SynthesisTayeba Qazi, Brejesh Lall. 2438-2444 [doi]
- One-Shot Multi-Rate Pruning Of Graph Convolutional Networks For Skeleton-Based RecognitionHichem Sahbi. 2445-2451 [doi]
- Spatio-Temporal Adaptation With Dilated Neighbourhood Attention For Accident AnticipationPatrik Patera, Yie-Tarng Chen, Wen-Hsien Fang. 2452-2458 [doi]
- MTA-PS: Towards Practical Person Search in VideosTiancheng Ying, Rong-Quan, Peng Zheng 0004, Yichao Yan, Jie Qin. 2459-2465 [doi]
- Micro-Expression Recognition Based On 3DCNN Combined With GRU and New Attention MechanismChun-Ting Fang, Tsung-Jung Liu, Kuan-Hsien Liu. 2466-2472 [doi]
- Learning Temporal Cues for Fine-Grained Action RecognitionZhihao Liu, Yi Zhang, Wenhui Huang, Yan Liu, Mengyang Pu, Chao Deng, Junlan Feng. 2473-2479 [doi]
- Extending Segment Anything Model into Auditory and Temporal Dimensions for Audio-Visual SegmentationJuhyeong Seon, Woobin Im, Sebin Lee, Jumin Lee, Sung-Eui Yoon. 2480-2486 [doi]
- Stay Focus on Object: Cross-Domain Detection Using Domain-Invariant Object RepresentationTaehoon Kim, Jaemin Na, Joong-Won Hwang, Wonjun Hwang. 2487-2493 [doi]
- Open-Vocabulary Panoptic Segmentation Using Bert Pre-Training of Vision-Language Multiway Transformer ModelYi-Chia Chen, Wei-Hua Li, Chu-Song Chen. 2494-2500 [doi]
- Gabor Feature Network for Transformer-Based Building Change Detection Model in Remote SensingPriscilla Indira Osa, Josiane Zerubia, Zoltan Kato. 2501-2507 [doi]
- Leveraging Generated Image Captions for Visual Commonsense ReasoningSubham Das, C. Chandra Sekhar. 2508-2514 [doi]
- Motion-Lie Transformer: Geometric Attention For 3D Human Pose Motion PredictionMayssa Zaier, Hazem Wannous, Hassen Drira, Jacques Boonaert. 2515-2521 [doi]
- PCA-UNET for Object SegmentationCheng Long 0002, Sayantika Nag, Adrian Barbu. 2522-2528 [doi]
- Exploring Attention Mechanisms in Integration of Multi-Modal Information for Sign Language Recognition and TranslationZaber Ibn Abdul Hakim, Rasman Mubtasim Swargo, Muhammad Abdullah Adnan. 2529-2535 [doi]
- Spatial-Channel Collaborated Attention for Cross-Scale Crowd CountingYongpeng Chang, Guangchun Gao. 2536-2542 [doi]
- Referring Image Segmentation with Two-Stage Multi-Modal InteractionZhenhua Wang, Linwei Ye. 2543-2549 [doi]
- Distinctive Image Captioning: Leveraging Ground Truth Captions in Clip Guided Reinforcement LearningAntoine Chaffin, Ewa Kijak, Vincent Claveau. 2550-2556 [doi]
- Statistics-Aware Audio-Visual Deepfake DetectorMarcella Astrid, Enjie Ghorbel, Djamila Aouada. 2557-2563 [doi]
- Cross-Domain Few-Shot In-Context Learning For Enhancing Traffic Sign RecognitionYaozong Gan, Guang Li 0008, Ren Togo, Keisuke Maeda, Takahiro Ogawa 0001, Miki Haseyama. 2564-2570 [doi]
- Edge-Reserved Knowledge Distillation for Image MattingJiasheng Wang, Zhenhua Wang 0003, Jifeng Ning. 2571-2577 [doi]
- Learning A Rain-Invariant Network For Instance Segmentation In The RainZhiwen Chen, Wei Wu 0019, Zhengfeng Chen. 2578-2584 [doi]
- Rethinking Domain Adaptation and Generalization in the Era of ClipRuoyu Feng, Tao Yu, Xin Jin, Xiaoyuan Yu, Lei Xiao, Zhibo Chen 0001. 2585-2591 [doi]
- Fanet: Feature Amplification Network for Semantic Segmentation in Cluttered BackgroundMuhammad Ali, Mamoona Javaid, Mubashir Noman, Mustansar Fiaz, Salman Khan 0001. 2592-2598 [doi]
- Towards Generalizable Referring Image Segmentation Via Target Prompt And Visual CoherenceYajie Liu, Pu Ge, Haoxiang Ma, Shichao Fan, Qingjie Liu, Di Huang 0001, Yunhong Wang 0001. 2599-2605 [doi]
- Exploring the Potential of Recurrence Quantification Analysis for Video Analysis and Motion DetectionT. Kyprianidi, E. Doutsi, George Tzagkarakis, Panagiotis Tsakalides. 2606-2612 [doi]
- MODIPHY: Multimodal Obscured Detection for IoT using PHantom Convolution-Enabled Faster YOLOShubhabrata Mukherjee, Cory C. Beard, Zhu Li. 2613-2619 [doi]
- Multimodal Transformer Using Cross-Channel Attention For Object Detection In Remote Sensing ImagesBissmella Bahaduri, Zuheng Ming, Fangchen Feng, Anissa Mokraoui. 2620-2626 [doi]
- Bidfuse: Harnessing Bi-Directional Attention with Modality-Specific Encoders for Infrared-Visible Image FusionWangzhi Xing, Diqi Chen, Mohammad Aminul Islam, Jun Zhou 0001. 2627-2633 [doi]
- Illumination-Enhanced Infrared and Low-Light Visible Image FusionGuohua Lv, Xinyue Fu, Chaoqun Sima, Yanlong Xu, Baodong Zhang, Hanju Bao. 2634-2640 [doi]
- Object Detection Framework Using Multiple Tone Mappings on High-Dynamic-Range ImagesTakumi Watanabe, Rei Kawakami, Masayuki Tanaka 0001, Masatoshi Okutomi. 2641-2647 [doi]
- Deep Fusion of Visible and Near Infrared Images for Registration and Defogging Using Cross Modal TransformerMengyao Ji, Cheolkon Jung. 2648-2654 [doi]
- Rafmnet: Reinforced Attention Fusion and Multiscale Network For Noisy Infrared and Visible Image FusionGuohua Lv, Xiyan Wang, Yongbiao Gao, Yi Zhai, Guixin Zhao, Guangxiao Ma. 2655-2661 [doi]
- Feature Decomposition Transformers for Infrared and Visible Image FusionGahyeon Kim, An Gia Vien, Duong Hai Nguyen, Chul Lee. 2662-2668 [doi]
- Land Use Classification Via Multi-Modal Complementary Feature Fusion and Context Information Enhancement For Optical and Sar ImagesXinyue Fan, Libao Zhang. 2669-2675 [doi]
- Investigating and Reducing the Impairment of Point Spread Effect For Spatiotemporal Fusion Of Remote Sensing ImageryYunfei Li, Jun Li. 2676-2682 [doi]
- Evaluating 3D Human Pose Estimation in Occluded Multi-Sensor Scenarios: Dataset and Annotation ApproachKévin Riou, Kaiwen Dong, Yujie Huang, Kévin Subrin, Patrick Le Callet, Yanjing Sun. 2683-2689 [doi]
- A Preconditioning Approach To Optimizing Sensing Matrix For Improved Compressed Sensing CT ReconstructionPrasad Theeda, Chee-Ming Ting, Arghya Pal, Hernando Ombao. 2690-2695 [doi]
- 3F-PNP: Compressive Sensing Using Nonlocal Self-Similarity and Deep Learning PriorsKaren O. Egiazarian, Vladimir Katkovnik. 2696-2701 [doi]
- Hierarchical Vertex-Wise Intensification Graph Convolution for Skeleton-Based Activity RecognitionYun Li, Hao Xie, Jun Xiao, Cong Zhang, Tianshan Liu, Kin-Man Lam 0001. 2702-2708 [doi]
- Fourier Ptychography With Information Entropy Based No-Reference Image Quality Assessment LearningQijun Yang, Hujun Yin. 2709-2715 [doi]
- All Skeletons are Created Equal! A Domain Adaptation Transformer to Handle Multiple TopologiesGiulia Martinelli, Nicola Garau, Niccolò Bisagno, Nicola Conci. 2716-2722 [doi]
- Spatial Plaid Attention Decoder for Semantic SegmentationAbolfazl Meyarian, Xiaohui Yuan 0001, Zhinan Qiao. 2723-2729 [doi]
- Histohdr-Net: Histogram Equalization for Single LDR to HDR Image TranslationHrishav Bakul Barua, Ganesh Krishnasamy, KokSheik Wong, Abhinav Dhall, Kalin Stefanov. 2730-2736 [doi]
- Pixel-Wise Color Constancy Via Smoothness Techniques In Multi-Illuminant ScenesUmut Cem Entok, Firas Laakom, Farhad Pakdaman, Moncef Gabbouj. 2737-2743 [doi]
- MGRQ: Post-Training Quantization For Vision Transformer With Mixed Granularity ReconstructionLianwei Yang, Zhikai Li, Junrui Xiao, Haisong Gong, Qingyi Gu. 2744-2750 [doi]
- Efficient Circular and Confocal Non-Line-Of-Sight Imaging With Transient Sinogram Super ResolutionDixin Yang, Mariko Isogawa. 2751-2757 [doi]
- Simple Image Signal Processing using Global Context GuidanceOmar Elezabi, Marcos V. Conde, Radu Timofte. 2758-2764 [doi]
- Generate DSLR-Like Image With Global Information and Prior Guided ISPLu Xu, Chao Zhang, Yasi Wang, Qiang Wang. 2765-2771 [doi]
- Clip-Based Composition-Aware Image CroppingShuo Zhang, Xinyu Yang, Xiwen Bai, Yu Li. 2772-2778 [doi]
- Multi-Path Interference Mitigation For Indirect Time-of-Flight Camera By the Distortion of Coding CurveWenbin Luo, Takafumi Iwaguchi, Ryusuke Sagawa, Hiroshi Kawasaki. 2779-2785 [doi]
- E2SIFT: Neuromorphic SIFT via Direct Feature Pyramid Recovery from EventsChris Henry, Paras Maharjan, Zhu Li 0001, George York. 2786-2792 [doi]
- VAG: Voxel Attenuation Grid For Sparse-View CBCT ReconstructionJinhao Qiao, Jiang Liu, Heng Yu, Yi Xiao, Hongshan Yu, Yan Zheng, Sihan Li. 2793-2799 [doi]
- Dynamic MRI Reconstruction Using Low-Rank Plus Sparse Decomposition With Smoothness RegularizationChee-Ming Ting, Fuad Noman, Raphaël C.-W. Phan, Hernando Ombao. 2800-2806 [doi]
- LMBF-Net: A Lightweight Multipath Bidirectional Focal Attention Network For Multifeatures SegmentationTariq M. Khan, Shahzaib Iqbal, Syed Saud Naqvi, Imran Razzak, Erik Meijering. 2807-2813 [doi]
- Unrolled Projected Gradient Algorithm For Stain Separation In Digital Histopathological ImagesAymen Sadraoui, Astrid Laurent-Bellue, Mounir Kaaniche, Amel Benazza-Benyahia, Catherine Guettier, Jean-Christophe Pesquet. 2814-2819 [doi]
- Deep Regularization For Scale-Agnostic Superresolution of MR ImagesK. Pavan Kumar Reddy, Kunal N. Chaudhury. 2820-2826 [doi]
- A 1D Plug-and-Play Synthetic Data Deep Learning For Undersampled Magnetic Resonance Image ReconstructionMin Xiao, Zi Wang 0005, Jiefeng Guo, Di Guo, Xiaobo Qu 0001. 2827-2832 [doi]
- Enhancing Intubation Accuracy: Advanced Tracheal Segmentation Techniques In Video EndoscopyAdel Oulefki, Abbes Amira, Fatih Kurugollu, Thaweesak Trongtirakul, Sos S. Agaian, Menen Kassim Mohammed, Mohammad Alshoweky. 2833-2838 [doi]
- Adaptive Sampling Method for Whole-Body Low-Dose PET Reconstruction Based on Reconstruction DifficultyYanyi Li, Jianping Yin. 2839-2845 [doi]
- Fourier Ptychography Microscopy With Integrated Positional Misalignment CorrectionJuliana Do Nascimento Damurie Da Silva, Patrick Horain. 2846-2851 [doi]
- Deep-Learning-Based Magnetic Resonance Simultaneous Multislice Imaging Using Holographic Image DecodingSatoshi Ito, Yuki Sato, Naoya Endo, Shohei Ouchi. 2852-2857 [doi]
- Improvement of Image Reconstruction for MRI Using Phase-Scrambling Fourier Transform and Dual-Domain StrategyKazuki Yamato, Satoshi Ito. 2858-2864 [doi]
- A Cross Domain Generative Network for Accelerated MRIVazim Ibrahim, Joseph Suresh Paul. 2865-2870 [doi]
- A Multi-Modality Feature Enhancement Method Based On Feature Disentanglement For SAR Image Target DetectionJiayue He, Nan Su 0001, Yanping Liao, Yiming Yan, Shou Feng, Chunhui Zhao 0003. 2871-2877 [doi]
- A Learnable Radar Imaging Paradigm Driven by Deep Generative ModelShuang Li, Ganggang Dong. 2878-2884 [doi]
- Privacy-Preserving Visual Cues Communication for Hearing-Impaired People Using Deep LearningFatima Zaidi, Hira Hameed, Muhammad Farooq 0009, Aisha Fatima, Kamran Arshad, Khaled Assaleh 0001, Qammer H. Abbasi. 2885-2888 [doi]
- Food: Facial Authentication And Out-Of-Distribution Detection With Short-Range FMCW RadarSabri Mustafa Kahya, Boran Hamdi Sivrikaya, Muhammet Sami Yavuz, Eckehard G. Steinbach. 2889-2894 [doi]
- Detecting Biomedical Copy-Move Forgery by Attention-Based Multiscale Deep DescriptorsHao-Chiang Shao, Tse-Yu Tseng, Yuan-Rong Liao, Chi-Chun Chen, Chung-Yang Hung, Ming-hsin Liang. 2895-2901 [doi]
- Directional And Topological Transformer With Topology Priors For 4D Cellular Image SegmentationZelin Li, Zhaoke Huang, Zhen Zhu, Sicheng You, Zhongying Zhao, Hong Yan. 2902-2908 [doi]
- Delving into the Explainability of Prototype-Based CNN for Biological Cell AnalysisMartin Blanchard, Olivier Delézay, Christophe Ducottet, Damien Muselet. 2909-2915 [doi]
- Cell Cycle State Prediction Using Graph Neural NetworksSayan Acharya, Aditya Ganguly, Ram Sarkar, Abin Jose. 2916-2922 [doi]
- Deep Learning-Based Leaf Image Analysis for Tomato Plant Disease Detection and ClassificationAmmar Chouchane, Abdelmalik Ouamane, El Ouanas Belabbaci, Yassine Himeur, Abbes Amira. 2923-2929 [doi]
- Novel Meta Attention Guided Framework for Breast Abnormality Classification With Combination of FSL and DAAnindita Mohanta, Sourav Dey Roy, Niharika Nath, Mrinal Kanti Bhowmik. 2930-2936 [doi]
- Dual Multi-Modal Feature Fusion Network for the Evaluation of OsteosarcomaZequn Song, Lingfeng Wang. 2937-2943 [doi]
- MCT-Net: a Lightweight Multiscale Convolutional Transformer Network for Polyp SegmentationNiladri Chakraborti, Deepak Ranjan Nayak. 2944-2950 [doi]
- A Neuroimaging YOLOv8-Based CAD Framework for Anosmia Grading in COVID-19Hossam Magdy Balaha, Mayada Elgendy, Ahmed Alksas, Mohamed Shehata, Norah Saleh Alghamdi, Fatma Taher, Mohammed Ghazal, Mahitab Ghoneim, Eslam Hamed, Fatma Sherif, Ahmed Elgarayhi, Mohammed Sallah, Mohamed Abdelbadie Salem, Elsharawy Kamal, Harpal Sandhu, Ayman El-Baz. 2951-2956 [doi]
- Physiological Modeling With Multispectral Imaging for Heart Rate EstimationKosuke Kurihara, Yoshihiro Maeda, Daisuke Sugimura, Takayuki Hamamoto. 2957-2963 [doi]
- Automated Segmentation of Lung Regions in 3D CT Scans Using Hybrid Unsupervised-Supervised ModelsAhmed Sharafeldeen, Adel Khelifi, Mohammed Ghazal, Maha Yaghi, Sohail Contractor, Ayman El-Baz. 2964-2969 [doi]
- Cafct-Net: A CNN-Transformer Hybrid Network With Contextual And Attentional Feature Fusion For Liver Tumor SegmentationMing Kang 0002, Chee-Ming Ting, Fung Fung Ting, Raphaël C.-W. Phan. 2970-2974 [doi]
- SS-CXR: Self-Supervised Pretraining Using Chest X-Rays Towards A Domain Specific Foundation ModelSyed Muhammad Anwar, Abhijeet Parida, Sara Atito 0001, Muhammad Awais, Gustavo Nino, Josef Kittler, Marius George Linguraru. 2975-2981 [doi]
- Mix-Domain Contrastive Learning For Unpaired H&E-to-IHC Stain TranslationSong Wang, Zhong Zhang, Huan Yan, Ming Xu, Guanghui Wang. 2982-2988 [doi]
- ATU-NET: An Adaptive Transformation-Based U-NET for Medical Image SegmentationQianyu Du, Baojiang Zhong, Kai-Kuang Ma. 2989-2995 [doi]
- Deep Multi-Graph Embedded Clustering for Community Detection in FMRI Functional Brain Networks Across IndividualsKai-Jun See, Chee-Ming Ting, Fuad Noman, Junn Yong Loo, Yee Fan Tan, Hernando Ombao, Raphaël C.-W. Phan. 2996-3002 [doi]
- An Interpretable Deep Graph Neural Network Based On Attentional Multi-Scale Feature Fusion for FMRI AnalysisLikai Wang 0010, Tao Zhu, Yipu Zhang. 3003-3009 [doi]
- Refining Myocardial Infarction Detection: A Novel Multi-Modal Composite Kernel Strategy in One-Class ClassificationMuhammad Uzair Zahid, Aysen Degerli, Fahad Sohrab, Serkan Kiranyaz, Tahir Hamid, Rashid Mazhar, Moncef Gabbouj. 3010-3016 [doi]
- A Needle In A (Medical) Haystack: Detecting A Biopsy Needle In Ultrasound Images Using Vision TransformersAgata M. Wijata, Bartlomiej Pycinski, Jakub Nalepa. 3017-3023 [doi]
- CST-Yolo: A Novel Method For Blood Cell Detection Based On Improved Yolov7 And CNN-Swin TransformerMing Kang 0002, Chee-Ming Ting, Fung Fung Ting, Raphaël C.-W. Phan. 3024-3029 [doi]
- Redefining Cystoscopy With AI: Bladder Cancer Diagnosis Using an Efficient Hybrid CNN-Transformer ModelMeryem Amaouche, Ouassim Karrakchou, Mounir Ghogho, Anouar El Ghazzaly, Mohamed Alami, Ahmed Ameur. 3030-3036 [doi]
- M3T: Multi-Modal Medical Transformer To Bridge Clinical Context With Visual Insights For Retinal Image Medical Description GenerationNagur Shareef Shaik, Teja Krishna Cherukuri, Dong Hye Ye. 3037-3043 [doi]
- Uncovering Communities Of Pipelines in the Task-FMRI Analytical SpaceElodie Germani, Elisa Fromont, Camille Maumet. 3044-3050 [doi]
- Multi-View Network for Colorectal Polyps Detection in CT ColonographyMohamed Yousuf, Samir Harb, Islam Alkabbany, Asem M. Ali, Salwa Elshazley, Aly A. Farag. 3051-3056 [doi]
- GEEG-YOLOv8: Gaussian Enhanced Euclidean Norm Ghost Attention for Real-Time Polyp DetectionPhuong-Thao Nguyen, Hiroshi Watanabe 0001. 3057-3063 [doi]
- Segmentation of Hard Exudates And Hemorrhages from Diabetic Retinopathy Images Using Residual U-Net with Squeeze and Excite BlocksAvinash Gaikwad, Anjali Gautam. 3064-3069 [doi]
- Giraffe: A Genetic Programming Algorithm To Build Deep Learning Ensembles For ECG Arrhythmia ClassificationDamian Kucharski, Agata M. Wijata, Lu Fu, Weidong Lin, Yumei Xue, Jacek Kawa, Yalin Zheng, Gregory Y. H. Lip, Jakub Nalepa. 3070-3076 [doi]
- Navigating Limitations With Precision: A Fine-Grained Ensemble Approach To Wrist Pathology Recognition On A Limited X-Ray DatasetAmmar Ahmed, Ali Shariq Imran, Mohib Ullah, Zenun Kastrati, Sher Muhammad Daudpota. 3077-3083 [doi]
- Contour-Weighted Loss For Class-Imbalanced Image SegmentationZhengyong Huang, Yao Sui. 3084-3090 [doi]
- Multi-Modal Medical Image Fusion for Non-Small Cell Lung Cancer ClassificationSalma Hassan, Hamad Al-Hammadi, Ibrahim Mohammed, Muhammad Haris Khan. 3091-3097 [doi]
- Guided Context Gating: Learning To Leverage Salient Lesions in Retinal Fundus ImagesTeja Krishna Cherukuri, Nagur Shareef Shaik, Dong Hye Ye. 3098-3104 [doi]
- FEDMI: A Federated Learning Framework for Secure Sharing of Medical ImagesZhongyuan Jing, Hongyan Xiang, Ruyan Wang. 3105-3111 [doi]
- Dcctnet: Kidney Tumors Segmentation Based On Dual-Level Combination Of CNN And TransformerBingzhen Hou, Guimei Zhang, Huiqun Liu, Yipeng Qin, Ying Chen 0023. 3112-3116 [doi]
- Wavelet-Enhanced CNN for Depression Classification Based on MRI ImagesYawei Zhang, Bo Li, Xin Li, Yuhan Huang, Hui Ding. 3117-3123 [doi]
- Advancing Colorectal Polyp Segmentation With Watershed Algorithm-Enhanced Parallel Self-Supervised LearningKhalil Chikhaoui, Motaz Alfarraj. 3124-3130 [doi]
- Multiclassification Of Vocal Folds Disorders From Videos By Spatio-Temporal Deep FeaturesDhouha Attia, Amel Benazza-Benyahia. 3131-3136 [doi]
- Event-Specific EEG-FNIRS Feature Fusion For Alzheimer's Disease ClassificationSung-Hyeon Kim, Tae-Min Choi, Sun-Kyung Lee 0001, Minhee Kim, Jae Gwan Kim, Jong-Hwan Kim 0001. 3137-3143 [doi]
- PWISeg: Weakly-Supervised Surgical Instrument Instance SegmentationZhen Sun, Huan Xu, Jinlin Wu, Zhen Chen, Hongbin Liu, Zhen Lei 0001. 3144-3150 [doi]
- A Novel Approach for 3D Renal Segmentation Using a Modified GAN Model and Texture AnalysisIsraa Sharaby, Ahmed Alksas, Hossam Magdy Balaha, Ali Mahmoud 0001, Mohammed Ali Badawy, Mohamed Abou El-Ghar, Ashraf Khalil, Mohammed Ghazal, Sohail Contractor, Ayman El-Baz. 3151-3157 [doi]
- Nyctale: Neuro-Evidence Transformer for Adaptive and Personalized Lung Nodule Invasiveness PredictionSadaf Khademi, Anastasia Oikonomou, Konstantinos N. Plataniotis, Arash Mohammadi 0001. 3158-3164 [doi]
- Reducing Motion Artifacts in Brain MRI Using Vision Transformers and Self-Supervised LearningLei Zhang, Xiaoke Wang, Edward H. Herskovits, Elias R. Melhem, Linda Chang, Ze Wang 0003, Thomas Ernst 0001. 3165-3171 [doi]
- Burnsnet: Burn Region Segmentation Network From Color Images With Two-Way CNNJoohi Chauhan, Paul L. Rosin, Puneet Goyal. 3172-3178 [doi]
- SINO-CT-Fusion-Net: A Lightweight Deep Learning Framework for Detection and Classification of Intracranial HemorrhagesChitimireddy Sindhura, Phaneendra K. Yalavarthy, Subrahmanyam Gorthi. 3179-3185 [doi]
- Reset: A Residual Set-Transformer Approach to Tackle the Ugly-Duckling Sign in Melanoma DetectionJules Collenne, Rabah Iguernaissi, Séverine Dubuisson, Djamal Merad. 3186-3191 [doi]
- One-Hot Logistic Regression for Radiomics-Based ClassificationBaptiste Schall, Rodolphe Anty, Lionel Fillatre. 3192-3198 [doi]
- Attention-Based Few-Shot Diagnosis of Chest X-Rays Using Semantic SignaturesDevi Prasad Maharathy, Prabhala Sandhya Gayatri, Angshuman Paul. 3199-3204 [doi]
- Recurrent 3-D Multi-Level Visual Transformer For Joint Classification of Heterogeneous 2-D and 3-D Radiographic DataMuhammad Owais, Muhammad Zubair, Taimur Hassan, Divya Velayudhan, Irfan Hussain, Naoufel Werghi. 3205-3211 [doi]
- Accurate Colon Segmentation Using 2D Convolutional Neural Networks With 3D Contextual InformationSamir Harb, A. Elsayed, Mohamed Yousuf, Islam Alkabbany, Asem M. Ali, Salwa Elshazley, Aly A. Farag. 3212-3218 [doi]
- Vizecgnet: Visual ECG Image Network for Cardiovascular Diseases Classification With Multi-Modal Training and Knowledge DistillationJu-Hyeon Nam, Seo-Hyung Park, Su Jung Kim, Sang-Chul Lee. 3219-3223 [doi]
- Chatgpt and Biometrics: an Assessment of Face Recognition, Gender Detection, and Age Estimation CapabilitiesAhmad Hassanpour, Yasamin Kowsari, Hatef Otroshi-Shahreza, Bian Yang, Sébastien Marcel. 3224-3229 [doi]
- A Trustworthy Authentication Against Visual Master Face Dictionary Attacks (Trauma)Muhammad Mohzary, Baek-Young Choi, Sejun Song. 3230-3235 [doi]
- Interpreting the Fraudulence Level of Different Finger Photo Presentation Attack InstrumentsAnudeep Vurity, Emanuela Marasco, Raghavendra Ramachandra, Duoduo Liao. 3236-3242 [doi]
- Alignface: Enhancing Face Verification Models Through Adaptive Alignment Of Pose, Expression, and IlluminationSahar Husseini, Jean-Luc Dugelay. 3243-3249 [doi]
- A New Fingerprinting Technique for Engraved Binary Matrix AuthenticationLéo Nicollier, Marc Michel Pic, Enric Meinhardt-Llopis, Gabriele Facciolo. 3250-3256 [doi]
- Exploring Saliency Bias in Manipulation DetectionJoshua Krinsky, Alan Bettis, Qiuyu Tang, Daniel Moreira, Aparna Bharati. 3257-3263 [doi]
- Deepfake Detection Via Separable Self-Consistency LearningLin Lu, Yunhong Wang, Wenqi Zhuo, Liang Zhang, Guangshuai Gao, Yuanfang Guo. 3264-3270 [doi]
- A Large-Capacity Data Hiding Scheme in Encrypted VVC VideoChen Chen, Xingjun Wang. 3271-3277 [doi]
- FREQ-MIP-AA: Frequency Mip Representation for Anti-Aliasing Neural Radiance FieldsYoungin Park, Seungtae Nam, Cheul-Hee Hahm, Eunbyung Park. 3278-3284 [doi]
- Temporal Scalable Coding For Dynamic MeshesJianfeng Xu, Haruhisa Kato, Kei Kawamura. 3285-3291 [doi]
- JOINTRF: End-To-End Joint Optimization for Dynamic Neural Radiance Field Representation and CompressionZihan Zheng, Houqiang Zhong, Qiang Hu, Xiaoyun Zhang, Li Song 0001, Ya Zhang, Yanfeng Wang 0001. 3292-3298 [doi]
- Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute CompressionTam Thuc Do, Philip A. Chou, Gene Cheung. 3299-3305 [doi]
- An Indoor Scene Localization Method Using Graphical Summary of Multi-View RGB-D ImagesPreeti Meena, Himanshu Kumar, Sandeep Kumar Yadav. 3306-3312 [doi]
- Full-Reference Point Cloud Quality Assessment Using Spectral Graph WaveletsRyosuke Watanabe, Keisuke Nonaka, Eduardo Pavez, Tatsuya Kobayashi, Antonio Ortega. 3313-3319 [doi]
- WrappingNet: Mesh Autoencoder Via Deep Sphere DeformationEric Lei, Muhammad Asad Lodhi, Jiahao Pang, Junghyun Ahn, Dong Tian. 3320-3326 [doi]
- IMU-Assisted Target-Free Extrinsic Calibration of Heterogeneous Lidars Based on Continuous-Time OptimizationZehao Yan, Lin Zhang, Zhong Wang, Shenjie Zhao. 3327-3333 [doi]
- TSF-NET3D: TSF-NET for 3D Point Cloud Attribute Compression Artifacts RemovalBirendra Kathariya, Zhu Li 0001, Geert Van Der Auwera. 3334-3340 [doi]
- LiSD: An Efficient Multi-Task Learning Framework For Lidar Segmentation and DetectionJiahua Xu, Si Zuo, Chenfeng Wei, Wei Zhou. 3341-3347 [doi]
- 3D Semantic Scene Completion From A Depth Map With Unsupervised Learning For Semantics PrioritisationMona Alawadh, Mahesan Niranjan, Hansung Kim. 3348-3354 [doi]
- End-to-End Learned Lossy Dynamic Point Cloud Attribute CompressionDat Thanh Nguyen, Daniel Zieger, Marc Stamminger, André Kaup. 3355-3360 [doi]
- Quantization After Inter Prediction in Displacement Coding of Dynamic MeshesHitoshi Nishimura, Haruhisa Kato, Kei Kawamura. 3361-3367 [doi]
- Minimization of Submesh Boundary Errors In Dynamic Mesh CodingKoki Kishimoto, Kei Kawamura, Haruhisa Kato. 3368-3374 [doi]
- Enhancing TMIV Performance Through Proximity-Aware Grouping and Preservation of Small ClustersMahshad MahdaviMoghadam, Stéphane Coulombe, Carlos Vázquez 0001, Mohammadreza Jamali, Ahmad Vakili. 3375-3381 [doi]
- Learning-Based Point Cloud Decoding with Independent and Scalable Reduced ComplexityMohammadreza Ghafari, André F. R. Guarda, Nuno M. M. Rodrigues, Fernando Pereira 0001. 3382-3388 [doi]
- Uncertainty-Aware AB3DMOT by Variational 3D Object DetectionIllia Oleksiienko, Alexandros Iosifidis. 3389-3395 [doi]
- Rdssd: 3D Single Stage Object Detector For Roadside Lidar SensorsConghao Lv, Ping Jiang 0001, Meng Wang, Lixin Lin, Xuechen Chen, Xiaoheng Deng. 3396-3402 [doi]
- Unleashing the Power of Generalized Iterative Closest Point for Swift and Effective Point Cloud RegistrationEfthymios Koukoulis, Gerasimos Arvanitis, Konstantinos Moustakas. 3403-3409 [doi]
- Point Cloud Geometry Scalable Coding with a Quality-Conditioned Latents Probability EstimatorDaniele Mari, André F. R. Guarda, Nuno M. M. Rodrigues, Simone Milani, Fernando Pereira 0001. 3410-3416 [doi]
- An Explainable Spectral Analysis For Light Field Image Quality AssessmentShengyang Zhao, Xin Jin 0014. 3417-3423 [doi]
- Super-Resolution for Near-Eye Light Field Display in Fourier SpaceYu-Hsiang Huang, Wei Wang, Homer H. Chen. 3424-3430 [doi]
- Two-Level Intra Prediction Using High-Order Macropixel Neighbors For Plenoptic Video CodingVinh Van Duong, Thuc Nguyen Huu, Jonghoon Yim, Byeungwoo Jeon. 3431-3435 [doi]
- Single-Panorama Classification of 3D Objects Using Horizontally Stacked Dilated ConvolutionsRômulo Marconato Stringhini, Thiago S. Lermen, Thiago L. T. da Silveira, Cláudio R. Jung. 3436-3442 [doi]
- MVCrackViT: Robust Multi-View Crack Detection For Point Cloud Segmentation Using View AttentionChristian Benz, Volker Rodehorst. 3443-3449 [doi]
- Multi-Reference Flow-Guided Cross-Domain Reconstruction For General Object 6D Pose EstimationJaewoo Park, Jaeguk Kim, Nam Ik Cho. 3450-3456 [doi]
- Partial Inter-Frame Coding for Dynamic MeshesXudong Jin, Jianfeng Xu, Kei Kawamura. 3457-3463 [doi]
- Real-Time Monocular Depth Estimation on Embedded SystemsCheng Feng, Congxuan Zhang, Zhen Chen, Weiming Hu, Liyue Ge. 3464-3470 [doi]
- Uncalibrated and Unsupervised Photometric Stereo with Piecewise RegularizerAlejandro Casanova, Antonio Agudo. 3471-3476 [doi]
- Fine-Detailed Neural Indoor Scene Reconstruction Using Multi-Level Importance Sampling And Multi-View ConsistencyXinghui Li, Yuchen Ji, Xiansong Lai, Wanting Zhang, Long Zeng 0001. 3477-3483 [doi]
- DALSM: A Direction-Aware Line Segment Matching MethodZhiyu Liu, Baojiang Zhong. 3484-3490 [doi]
- Confidence Aware Stereo Matching for Realistic Cluttered ScenarioJunhong Min, Youngpil Jeon. 3491-3497 [doi]
- Camera Calibration Through Geometric Constraints from Rotation and Projection MatricesMuhammad Waleed, Abdul Rauf, Murtaza Taj. 3498-3504 [doi]
- Combining Raft-Based Stereo Disparity and Optical Flow Models For Scene Flow EstimationHuizhu Pan, Ling Li 0006, Senjian An, Hui Xie. 3505-3511 [doi]
- Fisheye Stereo Camera Using Fisheye Vertical Stereo MethodHikaru Chikugo, Kento Arai, Sarthak Pathak, Kazunori Umeda. 3512-3518 [doi]
- Context-Adaptive Entropy Model With Adapters For Lossless Point Cloud Geometry CompressionYutong Zhang, Wenbo Zhao 0004, Daxin Li, Junjun Jiang, Xianming Liu. 3519-3525 [doi]
- Robust 3D Semantic Segmentation With Incomplete Point Clouds Based on Sequential Frame SamplingMasahiro Yamaguchi, Kyota Higa, Toshinori Hosoi, Takashi Shibata 0001. 3526-3532 [doi]
- Category-Agnostic Pose Estimation for Point CloudsBowen Liu, Wei Liu, Siang Chen, Pengwei Xie, Guijin Wang. 3533-3539 [doi]
- ResNeRF-PCAC: Super Resolving Residual Learning NeRF for High Efficiency Point Cloud Attributes CodingSajid Umair, Birendra Kathariya, Zhu Li, Anique Akhtar, Geert Van Der Auwera. 3540-3546 [doi]
- RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point CloudsRemco Royen, Kostas Pataridis, Ward Van Der Tempel, Adrian Munteanu 0001. 3547-3553 [doi]
- Mamba-PCGC: Mamba-Based Point Cloud Geometry CompressionMonyneath Yim, Jui-Chiu Chiang. 3554-3560 [doi]
- 3D Clothed Human Reconstruction From One In-the-Wild RGB ImageLiangjing Shao, Benshuang Chen, Xinrong Chen. 3561-3567 [doi]
- Self-Supervised Multi-View Stereo with Adaptive Depth PriorsLintao Xiang, Hujun Yin. 3568-3574 [doi]
- Analyzing Visible Articulatory Movements in Speech Production For Speech-Driven 3D Facial AnimationHyung Kyu Kim, Sangmin Lee 0001, Hak Gu Kim. 3575-3579 [doi]
- Adaptive Spatial-Temporal Modelling For Human Motion PredictionJianhua Zhang, Huiyu Zhou 0001, Na Lv. 3580-3586 [doi]
- Hand-Object Reconstruction Via Interaction-Aware Graph Attention MechanismTaeyun Woo, Tae-Kyun Kim 0001, Jinah Park. 3587-3593 [doi]
- Directional Antenna Systems for Long-Range Through-Wall Human Activity RecognitionJulian Strohmayer, Martin Kampel. 3594-3599 [doi]
- Binary-Decomposed Vision Transformer: Compressing and Accelerating Vision Transformer by Binary DecompositionRyota Kondo, Hiroaki Minoura, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi. 3600-3605 [doi]
- Empirical Research On Quantization For 3D Multi-Modal Vit ModelsZicong Hu, Jian Cao 0002, Weichen Xu, Ruilong Ren, Tianhao Fu, Xinxin Xu 0006, Xing Zhang 0002. 3606-3612 [doi]
- A New Efficient Split & Merge Algorithm for Embedded SystemsNathan Maurice, Julien Sopena, Lionel Lacassagne. 3613-3619 [doi]
- Temporal Clustering and Temporal Reference Based Specular Detection For 1-MS Visual Feedback SystemTingting Hu, Ryuji Fuchikami, Shigekiyo Nosaka. 3620-3626 [doi]
- Characterization Of Dim Light Response In DVS Pixel: Discontinuity of Event Triggering TimeXiao Jiang, Fei Zhou. 3627-3632 [doi]
- Adaptive Tilt-Series Alignment With Feature Resampling in Cryo-Electron TomographyRanhao Zhang, Mingtao Huang, Xueming Li, Yuan Shen 0001. 3633-3639 [doi]
- An Optimal Transport-Based Method For Medical Image GenerationBohan Lei, Yueting Zhuang, Xiaoyin Xu, Min Zhang. 3640-3646 [doi]
- Rate-Quality or Energy-Quality Pareto Fronts for Adaptive Video Streaming?Angeliki Katsenou, Xinyi Wang, Daniel Schien, David Bull 0001. 3647-3653 [doi]
- Energy Reduction Opportunities in HDR Video EncodingChristian Herglotz, Steven Le Moan, Alexandre Mercat. 3654-3660 [doi]
- Exploiting Change Blindness to Reduce Bitrate and Display Luminance in Video StreamingSteven Le Moan, Mitra Amiri, Christian Herglotz. 3661-3666 [doi]
- Quality of Experience of Viewport Adaptive Omnidirectional Video StreamingXuelin Liu, Haoyun Zhang, Jiebin Yan, Hao Zhang, Yuming Fang, Shiqi Wang. 3667-3673 [doi]
- On Efficient Neural Network Architectures for Image CompressionYichi Zhang, Zhihao Duan, Fengqing Zhu 0001. 3674-3680 [doi]
- Optimized Decoupled Structure with Non-Local Attention for Deep Image CompressionXuanye Zhang, Zhaobin Zhang, Yaojun Wu 0001, Semih Esenlik, Xiaoyan Sun, Kai Zhang, Li Zhang. 3681-3687 [doi]
- Optimizing Learned Image Compression On Scalar and Entropy-Constraint QuantizationFlorian Borzechowski, Michael Schäfer 0003, Heiko Schwarz, Jonathan Pfaff, Detlev Marpe, Thomas Wiegand. 3688-3694 [doi]
- Parallel Task-Prompts ICM: A Versatile Feature Codec for Machine VisionTianma Shen, Ying Liu. 3695-3701 [doi]
- Image Coding For Machines With Edge Information Learning Using Segment AnythingTakahiro Shindo, Kein Yamada, Taiju Watanabe, Hiroshi Watanabe. 3702-3708 [doi]
- Generative Visual Compression: A ReviewBolin Chen, Shanzhi Yin, Peilin Chen 0001, Shiqi Wang 0001, Yan Ye. 3709-3715 [doi]
- Learned Compression of Encoding DistributionsMateen Ulhaq, Ivan V. Bajic. 3716-3722 [doi]
- Learning-Based Video Compression with Continuously Variable Bitrate CodingMingyi Yang, Xionghui Mao, Yujie Yin, Zhiwei Zhu, Defa Wang, Shuai Wan, FuZheng Yang 0001. 3723-3729 [doi]
- Structured Pruning and Quantization for Learned Image CompressionMd Adnan Faisal Hossain, Fengqing Zhu 0001. 3730-3736 [doi]
- NN-Based In-Loop Filtering With Inputs TransformedDu Liu, Jacob Ström, Mitra Damghanian, Per Wennersten. 3737-3743 [doi]
- A Study on the Effect of Color Spaces in Learned Image CompressionSrivatsa Prativadibhayankaram, Mahadev Prasad Panda, Jürgen Seiler, Thomas Richter 0005, Heiko Sparenberg, Siegfried Foessel, André Kaup. 3744-3750 [doi]
- Res-NeRV: Residual Blocks For A Practical Implicit Neural Video DecoderMarwa Tarchouli, Thomas Guionnet, Marc Rivière, Wassim Hamidouche, Meriem Outtas, Olivier Déforges. 3751-3757 [doi]
- Joint End-to-End Image Compression and Denoising: Leveraging Contrastive Learning and Multi-Scale Self-OnnsYuxin Xie, Li Yu 0004, Farhad Pakdaman, Moncef Gabbouj. 3758-3764 [doi]
- Adaptive Downsampling and Spatial Upconversion for Point Cloud CompressionYichen Zhou, Xinfeng Zhang 0001, Yingzhan Xu, Kai Zhang 0007, Li Zhang 0006. 3765-3770 [doi]
- Improving Image Coding for Machines Through Optimizing Encoder Via Auxiliary LossKei Iino, Shunsuke Akamatsu, Hiroshi Watanabe, Shohei Enomoto, Akira Sakamoto, Takeharu Eda. 3771-3777 [doi]
- Towards the Detection of AI-Synthesized Human Face ImagesYuhang Lu, Touradj Ebrahimi. 3778-3784 [doi]
- Towards Privacy-Enhancing Provenance Annotations for ImagesNikolaos Fotos, Jaime Delgado. 3785-3791 [doi]
- On The Detection Of Images Generated From TextYuQing Yang, Charuka Moremada, Nikos Deligiannis. 3792-3798 [doi]
- An International Standard For Assessing Trustworthiness In MediaDeepayan Bhowmik, Sabrina B. Caldwell, Jaime Delgado, Touradj Ebrahimi, Nikolaos Fotos, Xiaojun Gu, Ziyuan Hu, Xin Kang 0001, Fernando Pereira 0001, Leonard Rosenthol, Frederik Temmermans, Haibo Zhou. 3799-3805 [doi]
- On the Exploitation of DCT-Traces in the Generative-AI DomainOrazio Pontorno, Luca Guarnera, Sebastiano Battiato. 3806-3812 [doi]
- Exploring the Impact of Moire Pattern on Deepfake DetectorsRazaib Tariq, Shahroz Tariq, Simon S. Woo. 3813-3819 [doi]
- Exposing the Limits of Deepfake Detection using novel Facial mole attack: A Perceptual Black-Box Adversarial Attack StudyQurat-ul Ain, Ali Javed, Khalid Mahmood Malik, Aun Irtaza. 3820-3826 [doi]
- AI-Generated Image Detection With Wasserstein Distance Compression and Dynamic AggregationZihang Lyu, Jun Xiao 0010, Cong Zhang, Kin-Man Lam 0001. 3827-3833 [doi]
- Increasing Trust in Image Analysis by Detecting Trellis Quantization in JPEG ImagesNora Hofer. 3834-3840 [doi]
- Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum DisorderHalil Ismail Helvaci, Chen-Nee Chuah, Sally Ozonoff, Sen-ching Samson Cheung. 3841-3847 [doi]
- Codamal: Contrastive Domain Adaptation for Malaria Detection in Low-Cost MicroscopesIshan Rajendrakumar Dave, Tristan de Blegiers, Chen Chen 0001, Mubarak Shah. 3848-3853 [doi]
- Clip-Medfake: Synthetic Data Augmentation With AI-Generated Content for Improved Medical Image ClassificationHonghui Chen, Baoquan Zhao, Guanghui Yue 0001, Weide Liu, Chenlei Lv, Ruomei Wang 0001, Fan Zhou 0001. 3854-3860 [doi]
- Deep Learning Approach for Renal Cell Carcinoma Detection, Subtyping, And GradingMaroof Abdul Aziz, Fatemeh Javadian, Sherin Susheel Mathew, Avinash Gopal, Johannes Stegmaier, Sonit Singh, Abin Jose. 3861-3867 [doi]
- Deepskinformer: Skin Lesion Segmentation Using Hierarchical Transformers And Edge EnhancementUfaq Khan, Umair Nawaz, Mustaqeem Khan, Wail Gueaieb, Abdulmotaleb El-Saddik. 3868-3874 [doi]
- Towards Better Control Of Latent Spaces For Face EditingSavas Özkan, Mete Özay. 3875-3881 [doi]
- How to Train Your VAEMariano Rivera. 3882-3888 [doi]
- Apnet: Generating Precise Anomaly Prior Information for Mixed-Supervised Defect DetectionGuanji Li, Hongxia Gao. 3889-3895 [doi]
- Defending Against Physical Adversarial Patch Attacks On Infrared Human DetectionLukas Strack, Futa Waseda, Huy H. Nguyen, Yinqiang Zheng, Isao Echizen. 3896-3902 [doi]
- Trustworthy Sr: Resolving Ambiguity In Image Super-Resolution Via Diffusion Models And Human FeedbackCansu Korkmaz, Ege Çirakman, A. Murat Tekalp, Zafer Dogan. 3903-3909 [doi]
- Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature GenerationAprilPyone MaungMaung, Huy H. Nguyen, Hitoshi Kiya, Isao Echizen. 3910-3916 [doi]
- Real-Time and Resource-Efficient Multi-Scale Adaptive Robotics Vision for Underwater Object Detection and Domain GeneralizationLyes Saad Saoud, Zhenwei Niu, Lakmal D. Seneviratne, Irfan Hussain. 3917-3923 [doi]
- Underwater Change Detection Using Multiple Sampling-Based Probabilistic Learner and Feature Preservance DiscriminatorMehvish Nissar, Badri Narayan Subudhi, Vinit Jakhetiya, Amit Kumar Mishra. 3924-3930 [doi]
- Domain Dilation for Single Domain GeneralizationYuehui Fan, Baoyao Yang, Meng Shen, Fei Lyu 0004. 3931-3937 [doi]
- Are Objective Explanatory Evaluation Metrics Trustworthy? An Adversarial AnalysisPrithwijit Chowdhury, Mohit Prabhushankar, Ghassan Alregib, Mohamed Deriche 0001. 3938-3944 [doi]
- Reinforcement Learning-Based Secure Video Transmission For IOV SystemsLixin Liu, Zhibo Liu, Xiaozhen Lu, Yanling Bu, Bin Han, Liang Xiao 0003. 3945-3950 [doi]
- JPEG Image Ciphering Based on Chaotic EncryptionMeha Hachani, Azza Ouled Zaid. 3951-3957 [doi]
- Pilot-Free Semantic Communication Over Multi-User MIMO Fading ChannelsWeixuan Chen, Qianqian Yang 0002, Zhaohui Yang 0001, Yiping Duan, Zhaoyang Zhang 0001. 3958-3964 [doi]
- Robust Skin Color Driven Privacy-Preserving Face Recognition Via Function Secret SharingDong Han, Yufan Jiang, Yong Li, Ricardo Mendes, Joachim Denzler. 3965-3971 [doi]
- Fantom: Federated Adversarial Network for Training Multi-Sequence Magnetic Resonance Imaging in Semantic SegmentationAnupam Borthakur, Apoorva Srivastava, Avik Kar, Dipayan Dewan, Debdoot Sheet. 3972-3978 [doi]
- A Modular and Robust Physics-Based Approach for Lensless Image ReconstructionYohann Perron, Eric Bezzam, Martin Vetterli. 3979-3985 [doi]
- Lensless Phase Retrieval With Regularization By Blind Noise Map Estimation and DenoisingIgor Shevkunov, Mykola Ponomarenko, Jere Heimo, Karen O. Egiazarian. 3986-3992 [doi]
- Highly Constrained Coded Aperture Imaging Systems Design Via a Knowledge Distillation ApproachLeon Suarez-Rodriguez, Roman Jacome, Henry Arguello. 3993-3999 [doi]
- Through-Wall Imaging Based On WiFi Channel State InformationJulian Strohmayer, Rafael Sterzinger, Christian Stippel, Martin Kampel. 4000-4006 [doi]
- Adversarial EM For Partially-Supervised Image-Quality Enhancement: Application To Low-Dose PET ImagingVatsala Sharma, Suyash P. Awate. 4007-4013 [doi]
- Learn By An Example Transformer For Domain Generalization In Video Object SegmentationIslam I. Osman, Mohamed S. Shehata. 4014-4020 [doi]
- Edge-Guided Pixel Level Connected Component Assisted Camouflaged Object DetectionQingwang Wang, Xin Qu, Liyao Zhou, Pengcheng Jin, Chengbiao Fu, Tao Shen 0004. 4021-4027 [doi]
- Luminate: Linguistic Understanding and Multi-Granularity Interaction for Video Object SegmentationRahul Tekchandani, Ritik Maheshwari, Praful Hambarde, Satya Narayan Tazi, Santosh Kumar Vipparthi, Subrahmanyam Murala. 4028-4034 [doi]
- Bi-Directional Tracklet Embedding for Multi-Object TrackingH. Çağrı Bilgi, A. Aydın Alatan. 4035-4041 [doi]
- A Confidence-Aware Matching Strategy For Generalized Multi-Object TrackingKyuJin Shim, Jubi Hwang, Kangwook Ko, Changick Kim. 4042-4048 [doi]