Abstract is missing.
- Efficient Video Encoder Autotuning via Offline Bayesian Optimization and Supervised LearningRoberto Azevedo, Yuanyi Xue, Xuewei Meng, Wenhao Zhang, Scott Labrozzi, Christopher Schroers. 1-6 [doi]
- Parametric Modeling and Estimation of Photon Registrations for 3D ImagingWeijian Zhang, Hashan K. Weerasooriya, Prateek Chennuri, Stanley H. Chan. 1-6 [doi]
- Inter-Camera Color Correction for Multispectral Imaging with Camera Arrays Using a Consensus ImageKatja Kossira, Jürgen Seiler, André Kaup. 1-6 [doi]
- Message from the MMSP 2024 General and Technical Program ChairsFengqing Maggie Zhu, Zhu Li, Nikolaos Thomos, Balu Adsumilli, Enrico Magli. 1-2 [doi]
- CT-Bound: Robust Boundary Detection from Noisy Images Via Hybrid Convolution and Transformer Neural NetworksWei Xu, Junjie Luo 0009, Qi Guo. 1-6 [doi]
- ECRF: Entropy-Constrained Neural Radiance Fields Compression with Frequency Domain OptimizationSoonbin Lee, Fangwen Shu, Yago Sánchez de la Fuente, Thomas Schierl, Cornelius Hellge. 1-6 [doi]
- Multi-Reference Generative Face Video Compression with Contrastive LearningGoluck Konuko, Giuseppe Valenzise. 1-6 [doi]
- Modeling the Energy Consumption of the HEVC Software Encoding Process Using Processor EventsGeetha Ramasubbu, André Kaup, Christian Herglotz. 1-6 [doi]
- Efficient Image Compression Using Advanced State Space ModelsBouzid Arezki, Anissa Mokraoui, Fangchen Feng. 1-6 [doi]
- Dual-Path Multi-Scale Transformer for High-Quality Image DerainingHuiling Zhou, Hongming Chen 0004, Xianhao Wu, Yufeng Li 0001. 1-6 [doi]
- Shadow Augmentation for Handwashing Action Recognition: From Synthetic to Real DatasetsShengtai Ju, Amy R. Reibman. 1-6 [doi]
- Lifelong Direct Error-Driven Learning for UAV Altitude Estimation in Different Weather ConditionsShirin Nasr Esfahani, Jagannathan Sarangapani. 1-6 [doi]
- Efficient Image Harmonization via RGB TransformationCheng Su, Jiande Sun 0001, Jinhui Wang, Wenbo Wan, Kai Zhang 0010, Jian Wang 0004. 1-6 [doi]
- LAM3D: Leveraging Attention for Monocular 3D Object DetectionDiana-Alexandra Sas, Leandro Di Bella, Yangxintong Lyu, Florin Oniga, Adrian Munteanu 0001. 1-6 [doi]
- Multi-network Ensembling for GAN Training and Adversarial AttacksShuting Zheng, Yuan-Gen Wang. 1-6 [doi]
- Efficient Microscopic Image Instance Segmentation for Food Crystal Quality ControlXiaoyu Ji 0004, Jan P. Allebach, Ali Shakouri, Fengqing Zhu 0001. 1-6 [doi]
- Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation Through Hybrid VisionAditya Krishnan 0003, Jayneel Vora, Prasant Mohapatra. 1-6 [doi]
- 3D Geometry Compression with Hybrid Framework: Quasi-JPEG and Phase EncodingWonbeen Oh, Jae-Sang Hyun. 1-6 [doi]
- Anomaly Detection in Satellite Videos Using Diffusion ModelsAkash Awasthi, Son T. Ly, Jaer Nizam, Videet Mehta, Safwan Ahmad, Ramakrishna Nemani, Saurabh Prasad, Hien Van Nguyen. 1-6 [doi]
- Sparse Convolution Based Point Cloud Attributes Deblocking with Graph Fourier Latent RepresentationMuhammad Talha, Birendra Kathariya, Zhu Li 0001, Anique Akhtar, Geert Van Der Auwera. 1-6 [doi]
- MultiFuser: Multimodal Fusion Transformer for Enhanced Driver Action RecognitionRuoyu Wang, Wenqian Wang, Jianjun Gao, Dan Lin, Kim-Hui Yap, Bingbing Li. 1-6 [doi]
- KPCA-CAM: Visual Explainability of Deep Computer Vision Models Using Kernel PCASachin Karmani, Thanushon Sivakaran, Gaurav Prasad, Mehmet-Ali, Wenbo Yang, Sheyang Tang. 1-5 [doi]
- Learned Multimodal Compression for Autonomous DrivingHadi Hadizadeh, Ivan V. Bajic. 1-6 [doi]
- Optimizing ROI Benefits Vehicle ReID in ITSMei Qiu, Lauren Ann Christopher, Lingxi Li 0001, Stanley Y. P. Chien, Yaobin Chen. 1-6 [doi]
- Efficient Partition Map Prediction via Token Sparsification for Fast VVC Intra CodingXinmin Feng, Li Li 0040, Dong Liu 0002, Feng Wu 0001. 1-6 [doi]
- Enhanced Multi-Resolution Generative Face Video CompressionRenjie Zhou, Ru-Ling Liao, Bolin Chen, Yan Ye, Jie Chen. 1-4 [doi]
- Dynamic 6-DoF Volumetric Video Generation: Software Toolkit and DatasetMufeng Zhu, Yuan-Chun Sun, Na Li, Jin Zhou, Songqing Chen, Cheng-Hsin Hsu, Yao Liu 0001. 1-6 [doi]
- Light-Weighted Temporal Evolution Inference for Generative Face Video CompressionZihan Zhang, Bolin Chen, Shanzhi Yin, Shiqi Wang 0001, Yan Ye. 1-6 [doi]
- Rate-Adaptive Joint Source Channel Coding Using Deep Block-Based Compressed SensingMohammad Amin Jarrahi, Eirina Bourtsoulatze, Vahid Abolghasemi. 1-6 [doi]
- An Exploration of Human Pose Estimation Based Cheating Tools for FPS Video Game and its Defense SolutionChang Liu, Zichun Gao, Zhenyu Liao, Yue Sun, Xianglong Feng. 1-6 [doi]
- Color-Guided Flying Pixel Correction in Depth ImagesEkamresh Vasudevan, Shashank N. Sridhara, Eduardo Pavez, Antonio Ortega, Raghavendra Singh, Srinath Kalluri. 1-6 [doi]
- Comparative Analysis and Performance Evaluation of Adaptive 360° Video DASH Streaming SolutionsAlireza M. Hosseini, Jacob Chakareski. 1-6 [doi]
- Synthetic Local Data AugmentationVasyl Chomko, Yuhao Chen 0001, David A. Clausi, Alexander Wong. 1-6 [doi]
- On the Rate-Distortion-Complexity Trade-Offs of Neural Video CodingYi-Hsin Chen, Kuan-Wei Ho, Martin Benjak, Jörn Ostermann, Wen-Hsiao Peng. 1-6 [doi]
- Multimodal Deep Learning for Diabetic Retinopathy Grading: Integrating Linear-Radon Sinograms and Retinal Fundus ImagesFarida Mohsen, Uzair Shah, Ashhadul Islam, Zubair Shah, Samir Brahim Belhaouari. 1-5 [doi]
- Dynamic Crowd Routing: RL-Driven Crowd DynamicsDaniele Della Pietra, Nicola Garau, Nicola Conci, Fabrizio Granelli. 1-6 [doi]
- Pose Guided Portrait View Interpolation from Dual Cameras with a Long BaselineWeichen Xu, Yezhi Shen, Qian Lin 0001, Jan P. Allebach, Fengqing Zhu 0001. 1-6 [doi]
- Pattern Template Manifest for Live Video StreamingYongjun Wu, Kyle Koceski, Mairo Pedrini, Sally Cheng, Parminder Singh. 1-5 [doi]
- Analysis and Improvement of Rank-Ordered Mean Algorithm in Single-Photon LiDARWilliam C. Yau, Weijian Zhang, Hashan Kavinga Weerasooriya, Stanley H. Chan. 1-6 [doi]
- Cross-Modal Distortion Approximation for Fast Bit Allocation of Video-Based Point Cloud CompressionHaichen Yang, Yujie Zhang, Qi Yang 0003, Ziyu Shan, Yiling Xu, Yunfeng Guan 0001. 1-6 [doi]
- Bayesian Formulation of Regularization by Denoising - Model and Monte Carlo SamplingElhadji C. Faye, Mame Diarra Fall, Aladine Chetouani, Nicolas Dobigeon. 1-5 [doi]
- A Spatiotemporal Decomposition of a Video Stream Based on the Retina-Inspired FilterEffrosyni Doutsi, Panagiotis Tsakalides. 1-6 [doi]
- FMiFood: Multi-Modal Contrastive Learning for Food Image ClassificationXinyue Pan, Jiangpeng He, Fengqing Zhu 0001. 1-6 [doi]
- Reveal Fluidity Behind Frames: A Multi-Modality Framework for Action Quality AssessmentSiyuan Xu, Peilin Chen, Yue Liu, Meng Wang 0017, Shiqi Wang 0001, Sam Kwong. 1-6 [doi]
- JPEG AI Compressed Domain Face DetectionAyman Alkhateeb, Alessandro Gnutti, Fabrizio Guerrini, Riccardo Leonardi, João Ascenso, Fernando Pereira 0001. 1-6 [doi]
- Scalable Image Coding for Humans and Machines Using Feature Fusion NetworkTakahiro Shindo, Taiju Watanabe, Yui Tatsumi, Hiroshi Watanabe. 1-6 [doi]
- Expanding the Effective Receptive Field for Learned Image CompressionYunhui Shi, Yalong Su, Jin Wang, Nam Ling, Baocai Yin. 1-6 [doi]
- Compression of Self-Supervised Representations for Machine VisionZhihao Duan, Fengqing Maggie Zhu. 1-6 [doi]
- Minimizing Human Labor for In-the-Wild Camera Trap Processing PipelineHaoyu Chen 0006, Amy R. Reibman. 1-6 [doi]
- Towards Reproducible Learning-Based CompressionJiahao Pang, Muhammad Asad Lodhi, Junghyun Ahn, Yuning Huang, Dong Tian. 1-6 [doi]
- Denoising for Neuromorphic Cameras Based on Graph Spectral FeaturesShimpei Harada, Junya Hara, Hiroshi Higashi, Yuichi Tanaka 0001. 1-6 [doi]
- Towards Light-Weight Transformer-Based Quality Assessment Metric for Augmented RealityAymen Sekhri, Seyed Ali Amirshahi, Mohamed-Chaker Larabi. 1-6 [doi]
- A Comparative Assessment of Implicit and Explicit Plenoptic Scene RepresentationsDavi Rabbouni Freitas, Ricardo L. de Queiroz, Ioan Tabus, Christine Guillemot. 1-6 [doi]
- Enhanced Product Classification Using Learned Prompt Ensembling and Dual Interpolation with CLIP-Based ModelTakahisa Yamamoto, Koichiro Niinuma, László A. Jeni. 1-6 [doi]
- Federated Data-Driven Kalman Filtering for State EstimationNikos Piperigkos, Alexandros Gkillas, Christos Anagnostopoulos, Aris S. Lalos. 1-6 [doi]
- Sketching-Based Acoustic Scene Change Detection in Low-Power Embedded DevicesTimm Koppelmann, Rainer Martin 0001. 1-6 [doi]
- Image Quality Assessment in End-to-end Face Analytics SystemsPraneet Singh, Amy R. Reibman. 1-6 [doi]
- End-to-End Compression of Complex-Valued SAR ImagesParas Maharjan, Corey Marrs, Zhu Li 0001. 1-6 [doi]
- Point Cloud Geometry Coding with Relational Neighborhood Self-AttentionMohammadreza Ghafari, André F. R. Guarda, Nuno M. M. Rodrigues, Fernando Pereira 0001. 1-6 [doi]
- Luma Range Scaling for Enhanced VVC Efficiency in Video Coding for MachinesTero Partanen, Alban Marie, Alexandre Mercat, Jarno Vanne, Miska M. Hannuksela, Honglei Zhang, Alireza Aminlou, Francesco Cricri. 1-6 [doi]
- Residual Domain Super-Resolution Generative Adversarial NetworksNelson C. Francisco, Julien Le Tanou. 1-6 [doi]
- Relative Altitude Estimation of Infrared Thermal UAV Images Using SIFT FeaturesShirin Nasr Esfahani, Jagannathan Sarangapani. 1-6 [doi]
- Selective Enablement of L4S Transport for Latency-Sensitive Multimedia DeliveryDhananjay Lal, Christopher Phillips. 1-6 [doi]
- Lightweight Reinforcement-Based Approach for HDR ConversionChansoon Heo, Byeungwoo Jeon. 1-5 [doi]
- Personalized Federated Learning for Cross-View Geo-LocalizationChristos Anagnostopoulos, Alexandros Gkillas, Nikos Piperigkos, Aris S. Lalos. 1-6 [doi]
- A Sharpness Based Loss Function for Removing Out-of-Focus BlurUditangshu Aurangabadkar, Darren Ramsook, Anil C. Kokaram. 1-6 [doi]
- Pixel-Weighted Multi-Pose Fusion for Metal Artifact Reduction in X-Ray Computed TomographyDiyu Yang, Craig A. J. Kemp, Soumendu Majee, Gregery T. Buzzard, Charles A. Bouman. 1-6 [doi]
- Extreme Low Bitrate Image Compression System for Mobile DeploymentJunqi Wu, Wenhong Duan, Xianping Ma, Jianhui Chang, Shanshe Wang, Siwei Ma, Chuanmin Jia. 1-6 [doi]
- Perception-Driven Point Cloud Quality Assessment Through Projections and Deep Structure SimilarityArthur H. S. Carvalho, Pedro Garcia Freitas, Mateus Gonçalves, Johann Homonnai, Mylène C. Q. Farias. 1-6 [doi]
- A Multi-Stream Fusion Approach with One-Class Learning for Audio-Visual Deepfake DetectionKyungbok Lee, You Zhang 0001, Zhiyao Duarr. 1-6 [doi]
- Effects of Delay on Nonverbal Behavior and Interpersonal Coordination in Video ConferencingChenyao Diao, Stephanie Arévalo Arboleda, Alexander Raake. 1-6 [doi]
- Exploiting Consistency-Preserving Loss and Perceptual Contrast Stretching to Boost SSL-Based Speech EnhancementMuhammad Salman Khan, Moreno La Quatra, Kuo-Hsuan Hung, Szu-Wei Fu, Sabato Marco Siniscalchi, Yu Tsao 0001. 1-6 [doi]
- Feature-Preserving Rate-Distortion Optimization in Image Coding for MachinesSamuel Fernández-Menduiña, Eduardo Pavez, Antonio Ortega. 1-6 [doi]
- Decoding Energy Optimization for Video Coding Using Model-Driven Gradient DescentChristian Herglotz, Matthias Kränzler, Bide Xu, André Kaup. 1-6 [doi]
- Robust Real-World Image Dehazing via Knowledge Guided Conditional Diffusion Model FinetuningHaoran Wei, Qingbo Wu 0001, Chenhao Wu, Shuai Chen, Lei Wang 0186, King Ngi Ngan, Fanman Meng, Hongliang Li 0001. 1-6 [doi]
- Diffusion-Based Bit-Depth ExpansionRiyu Lu, Lingyu Zhu 0006, Baoliang Chen, Xiaopeng Fan, Shiqi Wang 0001. 1-6 [doi]
- A Quantitative Metric of Confidence for Segmentation of Nuclei in Large Spatially Variable Image VolumesLiming Wu, Alain Chen, Paul Salama, Kenneth W. Dunn, Seth Winfree, Edward J. Delp. 1-6 [doi]
- Embedded Bit-Stream Region-of-Interest Coding of Point Cloud AttributesVictor F. Figueiredo, Ricardo L. de Queiroz. 1-6 [doi]
- Embedding Similarity Learning for Extreme License Plate Super-ResolutionAbderrezzaq Sendjasni, Mohamed-Chaker Larabi. 1-6 [doi]