Abstract is missing.
- Memory-efficient and GPU-oriented visual anomaly detection with incremental dimension reductionTeng-Yok Lee, Yusuke Nagai, Akira Minezawa. 1-9 [doi]
- Selective Bokeh Effect TransformationJuewen Peng, Zhiyu Pan, Chengxin Liu, Xianrui Luo, Huiqiang Sun, Liao Shen, Ke Xian, Zhiguo Cao 0001. 1-9 [doi]
- Learning unbiased classifiers from biased data with meta-learningRuggero Ragonesi, Pietro Morerio, Vittorio Murino. 1-9 [doi]
- The Casual Conversations v2 Dataset : A diverse, large benchmark for measuring fairness and robustness in audio/vision/speech modelsBilal Porgali, Vítor Albiero, Jordan Ryda, Cristian Canton-Ferrer, Caner Hazirbas. 10-17 [doi]
- Schrödinger's Camera: First Steps Towards a Quantum-Based Privacy Preserving CameraHannah Kirkland, Sanjeev J. Koppal. 18-27 [doi]
- Robustness Against Gradient based Attacks through Cost Effective Network Fine-TuningAkshay Agarwal 0001, Nalini K. Ratha, Richa Singh 0001, Mayank Vatsa. 28-37 [doi]
- Gradient Attention Balance Network: Mitigating Face Recognition Racial Bias via Gradient AttentionLinzhi Huang, Mei Wang, Jiahao Liang, Weihong Deng, Hongzhi Shi, Dongchao Wen, Yingjie Zhang, Jian Zhao. 38-47 [doi]
- Estimating and Maximizing Mutual Information for Knowledge DistillationAman Shrivastava, Yanjun Qi, Vicente Ordonez. 48-57 [doi]
- Synthetic Sample Selection for Generalized Zero-Shot LearningShreyank N. Gowda. 58-67 [doi]
- MMRNet: Improving Reliability for Multimodal Object Detection and Segmentation for Bin Picking via Multimodal RedundancyYuhao Chen, Hayden Gunraj, E. Zhixuan Zeng, Robbie Meyer, Maximilian Gilles, Alexander Wong. 68-77 [doi]
- DPPD: Deformable Polar Polygon Object DetectionYang Zheng, Oles Andrienko, Yonglei Zhao, Minwoo Park, Trung Pham. 78-87 [doi]
- Joint Camera and LiDAR Risk AnalysisOliver Zendel 0001, Johannes Huemer, Markus Murschitz, Gustavo Fernández Domínguez, Amadeus Lobe. 88-97 [doi]
- Exploiting the Complementarity of 2D and 3D Networks to Address Domain-Shift in 3D Semantic SegmentationAdriano Cardace, Pierluigi Zama Ramirez, Samuele Salti, Luigi di Stefano. 98-109 [doi]
- Training Strategies for Vision Transformers for Object DetectionApoorv Singh. 110-118 [doi]
- EGA-Depth: Efficient Guided Attention for Self-Supervised Multi-Camera Depth EstimationYunxiao Shi, Hong Cai, Amin Ansari, Fatih Porikli. 119-129 [doi]
- Improving Rare Classes on nuScenes LiDAR segmentation Through Targeted Domain AdaptationVickram Rajendran, Chuck Tang, Frits van Paasschen. 130-139 [doi]
- Does Image Anonymization Impact Computer Vision Training?Håkon Hukkelås, Frank Lindseth. 140-150 [doi]
- MotionTrack: End-to-End Transformer-based Multi-Object Tracking with LiDAR-Camera FusionCe Zhang, Chengjie Zhang, Yiluan Guo, Lingji Chen, Michael Happold. 151-160 [doi]
- HazardNet: Road Debris Detection by Augmentation of Synthetic ModelsTae Eun Choe, Jane Wu, Xiaolin Lin, Karen Kwon, Minwoo Park. 161-171 [doi]
- FUTR3D: A Unified Sensor Fusion Framework for 3D DetectionXuanyao Chen, Tianyuan Zhang 0002, Yue Wang, Yilun Wang, Hang Zhao. 172-181 [doi]
- RadarGNN: Transformation Invariant Graph Neural Network for Radar-based PerceptionFelix Fent, Philipp Bauerschmidt, Markus Lienkamp. 182-191 [doi]
- MobileDeRainGAN: An Efficient Semi-Supervised Approach to Single Image Rain Removal for Task-Driven ApplicationsRuphan Swaminathan, Pradyot V. N. Korupolu. 192-201 [doi]
- TorchSparse++: Efficient Point Cloud EngineHaotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang 0002, Song Han 0003. 202-209 [doi]
- Ultra-Sonic Sensor based Object Detection for Autonomous VehiclesTommaso Nesti, Santhosh Boddana, Burhaneddin Yaman. 210-218 [doi]
- Improvements to Image Reconstruction-Based Performance Prediction for Semantic Segmentation in Highly Automated DrivingAndreas Bär, Daniel Kusuma, Tim Fingscheidt. 219-229 [doi]
- LiDAR-Based Localization on Highways Using Raw Data and Pole-Like Object FeaturesSheng-Cheng Lee, Victor Lu, Chieh-Chih Wang, Wen-Chieh Lin. 230-237 [doi]
- Zero-shot Classification at Different Levels of GranularityMatías Molina. 238-244 [doi]
- Difficulty Estimation with Action Scores for Computer Vision TasksOctavio Arriaga, Sebastian Palacio, Matias Valdenegro-Toro. 245-253 [doi]
- Detail-Preserving Self-Supervised Monocular Depth with Self-Supervised Structural SharpeningJuan Luis Gonzalez Bello, Jaeho Moon, Munchurl Kim. 254-264 [doi]
- LD-GAN: Low-Dimensional Generative Adversarial Network for Spectral Image Generation with Variance RegularizationEmmanuel Martinez, Roman Jacome, Alejandra Hernandez-Rojas, Henry Arguello. 265-275 [doi]
- Isolated Sign Language Recognition based on Tree Structure Skeleton ImagesDavid Laines, Miguel González-Mendoza 0001, Gilberto Ochoa-Ruiz, Gissella Bejarano. 276-284 [doi]
- SUPRA: Superpixel Guided Loss for Improved Multi-modal Segmentation in EndoscopyRafael Martinez Garcia Peña, Mansoor Ali Teevno, Gilberto Ochoa-Ruiz, Sharib Ali. 285-294 [doi]
- Deep Prototypical-Parts Ease Morphological Kidney Stone Identification and are Competitively Robust to Photometric PerturbationsDaniel Flores-Araiza, Francisco Javier Lopez-Tiro, Jonathan El Beze, Jacques Hubert, Miguel González-Mendoza 0001, Gilberto Ochoa-Ruiz, Christian Daul. 295-304 [doi]
- Wildlife Image Generation from Scene GraphsYoshio Rubio, Marco A. Contreras-Cruz. 305-314 [doi]
- Towards Characterizing the Semantic Robustness of Face RecognitionJuan C. Pérez, Motasem Alfarra, Ali K. Thabet, Pablo Arbeláez, Bernard Ghanem. 315-325 [doi]
- High-level context representation for emotion recognition in imagesWillams de Lima Costa, Estefania Talavera Martínez, Lucas Silva Figueiredo, Veronica Teichrieb. 326-334 [doi]
- Mitigating Catastrophic Interference using Unsupervised Multi-Part Attention for RGB-IR Face RecognitionKshitij Nikhal, Nkiruka Uzuegbunam, Bridget Kennedy, Benjamin S. Riggan. 335-344 [doi]
- Multi-sensor Ensemble-guided Attention Network for Aerial Vehicle Perception Beyond Visible SpectrumAlicja Kwasniewska, Anastacia MacAllister, Rey Nicolas, Javier Garzás. 345-353 [doi]
- C-PLES: Contextual Progressive Layer Expansion with Self-attention for Multi-class Landslide Segmentation on Mars using Multimodal Satellite ImageryAbel A. Reyes, Sidike Paheding, A. Rajaneesh, K. S. Sajinkumar, Thomas Oommen. 354-364 [doi]
- Enhanced Thermal-RGB Fusion for Robust Object DetectionWassim A. El Ahmar, Yahya Massoud, Dhanvin Kolhatkar, Hamzah Alghamdi, Mohammad Al Ja'afreh, Robert Laganière, Riad I. Hammoud. 365-374 [doi]
- Detecting Underwater Discrete Scatterers in Echograms with Deep Learning-Based Semantic SegmentationRhythm Vohra, Femina Senjaliya, Melissa Cote, Amanda Dash, Alexandra Branzan Albu, Julek Chawarski, Steve Pearce, Kaan Ersahin. 375-384 [doi]
- A Meta-learning Approach for Domain Generalisation across Visual Modalities in Vehicle Re-identificationEleni Kamenou, Jesús Martínez del Rincón, Paul Miller 0003, Patricia Devlin-Hill. 385-393 [doi]
- VisiTherS: Visible-thermal infrared stereo disparity estimation of human silhouetteNoreen Anwar, Philippe Duplessis-Guindon, Guillaume-Alexandre Bilodeau, Wassim Bouachir. 394-402 [doi]
- Multimodal Object Detection by Channel Switching and Spatial AttentionYue Cao, Junchi Bin, Jozsef Hamari, Erik Blasch, Zheng Liu 0002. 403-411 [doi]
- Multi-modal Aerial View Object Classification Challenge Results - PBVS 2023Spencer Low, Oliver Nina, Angel Domingo Sappa, Erik Blasch, Nathan Inkawhich. 412-421 [doi]
- IR Reasoner: Real-time Infrared Object Detection by Visual ReasoningMeryem Mine Gündogan, Tolga Aksoy, Alptekin Temizel, Ugur Halici. 422-430 [doi]
- Photometric Correction for Infrared SensorsJinCheng Zhang, Andrew R. Willis, Kevin M. Brink. 431-439 [doi]
- Multispectral Contrastive Learning with Viewmaker NetworksJasmine Bayrooti, Noah D. Goodman, Alex Tamkin. 440-448 [doi]
- Spectral Transfer Guided Active Domain Adaptation For Thermal ImageryBerkcan Ustun, Ahmet Kagan Kaya, Ezgi Cakir Ayerden, Fazil Altinel. 449-458 [doi]
- Thermal Infrared Single Image Dehazing and Blind Image Quality AssessmentFabian Erlenbusch, Constanze Merkt, Bernardo de Oliveira, Alexander Gatter, Friedhelm Schwenker, Ulrich Klauck, Michael Teutsch. 459-469 [doi]
- Thermal Image Super-Resolution Challenge Results - PBVS 2023Rafael E. Rivadeneira, Angel Domingo Sappa, Boris Xavier Vintimilla, Chenyang Wang 0002, Junjun Jiang, Xianming Liu, Zhiwei Zhong, Dai Bin, Li Ruodi, Shengye Li. 470-478 [doi]
- A Three-Stage Framework with Reliable Sample Pool for Long-Tailed ClassificationFeng Cai, Keyu Wu, Haipeng Wang, Feng Wang. 479-486 [doi]
- DeepMAO: Deep Multi-scale Aware Overcomplete Network for Building Segmentation in Satellite ImageryAniruddh Sikdar, Sumanth Udupa, Prajwal Gurunath, Suresh Sundaram. 487-496 [doi]
- MoundCount: A detection-based approach for automatic counting of planting microsites on UAV imagesAhmed Zgaren, Wassim Bouachir, Nizar Bouguila, Riad I. Hammoud. 497-506 [doi]
- CoReFusion: Contrastive Regularized Fusion for Guided Thermal Super-ResolutionAditya Kasliwal, Pratinav Seth, Sriya Rallabandi, Sanchit Singhal. 507-514 [doi]
- Multi-modal Aerial View Image Challenge: Translation from Synthetic Aperture Radar to Electro-Optical Domain Results - PBVS 2023Spencer Low, Oliver Nina, Angel Domingo Sappa, Erik Blasch, Nathan Inkawhich. 515-523 [doi]
- Seeing Through the Data: A Statistical Evaluation of Prohibited Item Detection Benchmark Datasets for X-ray Security ScreeningBrian K. S. Isaac-Medina, Seyma Yucer, Neelanjan Bhowmik, Toby P. Breckon. 524-533 [doi]
- Appearance Label Balanced Triplet Loss for Multi-modal Aerial View Object ClassificationRaghunath Sai Puttagunta, Zhu Li 0001, Shuvra S. Bhattacharyya, George York. 534-542 [doi]
- Topology Preserving Compositionality for Robust Medical Image SegmentationAinkaran Santhirasekaram, Mathias Winkler, Andrea G. Rockall, Ben Glocker. 543-552 [doi]
- Shape and Intensity Analysis of Glioblastoma Multiforme TumorsYi Tang Chen, Sebastian Kurtek. 553-560 [doi]
- Robust Hierarchical Symbolic Explanations in Hyperbolic Space for Image ClassificationAinkaran Santhirasekaram, Avinash Kori, Mathias Winkler, Andrea G. Rockall, Francesca Toni, Ben Glocker. 561-570 [doi]
- Euler Characteristic Transform Based Topological Loss for Reconstructing 3D Images from Single 2D SlicesKalyan Varma Nadimpalli, Amit Chattopadhyay, Bastian Rieck. 571-579 [doi]
- Topology-Aware Focal Loss for 3D Image SegmentationAndac Demir, Elie Massaad, Bulent Kiziltan. 580-589 [doi]
- Hamming Similarity and Graph Laplacians for Class Partitioning and Adversarial Image DetectionHuma Jamil, Yajing Liu, Turgay Caglar, Christina M. Cole, Nathaniel Blanchard, Christopher Peterson 0001, Michael Kirby. 590-599 [doi]
- TopFusion: Using Topological Feature Space for Fusion and Imputation in Multi-Modal DataAudun Myers, Henry Kvinge, Tegan Emerson. 600-609 [doi]
- Quantifying Extrinsic Curvature in Neural ManifoldsFrancisco Acosta, Sophia Sanborn, Khanh Dao Duc, Manu S. Madhav, Nina Miolane. 610-619 [doi]
- Making Corgis Important for Honeycomb Classification: Adversarial Attacks on Concept-based Explainability ToolsDavis Brown, Henry Kvinge. 620-627 [doi]
- Face Animation with an Attribute-Guided Diffusion ModelBohan Zeng, Xuhui Liu, Sicheng Gao, Boyu Liu, Hong Li, Jianzhuang Liu, Baochang Zhang 0001. 628-637 [doi]
- Explore the Power of Synthetic Data on Few-shot Object DetectionShaobo Lin, Kun Wang, Xingyu Zeng, Rui Zhao. 638-647 [doi]
- Internal Diverse Image CompletionNoa Alkobi, Tamar Rott Shaham, Tomer Michaeli. 648-658 [doi]
- Leveraging GANs for data scarcity of COVID-19: Beyond the hypeHazrat Ali, Christer Grönlund, Zubair Shah. 659-667 [doi]
- Face Transformer: Towards High Fidelity and Accurate Face SwappingKaiwen Cui, Rongliang Wu, Fangneng Zhan, Shijian Lu. 668-677 [doi]
- Controllable GAN Synthesis Using Non-Rigid Structure-from-MotionRené Haas, Stella Graßhof, Sami S. Brandt. 678-687 [doi]
- Discovering Class-Specific GAN Controls for Semantic Image SynthesisEdgar Schönfeld, Julio Borges, Vadim Sushko, Bernt Schiele, Anna Khoreva. 688-697 [doi]
- One-shot Unsupervised Domain Adaptation with Personalized Diffusion ModelsYasser Benigmim, Subhankar Roy, Slim Essid, Vicky Kalogeiton, Stéphane Lathuilière. 698-708 [doi]
- DeSRF: Deformable Stylized Radiance FieldShiyao Xu, Lingzhi Li 0002, Li Shen 0005, Zhouhui Lian. 709-718 [doi]
- Unsupervised Style-based Explicit 3D Face Reconstruction from Single ImageHeng Yu, Zoltan A. Milacski, László A. Jeni. 719-729 [doi]
- Generating Adversarial Attacks in the Latent SpaceNitish Shukla, Sudipta Banerjee. 730-739 [doi]
- Unsupervised Bidirectional Style Transfer Network using Local Feature Transform ModuleKangmin Bae, Hyung-il Kim, Yongjin Kwon, Jinyoung Moon. 740-749 [doi]
- Improving Normalizing Flows with the Approximate Mass for Out-of-Distribution DetectionSamy Chali, Inna Kucher, Marc Duranton, Jacques-Olivier Klein. 750-758 [doi]
- Scene Graph Driven Text-Prompt Generation for Image InpaintingTripti Shukla, Paridhi Maheshwari, Rajhans Singh, Ankita Shukla, Kuldeep Kulkarni, Pavan K. Turaga. 759-768 [doi]
- Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable DiffusionJordan Shipard, Arnold Wiliem, Kien Nguyen Thanh, Wei Xiang 0001, Clinton Fookes. 769-778 [doi]
- Benchmarking Robustness to Text-Guided CorruptionsMohammadreza Mofayezi, Yasamin Medghalchi. 779-786 [doi]
- Look ATME: The Discriminator Mean Entropy Needs AttentionEdgardo Solano-Carrillo, Ángel Bueno Rodríguez, Borja Carrillo-Perez, Yannik Steiniger, Jannis Stoppe. 787-796 [doi]
- Diffusion-Enhanced PatchMatch: A Framework for Arbitrary Style Transfer with Diffusion ModelsMark Hamazaspyan, Shant Navasardyan. 797-805 [doi]
- Identity-driven Three-Player Generative Adversarial Network for Synthetic-based Face RecognitionJan Niklas Kolf, Tim Rieber, Jurek Elliesen, Fadi Boutros, Arjan Kuijper, Naser Damer. 806-816 [doi]
- GAN-based Vision Transformer for High-Quality Thermal Image EnhancementMohamed Amine Marnissi, Abir Fathallah. 817-825 [doi]
- Vision + Language Applications: A SurveyYutong Zhou, Nobutaka Shimada. 826-842 [doi]
- Universal Guidance for Diffusion ModelsArpit Bansal, Hong-Min Chu, Avi Schwarzschild, Soumyadip Sengupta, Micah Goldblum, Jonas Geiping, Tom Goldstein. 843-852 [doi]
- Exploring Compositional Visual Generation with Latent Classifier GuidanceChanghao Shi, Haomiao Ni, Kai Li 0012, Shaobo Han, Mingfu Liang, Martin Renqiang Min. 853-862 [doi]
- A Geometric and Photometric Exploration of GAN and Diffusion Synthesized FacesMatyás Bohácek, Hany Farid. 874-883 [doi]
- Exposing GAN-Generated Profile Photos from Compact EmbeddingsShivansh Mundra, Gonzalo J. Aniano Porcile, Smit Marvaniya, James R. Verbus, Hany Farid. 884-892 [doi]
- AutoSplice: A Text-prompt Manipulated Image Dataset for Media ForensicsShan Jia, Mingzhen Huang, Zhou Zhou, Yan Ju, Jialing Cai, Siwei Lyu. 893-903 [doi]
- AI-Synthesized Voice Detection Using Neural Vocoder ArtifactsChengzhe Sun, Shan Jia, Shuwei Hou, Siwei Lyu. 904-912 [doi]
- EKILA: Synthetic Media Provenance and Attribution for Generative ArtKar Balan, Shruti Agarwal, Simon Jenni, Andy Parsons, Andrew Gilbert, John P. Collomosse. 913-922 [doi]
- Harnessing the Power of Text-image Contrastive Models for Automatic Detection of Online MisinformationHao Chen, Peng Zheng, Xin Wang, Shu Hu, Bin Zhu, Jinrong Hu, Xi Wu, Siwei Lyu. 923-932 [doi]
- RoSteALS: Robust Steganography using Autoencoder Latent SpaceTu Bui, Shruti Agarwal, Ning Yu, John P. Collomosse. 933-942 [doi]
- Audio-Visual Person-of-Interest DeepFake DetectionDavide Cozzolino, Alessandro Pianese, Matthias Nießner, Luisa Verdoliva. 943-952 [doi]
- Open Set Classification of GAN-based Image Manipulations via a ViT-based Hybrid ArchitectureJun Wang, Omran Alamayreh, Benedetta Tondi, Mauro Barni. 953-962 [doi]
- MTN: Forensic Analysis of MP4 Video Files Using Graph Neural NetworksZiyue Xiang, Amit Kumar Singh Yadav, Paolo Bestagini, Stefano Tubaro, Edward J. Delp. 963-972 [doi]
- Intriguing properties of synthetic images: from generative adversarial networks to diffusion modelsRiccardo Corvi, Davide Cozzolino, Giovanni Poggi, Koki Nagano, Luisa Verdoliva. 973-982 [doi]
- Defending Low-Bandwidth Talking Head Videoconferencing Systems From Real-Time Puppeteering AttacksDanial Samadi Vahdati, Tai D. Nguyen, Matthew C. Stamm. 983-992 [doi]
- Multimodaltrace: Deepfake Detection using Audiovisual Representation LearningMuhammad Anas Raza, Khalid Mahmood Malik. 993-1000 [doi]
- Exposing Fine-Grained Adversarial Vulnerability of Face Anti-Spoofing ModelsSonglin Yang, Wei Wang 0025, Chenye Xu, Ziwen He, Bo Peng 0002, Jing Dong 0003. 1001-1010 [doi]
- Robust Partial Fingerprint RecognitionYufei Zhang, Rui Zhao, Ziyi Zhao, Naveen Ramakrishnan, Manoj Aggarwal, Gerard Medioni, Qiang Ji. 1011-1020 [doi]
- PIC-Score: Probabilistic Interpretable Comparison Score for Optimal Matching Confidence in Single- and Multi-Biometric Face RecognitionPedro C. Neto, Ana F. Sequeira, Jaime S. Cardoso 0001, Philipp Terhörst. 1021-1029 [doi]
- Gait Recognition from Fisheye ImagesChi Xu 0003, Yasushi Makihara, Xiang Li 0028, Yasushi Yagi. 1030-1040 [doi]
- Face Recognition Accuracy Across Demographics: Shining a Light Into the ProblemHaiyu Wu, Vítor Albiero, K. S. Krishnapriya, Michael C. King, Kevin W. Bowyer. 1041-1050 [doi]
- BeCAPTCHA-Type: Biometric Keystroke Data Generation for Improved Bot DetectionDaniel DeAlcala, Aythami Morales, Rubén Tolosana, Alejandro Acien, Julian Fiérrez, Santiago Hernandez, Miguel A. Ferrer, Moisés Díaz. 1051-1060 [doi]
- SynthASpoof: Developing Face Presentation Attack Detection Based on Privacy-friendly Synthetic DataMeiling Fang, Marco Huber, Naser Damer. 1061-1070 [doi]
- The Universal Face Encoder: Learning Disentangled Representations Across Different AttributesSandipan Banerjee, Ajjen Joshi, Jay Turcot. 1071-1080 [doi]
- A Closer Look at Geometric Temporal Dynamics for Face Anti-SpoofingChih-Jung Chang, Yaw-Chern Lee, Shih-Hsuan Yao, Min-Hung Chen, Chien-Yi Wang, Shang-Hong Lai, Trista Pei-chun Chen. 1081-1091 [doi]
- FlexiCurve: Flexible Piecewise Curves Estimation for Photo RetouchingChongyi Li, Chunle Guo, Shangchen Zhou, Qiming Ai, Ruicheng Feng, Chen Change Loy. 1092-1101 [doi]
- BeautyREC: Robust, Efficient, and Component-Specific Makeup TransferQixin Yan, Chunle Guo, Jixin Zhao, Yuekun Dai, Chen Change Loy, Chongyi Li. 1102-1110 [doi]
- SCONE-GAN: Semantic Contrastive learning-based Generative Adversarial Network for an end-to-end image translationIman Abbasnejad, Fabio Zambetta, Flora D. Salim, Timothy Wiley, Jeffrey Chan, Russell Gallagher, Ehsan Abbasnejad. 1111-1120 [doi]
- Adaptive Human-Centric Video Compression for Humans and MachinesWei Jiang, Hyomin Choi, Fabien Racapé. 1121-1129 [doi]
- ProgDTD: Progressive Learned Image Compression with Double-Tail-Drop TrainingAli Hojjat, Janek Haberer, Olaf Landsiedel. 1130-1139 [doi]
- RB-Dust - A Reference-based Dataset for Vision-based Dust RemovalPeter Buckel, Timo Oksanen, Thomas Dietmueller. 1140-1149 [doi]
- Quantum Annealing for Single Image Super-ResolutionHan Yao Choong, Suryansh Kumar 0001, Luc Van Gool. 1150-1159 [doi]
- Unlimited-Size Diffusion RestorationYinhuai Wang, Jiwen Yu, Runyi Yu, Jian Zhang 0018. 1160-1167 [doi]
- Benchmark Dataset and Effective Inter-Frame Alignment for Real-World Video Super-ResolutionRuohao Wang, Xiaohui Liu 0003, Zhilu Zhang, Xiaohe Wu, Chun-Mei Feng, Lei Zhang 0006, Wangmeng Zuo. 1168-1177 [doi]
- SS-TTA: Test-Time Adaption for Self-Supervised Denoising MethodsMasud An Nur Islam Fahim, Jani Boutellier. 1178-1187 [doi]
- High-Resolution Synthetic RGB-D Datasets for Monocular Depth EstimationAakash Rajpal, Noshaba Cheema, Klaus Illgner-Fehns, Philipp Slusallek, Sunil Prasad Jaiswal. 1188-1198 [doi]
- Expanding Synthetic Real-World Degradations for Blind Video Super ResolutionMehran Jeelani, Sadbhawna, Noshaba Cheema, Klaus Illgner-Fehns, Philipp Slusallek, Sunil Prasad Jaiswal. 1199-1208 [doi]
- Deep Dehazing Powered by Image Processing NetworkGuisik Kim, Jinhee Park, Junseok Kwon. 1209-1218 [doi]
- Denoising Diffusion Models for Plug-and-Play Image RestorationYuanzhi Zhu, Kai Zhang 0008, Jingyun Liang, Jiezhang Cao, Bihan Wen, Radu Timofte, Luc Van Gool. 1219-1229 [doi]
- Saliency-aware Stereoscopic Video RetargetingHassan Imani, Md Baharul Islam, Lai-Kuan Wong. 1230-1239 [doi]
- FRR-Net: A Real-Time Blind Face Restoration and Relighting NetworkSamira Pouyanfar, Sunando Sengupta, Mahmoud Mohammadi, Ebey Abraham, Brett Bloomquist, Lukas Dauterman, Anjali Parikh, Steve Lim, Eric Sommerlade. 1240-1250 [doi]
- Blind Image Inpainting via Omni-dimensional Gated Attention and Wavelet QueriesShruti S. Phutke, Ashutosh Kulkarni, Santosh Kumar Vipparthi, Subrahmanyam Murala. 1251-1260 [doi]
- Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline ResultsAndrei Dumitriu, Florin Tatui, Florin Miron, Radu-Tudor Ionescu, Radu Timofte. 1261-1271 [doi]
- High-Perceptual Quality JPEG Decoding via Posterior SamplingSean Man, Guy Ohayon, Theo Adrai, Michael Elad. 1272-1282 [doi]
- Large Kernel Distillation Network for Efficient Single Image Super-ResolutionChengxing Xie, Xiaoming Zhang, Linze Li, Haiteng Meng, Tianlin Zhang, Tianrui Li 0001, Xiaole Zhao. 1283-1292 [doi]
- OPDN: Omnidirectional Position-aware Deformable Network for Omnidirectional Image Super-ResolutionXiaopeng Sun, Weiqi Li, Zhenyu Zhang, Qiufang Ma, Xuhan Sheng, Ming Cheng, Haoyu Ma, Shijie Zhao, Jian Zhang, Junlin Li, Li Zhang 0006. 1293-1301 [doi]
- Zoom-VQA: Patches, Frames and Clips Integration for Video Quality AssessmentKai Zhao, Kun Yuan, Ming Sun, Xing Wen. 1302-1310 [doi]
- Pyramid Ensemble Structure for High Resolution Image Shadow RemovalShuhao Cui, Junshi Huang, Shuman Tian, Mingyuan Fan, Jiaqi Zhang, Li Zhu, Xiaoming Wei, Xiaolin Wei. 1311-1319 [doi]
- NTIRE 2023 Challenge on Light Field Image Super-Resolution: Dataset, Methods and ResultsYingqian Wang, Longguang Wang, Zhengyu Liang, Jungang Yang, Radu Timofte, Yulan Guo, Kai Jin, Zeqiang Wei, Angulia Yang, Sha Guo, Mingzhi Gao, Xiuzhuang Zhou, Vinh Van Duong, Thuc Nguyen Huu, Jonghoon Yim, Byeungwoo Jeon, Yutong Liu, Zhen Cheng, Zeyu Xiao, Ruikang Xu, Zhiwei Xiong, Gaosheng Liu, Manchang Jin, Huanjing Yue, Jingyu Yang, Chen Gao, Shuo Zhang, Song Chang, Youfang Lin, Wentao Chao, Xuechun Wang, Guanghui Wang, Fuqing Duan, Wang Xia, Yan Wang, Peiqi Xia, Shunzhou Wang, Yao Lu, Ruixuan Cong, Hao Sheng 0001, Da Yang, Rongshan Chen, Sizhe Wang, Zhenglong Cui, Yilei Chen, Yongjie Lu, Dongjun Cai, Ping An, Ahmed Salem 0005, Hatem Ibrahem, Bilel Yagoub, Hyun Soo Kang, Zekai Zeng, Heng Wu. 1320-1335 [doi]
- Learning Epipolar-Spatial Relationship for Light Field Image Super-ResolutionAhmed Salem 0005, Hatem Ibrahem, Hyun Soo Kang. 1336-1345 [doi]
- NTIRE 2023 Challenge on Stereo Image Super-Resolution: Methods and ResultsLongguang Wang, Yulan Guo, Yingqian Wang, Juncheng Li, Shuhang Gu, Radu Timofte, Ming Cheng, Haoyu Ma, Qiufang Ma, Xiaopeng Sun, Shijie Zhao, Xuhan Sheng, Yukang Ding, Ming Sun, Xing Wen, Dafeng Zhang, Jia Li, Fan Wang, Zheng Xie, Zongyao He, Zidian Qiu, Zilin Pan, Zhihao Zhan, Xingyuan Xian, Zhi Jin, Yuanbo Zhou, Wei Deng, Ruofeng Nie, Jiajun Zhang, Qinquan Gao, Tong Tong 0001, Kexin Zhang 0003, Junpei Zhang, Rui Peng, Yanbiao Ma, Licheng Jiao, Haoran Bai, Lingshun Kong, Jinshan Pan, Jiangxin Dong, Jinhui Tang 0001, Pu Cao, Tianrui Huang, Lu Yang, Qing Song, Bingxin Chen, Chunhua He, Meiyun Chen, Zijie Guo, Shaojuan Luo, Chengzhi Cao, Kunyu Wang, Fanrui Zhang, Qiang Zhang, Nancy Mehta, Subrahmanyam Murala, Akshay Dudhane, Yujin Wang, Lingen Li, Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He, Junyang Chen, Hao Li, Yukai Shi, Zhijing Yang, Wenbin Zou, Yunchen Zhang, Mingchao Jiang, ZhongXin Yu, Ming Tan, Hongxia Gao, Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön, Jingxiang Chen, Bo Yang, XiSheryl Zhang, Chenghua Li, Weijun Yuan, Zhan Li, Ruting Deng, Jintao Zeng, Pulkit Mahajan, Sahaj Mistry, Shreyas Chatterjee, Vinit Jakhetiya, Badri N. Subudhi, Sunil Prasad Jaiswal, Zhao Zhang 0001, Huan Zheng, Suiyi Zhao, Yangcheng Gao, Yanyan Wei, Bo Wang, Gen Li, Aijin Li, Lei Sun, Ke Chen, Congling Tang, Yunzhe Li, Jun Chen, Yuan-Chun Chiang, Yi-Chung Chen, Zhi-Kai Huang, Hao-Hsiang Yang, I-Hsiang Chen, Sy-Yen Kuo, Yiheng Wang, Gang Zhu, Xingyi Yang, Songhua Liu, Yongcheng Jing, Xingyu Hu, Jianwen Song, Changming Sun, Arcot Sowmya, Seung-Ho Park, Xiaoyan Lei, Jingchao Wang, Chenbo Zhai, Yufei Zhang, Weifeng Cao, Wenlong Zhang. 1346-1372 [doi]
- DistgEPIT: Enhanced Disparity Learning for Light Field Image Super-ResolutionKai Jin, Angulia Yang, Zeqiang Wei, Sha Guo, Mingzhi Gao, Xiuzhuang Zhou. 1373-1383 [doi]
- NTIRE 2023 Challenge on HR Depth from Images of Specular and Transparent SurfacesPierluigi Zama Ramirez, Fabio Tosi, Luigi di Stefano, Radu Timofte, Alex Costanzino, Matteo Poggi, Samuele Salti, Stefano Mattoccia, Jun Shi, Dafeng Zhang, Yong A, Yixiang Jin, Dingzhe Li, Chao Li, Zhiwen Liu, Qi Zhang, Yixing Wang, Shi Yin. 1384-1395 [doi]
- Cross-View Hierarchy Network for Stereo Image Super-ResolutionWenbin Zou, Hongxia Gao, Liang Chen, Yunchen Zhang, Mingchao Jiang, ZhongXin Yu, Ming Tan. 1396-1405 [doi]
- A Data-Centric Solution to NonHomogeneous Dehazing via Vision TransformerYangyi Liu, Huan Liu, Liangyan Li, Zijun Wu, Jun Chen. 1406-1415 [doi]
- Stereo Cross Global Learnable Attention Module for Stereo Image Super-ResolutionYuanbo Zhou, Yuyang Xue, Wei Deng, Ruofeng Nie, Jiajun Zhang, Jiaqi Pu, Qinquan Gao, Junlin Lan, Tong Tong 0001. 1416-1425 [doi]
- SC-NAFSSR: Perceptual-Oriented Stereo Image Super-Resolution Using Stereo Consistency Guided NAFSSRZidian Qiu, Zongyao He, Zhihao Zhan, Zilin Pan, Xingyuan Xian, Zhi Jin. 1426-1435 [doi]
- TSRFormer: Transformer Based Two-stage Refinement for Single Image Shadow RemovalHua-En Chang, Chia-Hsuan Hsieh, Hao-Hsiang Yang, I-Hsiang Chen, Yi-Chung Chen, Yu-Chiang Frank Wang, Zhi-Kai Huang, Wei-Ting Chen, Sy-Yen Kuo. 1436-1446 [doi]
- Semantic Guidance Learning for High-Resolution Non-homogeneous DehazingHao-Hsiang Yang, I-Hsiang Chen, Chia-Hsuan Hsieh, Hua-En Chang, Yuan-Chun Chiang, Yi-Chung Chen, Zhi-Kai Huang, Wei-Ting Chen, Sy-Yen Kuo. 1447-1455 [doi]
- Back to the future: a night photography rendering ISP without deep learningSimone Zini, Claudio Rota, Marco Buzzelli, Simone Bianco 0001, Raimondo Schettini. 1465-1473 [doi]
- VDPVE: VQA Dataset for Perceptual Video EnhancementYixuan Gao, Yuqin Cao, Tengchuan Kou, Wei Sun 0029, Yunlong Dong, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai. 1474-1483 [doi]
- A Simple Transformer-style Network for Lightweight Image Super-resolutionGaras Gendy, Nabil Sabor, Jingchao Hou, Guanghui He. 1484-1494 [doi]
- Efficient Deep Models for Real-Time 4K Image Super-Resolution. NTIRE 2023 Benchmark and ReportMarcos V. Conde, Eduard Zamfir, Radu Timofte, Daniel Motilla, Cen Liu, Zexin Zhang, Yunbo Peng, Yue Lin, Jiaming Guo, Xueyi Zou, Yuyi Chen, Yi Liu, Jia Hao, Youliang Yan, Yuanfan Zhang, Gen Li, Lei Sun, Lingshun Kong, Haoran Bai, Jinshan Pan, Jiangxin Dong, Jinhui Tang 0001, Mustafa Ayazoglu, Bahri Batuhan Bilecen, Mingxi Li, Yuhang Zhang, Xianjun Fan, Yankai Sheng, Long Sun, Zibin Liu, Weiran Gou, Shaoqing Li, Ziyao Yi, Yan Xiang, Dehui Kong, Ke Xu, Ganzorig Gankhuyag, Kihwan Yoon, Jin Zhang, Gaocheng Yu, Feng Zhang, Hongbin Wang, Zhou Zhou, Jiahao Chao, Hongfan Gao, Jiali Gong, Zhengfeng Yang, Zhenbing Zeng, Chengpeng Chen, Zichao Guo, Anjin Park, Yuqing Liu, Qi Jia, Hongyuan Yu, Xuanwu Yin, Dongyang Zhang, Ting Fu, Zhengxue Cheng, Shiai Zhu, Dajiang Zhou, Weichen Yu, Lin Ge, Jiahua Dong, Yajun Zou, Zhuoyuan Wu, Binnan Han, Xiaolin Zhang, Heng Zhang, Ben Shao, Shaolong Zheng, Daheng Yin, Baijun Chen, Mengyang Liu, Marian-Sergiu Nistor, Yi-Chung Chen, Zhi-Kai Huang, Yuan-Chun Chiang, Wei-Ting Chen, Hao-Hsiang Yang, Hua-En Chang, I-Hsiang Chen, Chia-Hsuan Hsieh, Sy-Yen Kuo, Tu Vo, Qingsen Yan, Yun Zhu, Jinqiu Su, Yanning Zhang, Cheng Zhang, Jiaying Luo, Youngsun Cho, Nakyung Lee, Kunlong Zuo. 1495-1521 [doi]
- Towards Real-Time 4K Image Super-ResolutionEduard Zamfir, Marcos V. Conde, Radu Timofte. 1522-1532 [doi]
- Quality assessment of enhanced videos guided by aesthetics and technical quality attributesMirko Agarla, Luigi Celona, Claudio Rota, Raimondo Schettini. 1533-1541 [doi]
- BokehOrNot: Transforming Bokeh Effect with Image Transformer and Lens Metadata EmbeddingZhihao Yang, Wenyi Lian, Siyuan Lai. 1542-1550 [doi]
- NTIRE 2023 Quality Assessment of Video Enhancement ChallengeXiaohong Liu, Xiongkuo Min, Wei Sun, Yulun Zhang, Kai Zhang, Radu Timofte, Guangtao Zhai, Yixuan Gao, Yuqin Cao, Tengchuan Kou, Yunlong Dong, Ziheng Jia, Yilin Li, Kai Zhao, Heng Cong, Hang Shi, Zhiliang Ma, Mirko Agarla, Zhiwei Huang, Hongye Liu, Ironhead Chuang, Haotian Fan, Shiqi Zhou, Yu Lai, Wenqi Wang, Haoning Wu, Chunzheng Zhu, Shiling Zhao, Hanene Brachemi Meftah, Tengfei Shi, Azadeh Mansouri. 1551-1569 [doi]
- NTIRE 2023 Video Colorization ChallengeXiaoyang Kang, Xianhui Lin, Kai Zhang, Zheng Hui, Wangmeng Xiang, Jun-Yan He, Xiaoming Li, Peiran Ren, Xuansong Xie, Radu Timofte, Yixin Yang, Jinshan Pan, Zhong Zheng, Peng Qiyan, Jiangxin Zhang, Jinhui Dong, Jinjing Tan, Chi-Chen Lin, Lin Qipei Li, Qirong Liang, Ruipeng Gang, Xiaofeng Liu, Shuang Feng, Shuai Liu, Hao Wang, Chaoyu Feng, Furui Bai, Yuqian Zhang, Guangqi Shao, Xiaotao Wang, Lei Lei, Siqi Chen, Yu Zhang, Hanning Xu, Zheyuan Liu, Zhao Zhang 0001, Yan Luo, Zhichao Zuo. 1570-1581 [doi]
- AsConvSR: Fast and Lightweight Super-Resolution Network with Assembled ConvolutionsJiaming Guo, Xueyi Zou, Yuyi Chen, Yi Liu, Jia Hao, Jianzhuang Liu, Youliang Yan. 1582-1592 [doi]
- Mixer-based Local Residual Network for Lightweight Image Super-resolutionGaras Gendy, Nabil Sabor, Jingchao Hou, Guanghui He. 1593-1602 [doi]
- NAFBET: Bokeh Effect Transformation with Parameter Analysis Block based on NAFNetXiangyu Kong, Fan Wang, Dafeng Zhang, Jinlong Wu, Zikun Liu. 1603-1612 [doi]
- SB-VQA: A Stack-Based Video Quality Assessment Framework for Video EnhancementDing-Jiun Huang, Yu-Ting Kao, Tieh-Hung Chuang, Ya-Chun Tsai, Jing-Kai Lou, Shuen-Huei Guan. 1613-1622 [doi]
- Bicubic++: Slim, Slimmer, Slimmest Designing an Industry-Grade Super-Resolution NetworkBahri Batuhan Bilecen, Mustafa Ayazoglu. 1623-1332 [doi]
- Efficient Multi-Lens Bokeh Effect Rendering and TransformationTim Seizinger, Marcos V. Conde, Manuel Kolmet, Tom E. Bishop, Radu Timofte. 1633-1642 [doi]
- Lens-to-Lens Bokeh Effect Transformation. NTIRE 2023 Challenge ReportMarcos V. Conde, Manuel Kolmet, Tim Seizinger, Tom E. Bishop, Radu Timofte, Xiangyu Kong, Dafeng Zhang, Jinlong Wu, Fan Wang, Juewen Peng, Zhiyu Pan, Chengxin Liu, Xianrui Luo, Huiqiang Sun, Liao Shen, Zhiguo Cao 0001, Ke Xian, Chaowei Liu, Zigeng Chen, Xingyi Yang, Songhua Liu, Yongcheng Jing, Michael Bi Mi, Xinchao Wang, Zhihao Yang, Wenyi Lian, Siyuan Lai, Haichuan Zhang, Trung Hoang, Amirsaeed Yazdani, Vishal Monga, Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön, Yuxuan Zhao, Baoliang Chen, Yiqing Xu, JiXiangNiu. 1643-1659 [doi]
- Multi-level Dispersion Residual Network for Efficient Image Super-ResolutionYanyu Mao, Nihao Zhang, Qian Wang 0019, Bendu Bai, Wanying Bai, Haonan Fang, Peng Liu, Mingyue Li, Shengbo Yan. 1660-1669 [doi]
- TransER: Hybrid Model and Ensemble-based Sequential Learning for Non-homogenous DehazingTrung Hoang, Haichuan Zhang, Amirsaeed Yazdani, Vishal Monga. 1670-1679 [doi]
- Refusion: Enabling Large-Size Realistic Image Restoration with Latent-Space Diffusion ModelsZiwei Luo, Fredrik K. Gustafsson, Zheng Zhao 0004, Jens Sjölund, Thomas B. Schön. 1680-1691 [doi]
- DIPNet: Efficiency Distillation and Iterative Pruning for Image Super-ResolutionLei Yu, Xinpeng Li, Youwei Li, Ting Jiang, Qi Wu, Haoqiang Fan, Shuaicheng Liu. 1692-1701 [doi]
- Hybrid Transformer and CNN Attention Network for Stereo Image Super-resolutionMing Cheng, Haoyu Ma, Qiufang Ma, Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Xuhan Sheng, Shijie Zhao, Junlin Li, Li Zhang 0006. 1702-1711 [doi]
- Reparameterized Residual Feature Network For Lightweight Image Super-ResolutionWeijian Deng, Hongjie Yuan, Lunhui Deng, Zengtong Lu. 1712-1721 [doi]
- RTTLC: Video Colorization with Restored Transformer and Test-time Local ConverterJinjing Li, Qirong Liang, Qipei Li, Ruipeng Gang, Ji Fang, Chi-Chen Lin, Shuang Feng, Xiaofeng Liu. 1722-1730 [doi]
- NTIRE 2023 Challenge on 360° Omnidirectional Image and Video Super-Resolution: Datasets, Methods and ResultsMingdeng Cao, Chong Mou, Fanghua Yu, Xintao Wang, Yinqiang Zheng, Jian Zhang, Chao Dong, Gen Li, Ying Shan, Radu Timofte, Xiaopeng Sun, Weiqi Li, Zhenyu Zhang, Xuhan Sheng, Bin Chen, Haoyu Ma, Ming Cheng, Shijie Zhao, Wanwan Cui, Tianyu Xu, Chunyang Li, Long Bao, Heng Sun, Huaibo Huang, Xiaoqiang Zhou, Yuang Ai, Ran He, Renlong Wu, Yi Yang, Zhilu Zhang, Shuohao Zhang, Junyi Li, Yunjin Chen, Dongwei Ren, Wangmeng Zuo, Qian Wang, Hao-Hsiang Yang, Yi-Chung Chen, Zhi-Kai Huang, Wei-Ting Chen, Yuan-Chun Chiang, Hua-En Chang, I-Hsiang Chen, Chia-Hsuan Hsieh, Sy-Yen Kuo, Zebin Zhang, Jiaqi Zhang, Yuhui Wang, Shuhao Cui, Junshi Huang, Li Zhu, Shuman Tian, Wei Yu, Bingchun Luo. 1731-1745 [doi]
- Lightweight Real-Time Image Super-Resolution Network for 4K ImagesGanzorig Gankhuyag, Kihwan Yoon, Jinman Park, Haeng Seon Son, Kyoungwon Min. 1746-1755 [doi]
- Attention Retractable Frequency Fusion Transformer for Image Super ResolutionQiang Zhu, Pengfei Li, Qianhui Li. 1756-1763 [doi]
- SwinFSR: Stereo Image Super-Resolution using SwinIR and Frequency Domain KnowledgeKe Chen, Liangyan Li, Huan Liu, Yunzhe Li, Congling Tang, Jun Chen. 1764-1774 [doi]
- LSDIR: A Large Scale Dataset for Image RestorationYawei Li, Kai Zhang 0008, Jingyun Liang, Jiezhang Cao, Ce Liu, Rui Gong, Yulun Zhang, Hao Tang 0005, Yun Liu, Denis Demandolx, Rakesh Ranjan, Radu Timofte, Luc Van Gool. 1775-1787 [doi]
- NTIRE 2023 Image Shadow Removal Challenge ReportFlorin-Alexandru Vasluianu, Tim Seizinger, Radu Timofte, Shuhao Cui, Junshi Huang, Shuman Tian, Mingyuan Fan, Jiaqi Zhang, Li Zhu, Xiaoming Wei, Xiaolin Wei, Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön, Xiaoyi Dong, Xi Sheryl Zhang, Chenghua Li, Cong Leng, Woon-Ha Yeo, Wang-Taek Oh, Yeoreum Lee, Han-Cheol Ryu, Jinting Luo, Chengzhi Jiang, Mingyan Han, Qi Wu, Wenjie Lin, Lei Yu, Xinpeng Li, Ting Jiang, Haoqiang Fan, Shuaicheng Liu, Shuning Xu, Binbin Song, Xiangyu Chen, Shile Zhang, Jiantao Zhou, Zhao Zhang 0001, Suiyi Zhao, Huan Zheng, Yangcheng Gao, Yanyan Wei, Bo Wang, Jiahuan Ren, Yan Luo, Yuki Kondo, Riku Miyata, Fuma Yasue, Taito Naruki, Norimichi Ukita, Hua-En Chang, Hao-Hsiang Yang, Yi-Chung Chen, Yuan-Chun Chiang, Zhi-Kai Huang, Wei-Ting Chen, I-Hsiang Chen, Chia-Hsuan Hsieh, Sy-Yen Kuo, Li Xianwei, Huiyuan Fu, Chunlin Liu, Huadong Ma, Binglan Fu, Huiming He, Mengjia Wang, Wenxuan She, Yu Liu, Sabari Nathan, Priya Kansal, Zhongjian Zhang, Huabin Yang, Yan Wang, Yanru Zhang, Shruti S. Phutke, Ashutosh Kulkarni, MD Raqib Khan, Subrahmanyam Murala, Santosh Kumar Vipparthi, Heng Ye, Zixi Liu, Xingyi Yang, Songhua Liu, Yinwei Wu, Yongcheng Jing, Qianhao Yu, Naishan Zheng, Jie Huang 0017, Yuhang Long, Mingde Yao, Feng Zhao, Bowen Zhao, Nan Ye, Ning Shen, Yanpeng Cao, Tong Xiong, Weiran Xia, Dingwen Li, Shuchen Xia. 1788-1807 [doi]
- NTIRE 2023 HR NonHomogeneous Dehazing Challenge ReportCodruta O. Ancuti, Cosmin Ancuti, Florin-Alexandru Vasluianu, Radu Timofte, Han Zhou, Wei Dong, Yangyi Liu, Jun Chen, Huan Liu, Liangyan Li, Zijun Wu, Yubo Dong, Yuyan Li, Tian Qiu, Yu He, Yonghong Lu, Yinwei Wu, Zhenxiang Jiang, Songhua Liu, Xingyi Yang, Yongcheng Jing, Bilel Benjdira, Anas M. Ali, Anis Koubaa, Hao-Hsiang Yang, I-Hsiang Chen, Wei-Ting Chen, Zhi-Kai Huang, Yi-Chung Chen, Chia-Hsuan Hsieh, Hua-En Chang, Yuan-Chun Chiang, Sy-Yen Kuo, Yu Guo, Yuan Gao, Ryan Wen Liu, Yuxu Lu, Jingxiang Qu, Shengfeng He, Wenqi Ren, Trung Hoang, Haichuan Zhang, Amirsaeed Yazdani, Vishal Monga, Lehan Yang, Alex Jiahao Wu, Tiancheng Mai, Xiaofeng Cong, Xuemeng Yin, Xuefei Yin, Hazim Emad, Ahmed Abdallah, Yahya Yasser, Dalia Elshahat, Esraa Elbaz, Zhan Li, Wenqing Kuang, Ziwei Luo, Fredrik K. Gustafsson, Zheng Zhao, Jens Sjölund, Thomas B. Schön, Zhao Zhang, Yanyan Wei, Junhu Wang, Suiyi Zhao, Huan Zheng, Jin Guo, Yangfan Sun, Tianli Liu, Dejun Hao, Kui Jiang, Anjali Sarvaiya, Kalpesh Prajapati, Ratnadeep Patra, Pragnesh Barik, Chaitanya Rathod, Kishor P. Upla, Kiran B. Raja, Raghavendra Ramachandra, Christoph Busch 0001. 1808-1825 [doi]
- WSRD: A Novel Benchmark for High Resolution Image Shadow RemovalFlorin-Alexandru Vasluianu, Tim Seizinger, Radu Timofte. 1826-1835 [doi]
- Temporal Consistent Automatic Video Colorization via Semantic CorrespondenceYu Zhang, Siqi Chen, Mingdao Wang, Xianlin Zhang, Chuang Zhu, Yue Zhang, Xueming Li. 1836-1845 [doi]
- Video Quality Assessment Based on Swin Transformer with Spatio-Temporal Feature Fusion and Data AugmentationWei Wu, Shuming Hu, PengXiang Xiao, Sibin Deng, Yilin Li, Ying Chen, Kai Li. 1846-1854 [doi]
- Streamlined Global and Local Features Combinator (SGLC) for High Resolution Image DehazingBilel Benjdira, Anas M. Ali, Anis Koubaa. 1855-1864 [doi]
- NTIRE 2023 Challenge on Image Super-Resolution (×4): Methods and ResultsYulun Zhang, Kai Zhang, Zheng Chen, Yawei Li, Radu Timofte, Junpei Zhang, Kexin Zhang 0003, Rui Peng, Yanbiao Ma, Licheng Jia, Huaibo Huang, Xiaoqiang Zhou, Yuang Ai, Ran He, Yajun Qiu, Qiang Zhu, Pengfei Li, Qianhui Li, Shuyuan Zhu, Dafeng Zhang, Jia Li, Fan Wang, Chunmiao Li, Taehyung Kim, Jungkeong Kil, Eon Kim, Yeonseung Yu, Beomyeol Lee, Subin Lee, Seokjae Lim, Somi Chae, Heungjun Choi, Zhi-Kai Huang, YiChung Chen, Yuan-Chun Chiang, Hao-Hsiang Yang, Wei-Ting Chen, Hua-En Chang, I-Hsiang Chen, Chia-Hsuan Hsieh, Sy-Yen Kuo, Ui-Jin Choi, Marcos V. Conde, Sunder Ali Khowaja, Jiseok Yoon, Ik Hyun Lee, Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He, Zhao Zhang 0001, Baiang Li, Huan Zheng, Suiyi Zhao, Yangcheng Gao, Yanyan Wei, Jiahuan Ren, Jiayu Wei, Yanfeng Li, Jia Sun, Zhanyi Cheng, Zhiyuan Li, Xu Yao, Xinyi Wang, Danxu Li, Xuan Cui, Jun Cao, Cheng Li, Jianbin Zheng, Anjali Sarvaiya, Kalpesh Prajapati, Ratnadeep Patra, Pragnesh Barik, Chaitanya Rathod, Kishor P. Upla, Kiran B. Raja, Raghavendra Ramachandra, Christoph Busch 0001. 1865-1884 [doi]
- SCANet: Self-Paced Semi-Curricular Attention Network for Non-Homogeneous Image DehazingYu Guo, Yuan Gao, Ryan Wen Liu, Yuxu Lu, Jingxiang Qu, Shengfeng He, Wenqi Ren. 1885-1894 [doi]
- Breaking Through the Haze: An Advanced Non-Homogeneous Dehazing Method based on Fast Fourier Convolution and ConvNeXtHan Zhou, Wei Dong, Yangyi Liu, Jun Chen. 1895-1904 [doi]
- NTIRE 2023 Challenge on Image Denoising: Methods and ResultsYawei Li, Yulun Zhang, Radu Timofte, Luc Van Gool, Zhijun Tu, Kunpeng Du, Hailing Wang, Hanting Chen, Wei Li, Xiaofei Wang, Jie Hu, Yunhe Wang, Xiangyu Kong, Jinlong Wu, Dafeng Zhang, Jianxing Zhang, Shuai Liu, Furui Bai, Chaoyu Feng, Hao Wang, Yuqian Zhang, Guangqi Shao, Xiaotao Wang, Lei Lei, Rongjian Xu, Zhilu Zhang, Yunjin Chen, Dongwei Ren, Wangmeng Zuo, Qi Wu, Mingyan Han, Shen Cheng, HaiPeng Li, Ting Jiang, Chengzhi Jiang, Xinpeng Li, Jinting Luo, Wenjie Lin, Lei Yu, Haoqiang Fan, Shuaicheng Liu, Aditya Arora, Syed Waqas Zamir, Javier Vazquez-Corral, Konstantinos G. Derpanis, Michael S. Brown, Hao Li, Zhihao Zhao, Jinshan Pan, Jiangxin Dong, Jinhui Tang 0001, Bo Yang, Jingxiang Chen, Chenghua Li, Xi Zhang, Zhao Zhang 0001, Jiahuan Ren, Zhicheng Ji, Kang Miao, Suiyi Zhao, Huan Zheng, Yanyan Wei, Kangliang Liu, Xiangcheng Du, Sijie Liu, Yingbin Zheng, Xingjiao Wu, Cheng Jin, Rajeev Irny, Sriharsha Koundinya, Vighnesh Kamath, Gaurav Khandelwal, Sunder Ali Khowaja, Jiseok Yoon, Ik Hyun Lee, Shijie Chen, Chengqiang Zhao, Huabin Yang, Zhongjian Zhang, Junjia Huang, Yanru Zhang. 1905-1921 [doi]
- NTIRE 2023 Challenge on Efficient Super-Resolution: Methods and ResultsYawei Li, Yulun Zhang, Radu Timofte, Luc Van Gool, Lei Yu, Youwei Li, Xinpeng Li, Ting Jiang, Qi Wu, Mingyan Han, Wenjie Lin, Chengzhi Jiang, Jinting Luo, Haoqiang Fan, Shuaicheng Liu, Yucong Wang, Minjie Cai, Mingxi Li, Yuhang Zhang, Xianjun Fan, Yankai Sheng, Yanyu Mao, Nihao Zhang, Qian Wang, Mingjun Zheng, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang 0001, Zhongbao Yang, Yan Wang, Erlin Pan, Qixuan Cai, Xinan Dai, Magauiya Zhussip, Nikolay Kalyazin, Dmitry Vyal, Xueyi Zou, Youliang Yan, Heaseo Chung, Jin Zhang, Gaocheng Yu, Feng Zhang, Hongbin Wang, Bohao Liao, Zhibo Du, Yu-Liang Wu, Gege Shi, Long Peng, Yang Wang, Yang Cao, Zhengjun Zha, Zhi-Kai Huang, Yi-Chung Chen, Yuan-Chun Chiang, Hao-Hsiang Yang, Wei-Ting Chen, Hua-En Chang, I-Hsiang Chen, Chia-Hsuan Hsieh, Sy-Yen Kuo, Xin Liu, Jiahao Pan, Hongyuan Yu, Weichen Yu, Lin Ge, Jiahua Dong, Yajun Zou, Zhuoyuan Wu, Binnan Han, Xiaolin Zhang, Heng Zhang, Xuanwu Yin, Kunlong Zuo, Weijian Deng, Hongjie Yuan, Zengtong Lu, Mingyu Ouyang, Wenzhuo Ma, Nian Liu, Hanyou Zheng, Yuantong Zhang, Junxi Zhang, Zhenzhong Chen, Garas Gendy, Nabil Sabor, Jingchao Hou, Guanghui He, Yurui Zhu, Xi Wang, Xueyang Fu, Zheng-Jun Zha, Daheng Yin, Mengyang Liu, Baijun Chen, Ao Li, Lei Luo, Kangjun Jin, Ce Zhu, Xiaoming Zhang, Chengxing Xie, Linze Li, Haiteng Meng, Tianlin Zhang, Tianrui Li 0001, Xiaole Zhao, Zhao Zhang, Baiang Li, Huan Zheng, Suiyi Zhao, Yangcheng Gao, Jiahuan Ren, Kang Hu, Jingpeng Shi, Zhijian Wu, Dingjiang Huang, Jinchen Zhu, Hui Li, Qianru Xv, Tianle Liu, Gang Wu, Junpeng Jiang, Xianming Liu, Junjun Jiang, Mingjian Zhang, Shizhuang Weng, Jing Hu, Chengxu Wu, Qinrui Fan, Chengming Feng, Ziwei Luo, Shu Hu, Siwei Lyu, Xi Wu, Xin Wang. 1922-1960 [doi]
- Spatial-Angular Multi-Scale Mechanism for Light Field Spatial Super-ResolutionChen Gao, Youfang Lin, Song Chang, Shuo Zhang 0003. 1961-1970 [doi]
- A Single Residual Network with ESA Modules and DistillationYucong Wang, Minjie Cai. 1971-1981 [doi]
- NTIRE 2023 Challenge on Night Photography RenderingAlina Shutova, Egor I. Ershov, Georgy Perevozchikov, Ivan Ermakov, Nikola Banic, Radu Timofte, Richard Collins, Maria Efimova, Arseniy P. Terekhin, Simone Zini, Claudio Rota, Marco Buzzelli, Simone Bianco 0001, Raimondo Schettini, Chunxia Lei, Tingniao Wang, Song Wang, Shuai Liu 0009, Chaoyu Feng, Guangqi Shao, Hao Wang, Xiaotao Wang, Lei Lei, Lu Xu, Chao Zhang, Yasi Wang, Jin Guo, Yangfan Sun, Tianli Liu, Hao Dejun, Furkan Kinli, Baris Özcan, Furkan Kiraç, Hyerin Chung, Nakyung Lee, Sungkeun Kwak, Marcos V. Conde, Tim Seizinger, Florin-Alexandru Vasluianu, Omar Elezabi, Chia-Hsuan Hsieh, Wei-Ting Chen, Hao-Hsiang Yang, Zhi-Kai Huang, Hua-En Chang, I-Hsiang Chen, Yi-Chung Chen, Yuan-Chun Chiang. 1982-1993 [doi]
- CrisisHateMM: Multimodal Analysis of Directed and Undirected Hate Speech in Text-Embedded Images from Russia-Ukraine ConflictAashish Bhandari, Siddhant Bikram Shah, Surendrabikram Thapa, Usman Naseem, Mehwish Nasim. 1994-2003 [doi]
- Prioritised Moderation for Online AdvertisingPhanideep Gampa, Akash Anil Valsangkar, Shailesh Choubey, Pooja A. 2004-2012 [doi]
- L1BSR: Exploiting Detector Overlap for Self-Supervised Single-Image Super-Resolution of Sentinel-2 L1B ImageryNgoc-Long Nguyen, Jérémy Anger, Axel Davy, Pablo Arias 0001, Gabriele Facciolo. 2013-2023 [doi]
- APPLeNet: Visual Attention Parameterized Prompt Learning for Few-Shot Remote Sensing Image Generalization using CLIPMainak Singha, Ankit Jha, Bhupendra Solanki, Shirsha Bose, Biplab Banerjee. 2024-2034 [doi]
- Multi-Date Earth Observation NeRF: The Detail Is in the ShadowsRoger Marí, Gabriele Facciolo, Thibaud Ehret. 2035-2045 [doi]
- Cascaded Zoom-in Detector for High Resolution Aerial ImagesAkhil Meethal, Eric Granger, Marco Pedersoli. 2046-2055 [doi]
- Handheld Burst Super-Resolution Meets Multi-Exposure Satellite ImageryJamy Lafenetre, Ngoc-Long Nguyen, Gabriele Facciolo, Thomas Eboli. 2056-2064 [doi]
- Solar Irradiance Anticipative TransformerThomas M. Mercier, Tasmiat Rahman, Amin Sabet. 2065-2074 [doi]
- GeoMultiTaskNet: remote sensing unsupervised domain adaptation using geographical coordinatesValerio Marsocci, Nicolas Gonthier, Anatol Garioud, Simone Scardapane, Clément Mallet. 2075-2085 [doi]
- UnCRtainTS: Uncertainty Quantification for Cloud Removal in Optical Satellite Time SeriesPatrick Ebel 0002, Vivien Sainte Fare Garnot, Michael Schmitt 0003, Jan Dirk Wegner, Xiao Xiang Zhu. 2086-2096 [doi]
- DeepSim-Nets: Deep Similarity Networks for Stereo Image MatchingMohamed Ali Chebbi, Ewelina Rupnik, Marc Pierrot Deseilligny, Paul Lopes. 2097-2105 [doi]
- Deep unfolding for hyper sharpening using a high-frequency injection moduleJamila Mifdal, Marc Tomás-Cruz, Alessandro Sebastianelli, Bartomeu Coll, Joan Duran. 2106-2115 [doi]
- Seasonal Domain Shift in the Global South: Dataset and Deep Features AnalysisGeorgios Voulgaris, Andy Philippides, Jonathan Dolley, Jeremy Reffin, Fiona Marshall, Novi Quadrianto. 2116-2124 [doi]
- Comprehensive quality assessment of optical satellite imagery using weakly supervised video learningValerie Pasquarella, Christopher F. Brown, Wanda Czerwinski, William Rucklidge. 2125-2135 [doi]
- Multi-Modal Multi-Objective Contrastive Learning for Sentinel-1/2 ImageryJonathan Prexl, Michael Schmitt 0003. 2136-2144 [doi]
- Sparse Multimodal Vision Transformer for Weakly Supervised Semantic SegmentationJoëlle Hanna, Michael Mommert, Damian Borth. 2145-2154 [doi]
- Inferring the past: a combined CNN-LSTM deep learning framework to fuse satellites for historical inundation mappingJonathan Giezendanner, Rohit Mukherjee, Matthew Purri, Mitchell Thomas, Max Mauerman, A. K. M. Saiful Islam, Beth Tellman. 2155-2165 [doi]
- Masked Vision Transformers for Hyperspectral Image ClassificationLinus Scheibenreif, Michael Mommert, Damian Borth. 2166-2176 [doi]
- VideoMatt: A Simple Baseline for Accessible Real-Time Video MattingJiachen Li 0003, Marianna Ohanyan, Vidit Goel, Shant Navasardyan, Yunchao Wei, Humphrey Shi. 2177-2186 [doi]
- QuickSRNet: Plain Single-Image Super-Resolution Architecture for Faster Inference on Mobile PlatformsGuillaume Berger, Manik Dhingra, Antoine Mercier 0005, Yashesh Savani, Sunny Panchal, Fatih Porikli. 2187-2196 [doi]
- Real-time Segmenting Human Portrait at AnywhereRuifeng Yuan, Yuhao Cheng, Yiqiang Yan, Haiyan Liu. 2197-2203 [doi]
- High-efficiency Device-Cloud Collaborative Transformer ModelPenghao Jiang, Ke Xin, Chunxi Li, Yinsi Zhou. 2204-2210 [doi]
- MobileViG: Graph-Based Sparse Attention for Mobile Vision ApplicationsMustafa Munir, William Avery, Radu Marculescu. 2211-2219 [doi]
- DIFT: Dynamic Iterative Field Transforms for Memory Efficient Optical FlowRisheek Garrepalli, Jisoo Jeong, Rajeswaran C. Ravindran, Jamie Menjay Lin, Fatih Porikli. 2220-2229 [doi]
- PerfHD: Efficient ViT Architecture Performance Ranking using Hyperdimensional ComputingDongning Ma, Pengfei Zhao, Xun Jiao. 2230-2237 [doi]
- AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary DetectionWentao Zhu 0001, Yufang Huang, Xiufeng Xie, Wenxian Liu, Jincan Deng, Debing Zhang, Zhangyang Wang, Ji Liu 0002. 2238-2247 [doi]
- Pareto-aware Neural Architecture Generation for Diverse Computational BudgetsYong Guo, Yaofo Chen, Yin Zheng, Qi Chen, Peilin Zhao, JunZhou Huang, Jian Chen, Mingkui Tan. 2248-2258 [doi]
- Exploring the Potential of Neural Dataset SearchRyosuke Yamada, Risa Shinoda, Hirokatsu Kataoka. 2259-2266 [doi]
- 2-Aug: Adaptive Automated Data AugmentationLujun Li, Anggeng Li. 2267-2274 [doi]
- Hardware-aware NAS by Genetic Optimisation with a Design Space Exploration SimulatorLotte Hendrickx, Arne Symons, Wiebe Van Ranst, Marian Verhelst, Toon Goedemé. 2275-2283 [doi]
- Systematic Architectural Design of Scale Transformed Attention Condenser DNNs via Multi-Scale Class Representational Response Similarity AnalysisAndrew Hryniowski, Alexander Wong. 2284-2292 [doi]
- Fast GraspNeXt: A Fast Self-Attention Neural Network Architecture for Multi-task Learning in Computer Vision Tasks for Robotic Grasping on the EdgeAlexander Wong, Yifan Wu, Saad Abbasi, Saeejith Nair, Yuhao Chen, Mohammad Javad Shafiee. 2293-2297 [doi]
- Certified Adversarial Robustness Within Multiple Perturbation BoundsSoumalya Nandi, Sravanti Addepalli, Harsh Rangwani, R. Venkatesh Babu. 2298-2305 [doi]
- Adversarial Defense in Aerial DetectionYuwei Chen, Shiyong Chu. 2306-2313 [doi]
- Investigating Catastrophic Overfitting in Fast Adversarial Training: A Self-fitting PerspectiveZhengbao He, Tao Li, Sizhe Chen, Xiaolin Huang. 2314-2321 [doi]
- Universal Watermark Vaccine: Universal Adversarial Perturbations for Watermark ProtectionJianbo Chen, Xinwei Liu, Siyuan Liang, Xiaojun Jia, Yuan Xun. 2322-2329 [doi]
- Robustness with Query-efficient Adversarial Attack using Reinforcement LearningSoumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Sahand Ghorbanpour, Vineet Gundecha, Antonio Guillen, Ricardo Luna Gutierrez, Avisek Naug. 2330-2337 [doi]
- Don't FREAK Out: A Frequency-Inspired Approach to Detecting Backdoor Poisoned Samples in DNNsHasan Abed Al Kader Hammoud, Adel Bibi, Philip H. S. Torr, Bernard Ghanem. 2338-2345 [doi]
- Exploring Diversified Adversarial Robustness in Neural Networks via Robust Mode ConnectivityRen Wang 0008, Yuxuan Li, Sijia Liu 0001. 2346-2352 [doi]
- How many dimensions are required to find an adversarial example?Charles Godfrey, Henry Kvinge, Elise Bishoff, Myles Mckay, Davis Brown, Tim Doster, Eleanor Byler. 2353-2360 [doi]
- An Extended Study of Human-like Behavior under Adversarial TrainingPaul Gavrikov, Janis Keuper, Margret Keuper. 2361-2368 [doi]
- Deep Convolutional Sparse Coding Networks for Interpretable Image FusionZixiang Zhao, Jiang-She Zhang 0001, Haowen Bai, Yicheng Wang, Yukun Cui, Lilun Deng, Kai Sun 0007, Chunxia Zhang 0002, Junmin Liu, Shuang Xu. 2369-2377 [doi]
- Generating Adversarial Samples in Mini-Batches May Be Detrimental To Adversarial RobustnessTimothy Redgrave, Colton Crum. 2378-2384 [doi]
- A Pilot Study of Query-Free Adversarial Attack against Stable DiffusionHaomin Zhuang, Yihua Zhang, Sijia Liu 0001. 2385-2392 [doi]
- Implications of Solution Patterns on Adversarial RobustnessHengyue Liang, Buyun Liang, Ju Sun, Ying Cui, Tim Mitchell. 2393-2400 [doi]
- Are Labels Needed for Incremental Instance Learning?Mert Kilickaya, Joaquin Vanschoren. 2401-2409 [doi]
- A Closer Look at Rehearsal-Free Continual LearningJames Seale Smith, Junjiao Tian, Shaunak Halbe, Yen-Chang Hsu, Zsolt Kira. 2410-2420 [doi]
- 3Former: Debiased Dual Distilled Transformer for Incremental LearningAbdelrahman Mohamed, Rushali Grandhe, K. J. Joseph, Salman H. Khan 0001, Fahad Shahbaz Khan. 2421-2430 [doi]
- How Efficient Are Today's Continual Learning Algorithms?Md Yousuf Harun, Jhair Gallardo, Tyler L. Hayes, Christopher Kanan. 2431-2436 [doi]
- Online Distillation with Continual Learning for Cyclic Domain ShiftsJoachim Houyon, Anthony Cioppa, Yasir Ghunaim, Motasem Alfarra, Anaïs Halin, Maxim Henry, Bernard Ghanem, Marc Van Droogenbroeck. 2437-2446 [doi]
- Continual Learning for LiDAR Semantic Segmentation: Class-Incremental and Coarse-to-Fine strategies on Sparse DataElena Camuffo, Simone Milani. 2447-2456 [doi]
- Continual Domain Adaptation through Pruning-aided Domain-specific Weight ModulationPrasanna B, Sunandini Sanyal, R. Venkatesh Babu. 2457-2463 [doi]
- CoVIO: Online Continual Learning for Visual-Inertial OdometryNiclas Vödisch, Daniele Cattaneo 0001, Wolfram Burgard, Abhinav Valada. 2464-2473 [doi]
- Just a Glimpse: Rethinking Temporal Information for Video Continual LearningLama Alssum, Juan León Alcázar, Merey Ramazanova, Chen Zhao 0002, Bernard Ghanem. 2474-2483 [doi]
- SCALE: Online Self-Supervised Lifelong Learning without Prior KnowledgeXiaofan Yu, Yunhui Guo, Sicun Gao, Tajana Rosing. 2484-2495 [doi]
- CLVOS23: A Long Video Object Segmentation Dataset for Continual LearningAmir Nazemi, Zeyad Moustafa, Paul W. Fieguth. 2496-2505 [doi]
- Density Map Distillation for Incremental Object CountingChenshen Wu, Joost van de Weijer 0001. 2506-2515 [doi]
- Simulating Task-Free Continual Learning Streams From Existing DatasetsAristotelis Chrysakis, Marie-Francine Moens. 2516-2524 [doi]
- Lifelong Learning of Task-Parameter Relationships for Knowledge TransferShikhar Srivastava 0001, Mohammad Yaqub, Karthik Nandakumar. 2525-2534 [doi]
- TFRGAN: Leveraging Text Information for Blind Face Restoration with Extreme DegradationChengxing Xie, Qian Ning, Weisheng Dong, Guangming Shi. 2535-2545 [doi]
- The MONET dataset: Multimodal drone thermal dataset recorded in rural scenariosLuigi Riz, Andrea Caraffa, Matteo Bortolon, Mohamed Lamine Mekhalfi, Davide Boscaini, André Moura, José Antunes, André Dias, Hugo Silva 0003, Andreas Leonidou, Christos Constantinides, Christos Keleshis, Dante Abate, Fabio Poiesi. 2546-2554 [doi]
- SSGVS: Semantic Scene Graph-to-Video SynthesisYuren Cong, Jinhui Yi, Bodo Rosenhahn, Michael Ying Yang. 2555-2565 [doi]
- Multi Event Localization by Audio-Visual Fusion with Omnidirectional Camera and Microphone ArrayWenru Zheng, Ryota Yoshihashi, Rei Kawakami, Ikuro Sato, Asako Kanezaki. 2566-2574 [doi]
- Dynamic Multimodal FusionZihui Xue, Radu Marculescu. 2575-2584 [doi]
- Exposing and Mitigating Spurious Correlations for Cross-Modal RetrievalJae-Myung Kim, A. Sophia Koepke, Cordelia Schmid, Zeynep Akata. 2585-2595 [doi]
- Adapting Grounded Visual Question Answering Models to Low Resource LanguagesYing Wang, Jonas Pfeiffer, Nicolas Carion, Yann LeCun, Aishwarya Kamath. 2596-2605 [doi]
- SEM-POS: Grammatically and Semantically Correct Video CaptioningAsmar Nadeem, Adrian Hilton 0001, Robert Dawes, Graham A. Thomas, Annin Mustafa. 2606-2616 [doi]
- Robust Multiview Multimodal Driver Monitoring System Using Masked Multi-Head Self-AttentionYiming Ma, Victor Sanchez, Soodeh Nikan, Devesh Upadhyay, Bhushan Atote, Tanaya Guha. 2617-2625 [doi]
- Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute RecognitionJun Zhu, Jiandong Jin, Zihan Yang, Xiaohao Wu, Xiao Wang. 2626-2629 [doi]
- Causalainer: Causal Explainer for Automatic Video SummarizationJia-Hong Huang, Chao-Han Huck Yang, Pin-Yu Chen, Min-Hung Chen, Marcel Worring. 2630-2636 [doi]
- Is Multimodal Vision Supervision Beneficial to Language?Avinash Madasu, Vasudev Lal. 2637-2642 [doi]
- Abstract Visual Reasoning Enabled by LanguageGiacomo Camposampiero, Loïc Houmard, Benjamin Estermann, Joël Mathys, Roger Wattenhofer. 2643-2647 [doi]
- Multimodal Integration of Human-Like Attention in Visual Question AnsweringEkta Sood, Fabian Kögel, Philipp Müller 0001, Dominike Thomas, Mihai Bâce, Andreas Bulling. 2648-2658 [doi]
- Kappa Angle Regression with Ocular Counter-Rolling Awareness for Gaze EstimationShiwei Jin, Ji Dai, Truong Nguyen 0001. 2659-2668 [doi]
- GazeCaps: Gaze Estimation with Self-Attention-Routed CapsulesHengfei Wang, Jun O. Oh, Hyung Jin Chang, Jin Hee Na, Minwoo Tae, Zhongqun Zhang, Sang-Il Choi. 2669-2677 [doi]
- Where are they looking in the 3D space?Nora Horanyi, Linfang Zheng, Eunji Chong, Ales Leonardis, Hyung Jin Chang. 2678-2687 [doi]
- EFE: End-to-end Frame-to-Gaze EstimationHaldun Balim, Seonwook Park, Xi Wang, Xucong Zhang, Otmar Hilliges. 2688-2697 [doi]
- Octree Transformer: Autoregressive 3D Shape Generation on Hierarchically Structured SequencesMoritz Ibing, Gregor Kobsik, Leif Kobbelt. 2698-2707 [doi]
- 3DSSR: 3D Subscene RetrievalReza Asad, Manolis Savva. 2708-2716 [doi]
- Attention-based Part Assembly for 3D Volumetric Shape ModelingChengzhi Wu, Junwei Zheng, Julius Pfrommer, Jürgen Beyerer. 2717-2726 [doi]
- SepicNet: Sharp Edges Recovery by Parametric Inference of Curves in 3D ShapesKseniya Cherenkova, Elona Dupont, Anis Kacem 0001, Ilya Arzhannikov, Gleb Gusev, Djamila Aouada. 2727-2735 [doi]
- IPD-Net: SO(3) Invariant Primitive Decompositional Network for 3D Point CloudsRamesh Ashok Tabib, Nitishkumar Upasi, Tejas Anvekar, Dikshit Hegde, Uma Mudenagudi. 2736-2744 [doi]
- OO-dMVMT: A Deep Multi-view Multi-task Classification Framework for Real-time 3D Hand Gesture Classification and SegmentationFederico Cunico, Federico Girella, Andrea Avogaro, Marco Emporio, Andrea Giachetti 0001, Marco Cristani. 2745-2754 [doi]
- Three Recipes for Better 3D Pseudo-GTs of 3D Human Mesh Estimation in the WildGyeongsik Moon, Hongsuk Choi, Sanghyuk Chun, Jiyoung Lee, Sangdoo Yun. 2755-2764 [doi]
- 3DSAINT Representation for 3D Point CloudsChandra Kambhamettu. 2765-2774 [doi]
- Face Image Lighting Enhancement Using a 3D ModelQiulin Chen, Jan P. Allebach. 2775-2784 [doi]
- BOP Challenge 2022 on Detection, Segmentation and Pose Estimation of Specific Rigid ObjectsMartin Sundermeyer, Tomás Hodan, Yann Labbé, Gu Wang 0001, Eric Brachmann, Bertram Drost, Carsten Rother, Jirí Matas. 2785-2794 [doi]
- Dual Attention Poser: Dual Path Body Tracking Based on AttentionXinhan Di, Xiaokun Dai, Xinkang Zhang, Xinrong Chen. 2795-2804 [doi]
- Efficient Multi-exposure Image Fusion via Filter-dominated Fusion and Gradient-driven Unsupervised LearningKaiwen Zheng, Jie Huang 0017, Hu Yu, Feng Zhao 0004. 2805-2814 [doi]
- Asymmetric Color Transfer with Consistent Modality LearningKaiwen Zheng, Jie Huang 0017, Man Zhou, Feng Zhao 0004. 2815-2823 [doi]
- FF-Former: Swin Fourier Transformer for Nighttime Flare RemovalDafeng Zhang, Jia OuYang, Guanqun Liu, Xiaobing Wang, Xiangyu Kong, Zhezhu Jin. 2824-2832 [doi]
- OTST: A Two-Phase Framework for Joint Denoising and Remosaicing in RGBW CFAZhihao Fan, Xun Wu, Fanqing Meng, Yaqi Wu, Feng Zhang. 2833-2842 [doi]
- Hard-negative Sampling with Cascaded Fine-Tuning Network to Boost Flare Removal Performance in the Nighttime ImagesSoonyong Song, Heechul Bae. 2843-2852 [doi]
- MIPI 2023 Challenge on Nighttime Flare Removal: Methods and ResultsYuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qingpeng Zhu, Qianhui Sun, Wenxiu Sun, Chen Change Loy, Jinwei Gu, Shuai Liu, Hao Wang, Chaoyu Feng, Luyang Wang, Guangqi Shao, Chenguang Zhang, Xiaotao Wang, Lei Lei, Dafeng Zhang, Xiangyu Kong, Guanqun Liu, Mengmeng Bai, Jia OuYang, Xiaobing Wang, Jiahui Yuan, Xinpeng Li, Chengzhi Jiang, Ting Jiang, Wenjie Lin, Qi Wu, Mingyan Han, Jinting Luo, Lei Yu, Haoqiang Fan, Shuaicheng Liu, Bo Yan, Zhuang Li, Yadong Li, Hongbin Wang, Soonyong Song, Minghan Fu, Rayyan Azam Khan, Fang-Xiang Wu, Zhao Zhang 0001, Suiyi Zhao, Huan Zheng, Yangcheng Gao, Yanyan Wei, Jiahuan Ren, Bo Wang, Yan Luo, Shuaibo Gao, Wenhui Wu, Sicong Kang, Nikhil Akalwadi, Ankit Raichur, Vinod Patil, Allabakash G, Swaroop A, Amogh Joshi, Chaitra Desai, Ramesh Ashok Tabib, Ujwala Patil, Uma Mudenagudi, Sicheng Li, Ruoxi Zhu, Jiazheng Lian, Shusong Xu, Zihao Liu, Sabari Nathan, Priya Kansal. 2853-2863 [doi]
- MIPI 2023 Challenge on RGB+ToF Depth Completion: Methods and ResultsQingpeng Zhu, Wenxiu Sun, Yuekun Dai, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Qianhui Sun, Chen Change Loy, Jinwei Gu, Yi Yu, Yangke Huang, Kang Zhang, Meiya Chen, Yu Wang, Yongchao Li, Hao Jiang, Amrit Kumar Muduli, Vikash Kumar, Kunal Swami, Pankaj Kumar Bajpai, Yunchao Ma, Jiajun Xiao, Zhi Ling. 2864-2870 [doi]
- MIPI 2023 Challenge on RGBW Fusion: Methods and ResultsQianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu, Hongyuan Yu, Yuqing Liu, Weichen Yu, Lin Ge, Xiaolin Zhang, Qi Jia, Heng Zhang, Xuanwu Yin, Kunlong Zuo, Qi Wu, Wenjie Lin, Ting Jiang, Chengzhi Jiang, Mingyan Han, Xinpeng Li, Jinting Luo, Lei Yu, Haoqiang Fan, Shuaicheng Liu, Kunyu Wang, Chengzhi Cao, Yuanshen Guan, Jiyuan Xia, Ruikang Xu, Mingde Yao, Zhiwei Xiong. 2871-2877 [doi]
- MIPI 2023 Challenge on RGBW Remosaic: Methods and ResultsQianhui Sun, Qingyu Yang, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Wenxiu Sun, Qingpeng Zhu, Chen Change Loy, Jinwei Gu, Yuqing Liu, Hongyuan Yu, Weichen Yu, Zhen Dong, Binnan Han, Qi Jia, Xuanwu Yin, Kunlong Zuo, Yaqi Wu, Zhihao Fan, Fanqing Meng, Xun Wu, Jiawei Zhang, Feng Zhang, Mingyan Han, Jinting Luo, Qi Wu, Ting Jiang, Chengzhi Jiang, Wenjie Lin, Xinpeng Li, Lei Yu, Haoqiang Fan, Shuaicheng Liu. 2878-2885 [doi]
- Multi-Task Learning based Video Anomaly Detection with AttentionMohammad Baradaran, Robert Bergevin. 2886-2896 [doi]
- Are we certain it's anomalous?Alessandro Flaborea, Bardh Prenkaj, Bharti Munjal, Marco Aurelio Sterpa, Dario Aragona, Luca Podo, Fabio Galasso. 2897-2907 [doi]
- Exploring the Importance of Pretrained Feature Extractors for Unsupervised Anomaly Detection and LocalizationLars Heckler, Rebecca König, Paul Bergmann. 2917-2926 [doi]
- Self-Supervised Normalizing Flows for Image Anomaly Detection and LocalizationLi-Ling Chiu, Shang-Hong Lai. 2927-2936 [doi]
- On Advantages of Mask-level Recognition for Outlier-aware SegmentationMatej Grcic, Josip Saric, Sinisa Segvic. 2937-2947 [doi]
- Denoising diffusion models for out-of-distribution detectionMark S. Graham, Walter H. L. Pinaya, Petru-Daniel Tudosiu, Parashkev Nachev, Sébastien Ourselin, M. Jorge Cardoso. 2948-2957 [doi]
- Anomaly Detection with Domain AdaptationZiyi Yang, Iman Soltani Bozchalooi, Eric Darve. 2958-2967 [doi]
- Back to the Feature: Classical 3D Features are (Almost) All You Need for 3D Anomaly DetectionEliahu Horwitz, Yedid Hoshen. 2968-2977 [doi]
- FewSOME: One-Class Few Shot Anomaly Detection with Siamese NetworksNiamh Belton, Misgina Tsighe Hagos, Aonghus Lawlor, Kathleen M. Curran. 2978-2987 [doi]
- SANO: Score-based Diffusion Model for Anomaly Localization in DermatologyÁlvaro González-Jiménez, Simone Lionetti, Marc Pouly, Alexander A. Navarini. 2988-2994 [doi]
- Region-based Appearance and Flow Characteristics for Anomaly Detection in Infrared Surveillance ImageryYona Falinie A. Gaus, Neelanjan Bhowmik, Brian K. S. Isaac-Medina, Hubert P. H. Shum, Amir Atapour Abarghouei, Toby P. Breckon. 2995-3005 [doi]
- Motion Matters: Difference-based Multi-scale Learning for Infrared UAV DetectionRuian He, Shili Zhou, Ri Cheng, Yuqi Sun, Weimin Tan, Bo Yan 0001. 3006-3015 [doi]
- A Real-time and Lightweight Method for Tiny Airborne Object DetectionYanyi Lyu, Zhunga Liu, Huandong Li, Dongxiu Guo, Yimin Fu. 3016-3025 [doi]
- A Global-Local Tracking Framework Driven by Both Motion and Appearance for Infrared Anti-UAVYifan Li, Dian Yuan, Meng Sun, Hongyu Wang, Xiaotao Liu, Jing Liu. 3026-3035 [doi]
- A Unified Transformer-based Tracker for Anti-UAV TrackingQianjin Yu, Yinchao Ma, Jianfeng He, Dawei Yang, Tianzhu Zhang. 3036-3046 [doi]
- Strong Detector with Simple TrackerZongheng Tang, YuLu Gao, Zizheng Xun, Fengguang Peng, Yifan Sun 0003, Si Liu 0001, Bo Li. 3047-3053 [doi]
- Video Tiny-Object Detection Guided by the Spatial-Temporal Motion InformationXin Yang, Gang Wang, Weiming Hu, Jin Gao, Shubo Lin, Liang Li, Kai Gao, Yizheng Wang. 3054-3063 [doi]
- The Second Monocular Depth Estimation ChallengeJaime Spencer, C. Stella Qian, Michaela Trescakova, Chris Russell 0001, Simon Hadfield, Erich W. Graf, Wendy J. Adams, Andrew J. Schofield, James H. Elder, Richard Bowden, Ali Anwar 0002, Hao Chen, Xiaozhi Chen, Kai Cheng, Yuchao Dai, Huynh Thai Hoa, Sadat Hossain, Jianmian Huang, Mohan Jing, Bo Li, Chao Li, Baojun Li, Zhiwen Liu, Stefano Mattoccia, Siegfried Mercelis, MyungWoo Nam, Matteo Poggi, Xiaohua Qi, Jiahui Ren, Yang Tang, Fabio Tosi, Linh Trinh, S. M. Nadim Uddin, Khan Muhammad Umair, Kaixuan Wang, Yufei Wang, Yixing Wang, Mochu Xiang, Guangkai Xu, Wei Yin, Jun Yu, Qi Zhang, Chaoqiang Zhao. 3064-3076 [doi]
- Exploring the Utility of Self-Supervised Pretraining Strategies for the Detection of Absent Lung Sliding in M-Mode Lung UltrasoundBlake VanBerlo, Brian Li, Alexander Wong, Jesse Hoey, Robert Arntfield. 3077-3086 [doi]
- Self-Supervised Learning for Accurate Liver View Classification in Ultrasound Images with Minimal Labeled DataAbder-Rahman Ali, Anthony E. Samir, Peng Guo. 3087-3093 [doi]
- A deep learning-based approach to increase efficiency in the acquisition of ultrasonic non-destructive testing datasetsNick Luiken, Matteo Ravasi. 3094-3102 [doi]
- Deep Learning Video Classification of Lung Ultrasound Features Associated with PneumoniaDaniel E. Shea, Sourabh Kulhare, Rachel Millin, Zohreh Laverriere, Courosh Mehanian, Charles B. Delahunt, Dipayan Banik, Xinliang Zheng, Meihua Zhu, Ye Ji, Travis Ostbye, Martha-Marie S. Mehanian, Atinuke Uwajeh, Adeseye M. Akinsete, Fen Wang, Matthew P. Horning. 3103-3112 [doi]
- Image Inpainting with Hypergraphs for Resolution Improvement in Scanning Acoustic MicroscopyAyush Somani, Pragyan Banerjee, Krishna Agarwal, Manu Rastogi, Dilip K. Prasad, Anowarul Habib. 3113-3122 [doi]
- DOAD: Decoupled One Stage Action Detection NetworkShuning Chang, Pichao Wang, Fan Wang, Jiashi Feng, Mike Zheng Shou. 3123-3232 [doi]
- A New Dataset and Approach for Timestamp Supervised Action Segmentation Using Human Object InteractionSaif Iftekar Sayed, Reza Ghoddoosian, Bhaskar Trivedi, Vassilis Athitsos. 3133-3142 [doi]
- Multi-Annotation Attention Model for Video SummarizationHacene Terbouche, Maryan Morel, Mariano Rodriguez, Alice Othmani. 3143-3152 [doi]
- Global Motion Understanding in Large-Scale Video Object SegmentationVolodymyr Fedynyak, Yaroslav Romanus, Oles Dobosevych, Igor Babin, Roman Riazantsev. 3153-3162 [doi]
- Multi-Object Tracking by Self-supervised Learning Appearance ModelKaer Huang, Kanokphan Lertniphonphan, Feng Chen, Jian Li, Zhepeng Wang. 3163-3169 [doi]
- An Improved Association Pipeline for Multi-Person TrackingDaniel Stadler, Jürgen Beyerer. 3170-3179 [doi]
- Pixel-level Contrastive Learning of Driving Videos with Optical FlowTomoya Takahashi, Shingo Yashima, Kohta Ishikawa, Ikuro Sato, Rio Yokota. 3180-3187 [doi]
- Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object DetectionKaicheng Yu, Tang Tao, Hongwei Xie, Zhiwei Lin, Tingting Liang, Bing Wang, Peng Chen, Dayang Hao, Yongtao Wang, Xiaodan Liang. 3188-3198 [doi]
- LDFA: Latent Diffusion Face Anonymization for Self-driving ApplicationsMarvin Klemp, Kevin Rösch, Royden Wagner, Jannik Quehl, Martin Lauer. 3199-3205 [doi]
- Integrated Perception and Planning for Autonomous Vehicle Navigation: An Optimization-based ApproachShubham Kedia, Yu Zhou, Sambhu H. Karumanchi. 3206-3215 [doi]
- Correlation Pyramid Network for 3D Single Object TrackingMengmeng Wang, Teli Ma, Xingxing Zuo, Jiajun Lv, Yong Liu 0007. 3216-3225 [doi]
- Contrastive Learning for Depth PredictionRizhao Fan, Matteo Poggi, Stefano Mattoccia. 3226-3237 [doi]
- DynStatF: An Efficient Feature Fusion Strategy for LiDAR 3D Object DetectionYao Rong, Xiangyu Wei, Tianwei Lin, Yueyu Wang, Enkelejda Kasneci. 3238-3247 [doi]
- Lanelet2 for nuScenes: Enabling Spatial Semantic Relationships and Diverse Map-based Anchor PathsAlexander Naumann, Felix Hertlein, Daniel Grimm, Maximilian Zipfl, Steffen Thoma, Achim Rettinger, Lavdim Halilaj, Juergen Luettin, Stefan Schmid 0002, Holger Caesar. 3248-3257 [doi]
- Consistency and Accuracy of CelebA Attribute ValuesHaiyu Wu, Grace Bezold, Manuel Günther, Terrance E. Boult, Michael C. King, Kevin W. Bowyer. 3258-3266 [doi]
- Compensation Learning in Semantic SegmentationTimo Kaiser, Christoph Reinders, Bodo Rosenhahn. 3267-3278 [doi]
- Scoring Your Prediction on Unseen DataYuhao Chen, Shen Zhang, Renjie Song. 3279-3288 [doi]
- Digital Twin Tracking Dataset (DTTD): A New RGB+Depth 3D Dataset for Longer-Range Object Tracking ApplicationsWeiyu Feng, Seth Z. Zhao, Chuanyu Pan, Adam Chang, Yichen Chen, Zekun Wang, Allen Y. Yang. 3289-3298 [doi]
- K-means Clustering Based Feature Consistency Alignment for Label-free Model EvaluationShuyu Miao, Lin Zheng, Jingjing Liu, Hong Jin. 3299-3307 [doi]
- Exploring Video Frame Redundancies for Efficient Data Sampling and Annotation in Instance SegmentationJihun Yoon, Min-Kook Choi. 3308-3317 [doi]
- WEDGE: A multi-weather autonomous driving dataset built from generative vision-language modelsAboli Marathe, Deva Ramanan, Rahee Walambe, Ketan Kotecha. 3318-3327 [doi]
- Human Gesture and Gait Analysis for Autism DetectionSania Zahan, Syed Zulqarnain Gilani, Ghulam Mubashar Hassan, Ajmal Mian. 3328-3337 [doi]
- Privileged Knowledge Distillation for Dimensional Emotion Recognition in the WildMuhammad Haseeb Aslam, Muhammad Osama Zeeshan, Marco Pedersoli, Alessandro L. Koerich, Simon Bacon, Eric Granger. 3338-3347 [doi]
- Online LiDAR-to-Vehicle Alignment Using Lane Markings and Traffic SignsYao Hu, Xinyu Du, Shengbing Jiang. 3348-3357 [doi]
- DeepSmooth: Efficient and Smooth Depth CompletionSriram Krishna, Basavaraja Shanthappa Vandrotti. 3358-3367 [doi]
- Network Specialization via Feature-level Knowledge DistillationGaowen Liu, Yuzhang Shang, Yuguang Yao, Ramana Kompella. 3368-3375 [doi]
- ST-RoomNet: Learning Room Layout Estimation From Single Image Through Unsupervised Spatial TransformationsHatem Ibrahem, Ahmed Salem 0005, Hyun Soo Kang. 3376-3384 [doi]
- PanopticVis: Integrated Panoptic Segmentation for Visibility Estimation at Twilight and NightHidetomo Sakaino. 3385-3398 [doi]
- Light Field Synthesis from a Monocular Image using Variable LDIJunhyeong Bak, In Kyu Park. 3399-3407 [doi]
- Toward Real-World Light Field Super-ResolutionZeyu Xiao, Ruisheng Gao, Yutong Liu, Yueyi Zhang, Zhiwei Xiong. 3408-3418 [doi]
- Disentangling Local and Global Information for Light Field Depth EstimationXueting Yang, Junli Deng, Rongshan Chen, Ruixuan Cong, Wei Ke 0001, Hao Sheng 0001. 3419-3427 [doi]
- CNT-NeRF: Carbon Nanotube Forest Depth Layer Decomposition in SEM Imagery using Generative Adversarial NetworksNguyen P. Nguyen, Ramakrishna Surya, Prasad Calyam, Kannappan Palaniappan, Matthew R. Maschmann, Filiz Bunyak. 3428-3437 [doi]
- EPI-Guided Cost Construction Network for Light Field Disparity EstimationTun Wang, Rongshan Chen, Ruixuan Cong, Da Yang, Zhenglong Cui, Fangping Li, Hao Sheng 0001. 3438-3446 [doi]
- A Data-Driven Approach based on Dynamic Mode Decomposition for Efficient Encoding of Dynamic Light FieldsJoshitha Ravishankar, Sally Khaidem, Mansi Sharma. 3447-3453 [doi]
- Multi-view Semantic Information Guidance for Light Field Image SegmentationYiming Li, Ruixuan Cong, Sizhe Wang, Mingyuan Zhao, Yang Zhang 0032, Fangping Li, Hao Sheng 0001. 3454-3462 [doi]
- Implicit Epipolar Geometric Function based Light Field Continuous Angular RepresentationLin Zhong, Bangcheng Zong, Qiming Wang, Junle Yu, WenHui Zhou. 3463-3472 [doi]
- LFNAT 2023 Challenge on Light Field Depth Estimation: Methods and ResultsHao Sheng 0001, Yebin Liu, Jingyi Yu, Gaochang Wu, Wei Xiong, Ruixuan Cong, Rongshan Chen, Longzhao Guo, Yanlin Xie, Shuo Zhang, Song Chang, Youfang Lin, Wentao Chao, Xuechun Wang, Guanghui Wang, Fuqing Duan, Tun Wang, Da Yang, Zhenglong Cui, Sizhe Wang, Mingyuan Zhao, Qiong Wang, Qianyu Chen, Zhengyu Liang, Yingqian Wang, Jungang Yang, Xueting Yang, Junli Deng. 3473-3485 [doi]
- Diffusart: Enhancing Line Art Colorization with Conditional Diffusion ModelsHernan Carrillo, Michaël Clément, Aurélie Bugeau, Edgar Simo-Serra. 3486-3490 [doi]
- FreqHPT: Frequency-aware attention and flow fusion for Human Pose TransferLiyuan Ma, Tingwei Gao, Haibin Shen, Kejie Huang. 3491-3496 [doi]
- Fashion-Specific Ambiguous Expression Interpretation with Partial Visual-Semantic EmbeddingRyotaro Shimizu, Takuma Nakamura, Masayuki Goto. 3497-3502 [doi]
- SkiLL: Skipping Color and Label Landscape: Self Supervised Design Representations for Products in E-commerceVinay Kumar Verma, Dween Rabius Sanny, Shreyas Sunil Kulkarni, Prateek Sircar, Abhishek Singh, Deepak Gupta. 3503-3507 [doi]
- SHIFT15M: Fashion-specific dataset for set-to-set matching with several distribution shiftsMasanari Kimura, Takuma Nakamura, Yuki Saito. 3508-3513 [doi]
- FashionVQA: A Domain-Specific Visual Question Answering SystemMingyu Wang, Ata Mahjoubfar, Anupama Joshi. 3514-3519 [doi]
- Shape of You: Precise 3D shape estimations for diverse body typesRohan Sarkar, Achal Dave, Gerard Medioni, Benjamin Biggs. 3520-3524 [doi]
- Image Reference-guided Fashion Design with Structure-aware Transfer by Diffusion ModelsShidong Cao, Wenhao Chai, Shengyu Hao, Gaoang Wang. 3525-3529 [doi]
- Name your style: text-guided artistic style transferZhi-Song Liu, Li-wen Wang, Wan-Chi Siu, Vicky Kalogeiton. 3530-3534 [doi]
- DETR-based Layered Clothing Segmentation and Fine-Grained Attribute RecognitionHao Tian, Yu Cao, P. Y. Mok. 3535-3539 [doi]
- KBody: Balanced monocular whole-body estimationNikolaos Zioulis, James F. O'Brien. 3540-3545 [doi]
- Gatha: Relational Loss for enhancing text-based style transferSurgan Jandial, Shripad Deshmukh, Abhinav Java, Simra Shahid, Balaji Krishnamurthy. 3546-3551 [doi]
- Shape-Net: Room Layout Estimation from Panoramic Images Robust to Occlusion using Knowledge Distillation with 3D Shapes as Additional InputsMizuki Tabata, Kana Kurata, Junichiro Tamamatsu. 3552-3561 [doi]
- U2RLE: Uncertainty-Guided 2-Stage Room Layout EstimationPooya Fayyazsanavi, Zhiqiang Wan, Will Hutchcroft, Ivaylo Boyadzhiev, Yuguang Li, Jana Kosecka, Sing Bing Kang. 3562-3570 [doi]
- Motion-state Alignment for Video Semantic SegmentationJinming Su, Ruihong Yin, Shuaibin Zhang, Junfeng Luo. 3571-3580 [doi]
- Perceive, Excavate and Purify: A Novel Object Mining Framework for Instance SegmentationJinming Su, Ruihong Yin, Xingyue Chen, Junfeng Luo. 3581-3590 [doi]
- PanopticRoad: Integrated Panoptic Road Segmentation Under Adversarial ConditionsHidetomo Sakaino. 3591-3603 [doi]
- A unified model for continuous conditional video predictionXi Ye, Guillaume-Alexandre Bilodeau. 3604-3613 [doi]
- Best Practices for 2-Body Pose ForecastingMuhammad Rameez Ur Rahman, Luca Scofano, Edoardo De Matteis, Alessandro Flaborea, Alessio Sampieri, Fabio Galasso. 3614-3624 [doi]
- 3D-IntPhys: Towards More Generalized 3D-grounded Visual Intuitive Physics under Challenging ScenesHaotian Xue, Antonio Torralba 0001, Joshua B. Tenenbaum, Daniel Yamins, Yunzhu Li, Hsiao-Yu Tung. 3625-3635 [doi]
- StillFast: An End-to-End Approach for Short-Term Object Interaction AnticipationFrancesco Ragusa, Giovanni Maria Farinella, Antonino Furnari. 3636-3645 [doi]
- Bush Detection for Vision-based UGV Guidance in Blueberry Orchards: Data Set and MethodsVladan Filipovic, Dimitrije Stefanovic, Nina Pajevic, Zeljana Grbovic, Nemanja Djuric, Marko Panic. 3646-3655 [doi]
- DPOSE: Online Keypoint-CAM Guided Inference for Driver Pose Estimation with GMM-based Balanced SamplingYuyu Guo, Yancheng Bai, Daiqi Shi, Yan Cai 0001, Wei Bian. 3656-3665 [doi]
- CIPF: Crossing Intention Prediction Network based on Feature Fusion Modules for Improving Pedestrian SafetyJe-Seok Ham, Dae Hoe Kim, NamKyo Jung, Jinyoung Moon. 3666-3675 [doi]
- DNA: Deformable Neural Articulations Network for Template-free Dynamic 3D Human Reconstruction from Monocular RGB-D VideoKhoa Vo 0001, Trong-Thang Pham, Kashu Yamazaki, Minh Q. Tran, Ngan Le. 3676-3685 [doi]
- ODSmoothGrad: Generating Saliency Maps for Object DetectorsChul Gwon, Steven C. Howell. 3686-3690 [doi]
- Sanity checks for patch visualisation in prototype-based image classificationRomain Xu-Darme, Georges Quénot, Zakaria Chihani, Marie-Christine Rousset. 3691-3696 [doi]
- The Manifold Hypothesis for Gradient-Based ExplanationsSebastian Bordt, Uddeshya Upadhyay, Zeynep Akata, Ulrike von Luxburg. 3697-3702 [doi]
- Hierarchical Explanations for Video Action RecognitionSadaf Gulshad, Teng Long, Nanne van Noord. 3703-3708 [doi]
- A Confusion Matrix for Evaluating Feature Attribution MethodsAnna Arias-Duart, Ettore Mariotti, Dario Garcia-Gasulla, Jose Maria Alonso-Moral. 3709-3714 [doi]
- Robustness of Visual Explanations to Common Data Augmentation MethodsLenka Tetková, Lars Kai Hansen. 3715-3720 [doi]
- Localized Shortcut RemovalNicolas M. Müller, Jochen Jacobs, Jennifer Williams, Konstantin Böttinger. 3721-3725 [doi]
- Towards Evaluating Explanations of Vision Transformers for Medical ImagingPiotr Komorowski, Hubert Baniecki, Przemyslaw Biecek. 3726-3732 [doi]
- Seg-XRes-CAM: Explaining Spatially Local Regions in Image SegmentationSyed Nouman Hasany, Caroline Petitjean, Fabrice Mériaudeau. 3733-3738 [doi]
- Analyzing Results of Depth Estimation Models with Monocular CriteriaJonas Theiner, Nils Nommensen, Jim Rhotert, Matthias Springstein, Eric Müller-Budack, Ralph Ewerth. 3739-3743 [doi]
- Text2Concept: Concept Activation Vectors Directly from TextMazda Moayeri, Keivan Rezaei, Maziar Sanjabi, Soheil Feizi. 3744-3749 [doi]
- CAVLI - Using image associations to produce local concept-based explanationsPushkar Shukla, Sushil Bharati, Matthew A. Turk. 3750-3755 [doi]
- Vision DiffMask: Faithful Interpretation of Vision Transformers with Differentiable Patch MaskingAngelos Nalmpantis, Apostolos Panagiotopoulos, John Gkountouras, Konstantinos Papakostas, Wilker Aziz. 3756-3763 [doi]
- Ante-Hoc Generation of Task-Agnostic Interpretation MapsAkash Guna R. T, Raul Benitez, O. K. Sikha. 3764-3769 [doi]
- Disentangling Neuron Representations with Concept VectorsLaura O'Mahony, Vincent Andrearczyk, Henning Müller, Mara Graziani. 3770-3775 [doi]
- Shared Interest...Sometimes: Understanding the Alignment between Human Perception, Vision Architectures, and Saliency Map TechniquesKatelyn Morrison, Ankita Mehra, Adam Perer. 3776-3781 [doi]
- ZEBRA: Explaining rare cases through outlying interpretable conceptsPedro Madeira, André V. Carreiro, Alex Gaudio, Luís Rosado, Filipe Soares, Asim Smailagic. 3782-3788 [doi]
- Uncovering the Inner Workings of STEGO for Safe Unsupervised Semantic SegmentationAlexander Koenig, Maximilian Schambach, Johannes S. Otterbach. 3789-3798 [doi]
- Coherent Concept-based Explanations in Medical Image and Its Application to Skin Lesion DiagnosisCristiano Patrício, João C. Neves, Luís F. Teixeira 0001. 3799-3808 [doi]
- Maximum Entropy Information Bottleneck for Uncertainty-aware Stochastic EmbeddingSungtae An, Nataraj Jammalamadaka, Eunji Chong. 3809-3818 [doi]
- Optimizing Explanations by Network Canonization and Hyperparameter SearchFrederik Pahde, Galip Ümit Yolcu, Alexander Binder, Wojciech Samek, Sebastian Lapuschkin. 3819-3828 [doi]
- Revealing Hidden Context Bias in Segmentation and Object Detection through Concept-specific ExplanationsMaximilian Dreyer, Reduan Achtibat, Thomas Wiegand, Wojciech Samek, Sebastian Lapuschkin. 3829-3839 [doi]
- Investigating CLIP Performance for Meta-data Generation in AD DatasetsSujan Sai Gannamaneni, Arwin Sadaghiani, Rohil Prakash Rao, Michael Mock, Maram Akila. 3840-3850 [doi]
- A Novel Benchmark for Refinement of Noisy Localization Labels in Autolabeled Datasets for Object DetectionAndreas Bär, Jonas Uhrig, Jeethesh Pai Umesh, Marius Cordts, Tim Fingscheidt. 3851-3860 [doi]
- RL-CAM: Visual Explanations for Convolutional Networks using Reinforcement LearningSoumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Sahand Ghorbanpour, Vineet Gundecha, Antonio Guillen, Ricardo Luna Gutierrez, Avisek Naug. 3861-3869 [doi]
- Category Differences Matter: A Broad Analysis of Inter-Category Error in Semantic SegmentationJingxing Zhou, Jürgen Beyerer. 3870-3880 [doi]
- Beyond AUROC & co. for evaluating out-of-distribution detection performanceGaladrielle Humblot-Renaux, Sergio Escalera, Thomas B. Moeslund. 3881-3890 [doi]
- Interpretable Model-Agnostic Plausibility Verification for 2D Object Detectors Using Domain-Invariant Concept Bottleneck ModelsMert Keser, Gesina Schwalbe, Azarm Nowzad, Alois Knoll. 3891-3900 [doi]
- Live Demonstration: PINK: Polarity-based Anti-flicker for Event CamerasGyubeom Im, Keunjoo Park, Junseok Kim, Bongki Son, Seungchul Shin, Haechang Lee. 3901-3902 [doi]
- Exploring Joint Embedding Architectures and Data Augmentations for Self-Supervised Representation Learning in Event-Based VisionSami Barchid, José Mennesson, Chaabane Djeraba. 3903-3912 [doi]
- How Many Events Make an Object? Improving Single-frame Object Detection on the 1 Mpx Dataset