Abstract is missing.
- UnSeGArmaNet: Unsupervised Image Segmentation using Graph Neural Networks with Convolutional ARMA FiltersKovvuri Sai Gopal Reddy, Bodduluri Saran, A. Mudit Adityaja, Saurabh J. Shigwan, Nitin Kumar 0003, Snehasis Mukherjee. [doi]
- Distribution-Aware Calibration for Object Detection with Noisy Bounding BoxesDonghao Zhou, Jialin Li, Jinpeng Li 0004, Jiancheng Huang, Qiang Nie, Yong Liu 0032, Bin-Bin Gao, Qiong Wang 0001, Pheng-Ann Heng, Guangyong Chen. [doi]
- Improving Multimodal Learning with Multi-Loss Gradient ModulationKonstantinos Kontras, Christos Chatzichristos, Matthew B. Blaschko, Maarten De Vos. [doi]
- Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search SpaceJunho Lee, Jeongwoo Shin, Seung-Woo Ko, Seongsu Ha, Joonseok Lee. [doi]
- CLIP Adaptation by Intra-Modal Overlap ReductionAlexey Kravets, Vinay P. Namboodiri. [doi]
- Enabling Local Editing in Diffusion Models by Joint and Individual Component AnalysisTheodoros Kouzelis, Emmanouil Plitsis, Mihalis Nicolaou, Yannis Panagakis. [doi]
- PhysFlow: Skin tone transfer for remote heart rate estimation through conditional normalizing flowsJoaquim Comas, Antònia Alomar, Adria Ruiz, Federico Sukno. [doi]
- Acoustic-based 3D Human Pose Estimation Robust to Human PositionYusuke Oumi, Yuto Shibata, Go Irie, Akisato Kimura, Yoshimitsu Aoki, Mariko Isogawa. [doi]
- Local Implicit Wavelet Transformer for Arbitrary-Scale Super-ResolutionMinghong Duan, Linhao Qu, Shaolei Liu, Manning Wang. [doi]
- Multimodal base distributions in conditional flow matching generative modelsShane Josias, Willie Brink. [doi]
- Efficiency-preserving Scene-adaptive Object DetectionZekun Zhang, Vu Quang Truong, Minh Hoai. [doi]
- Mixstyle-Entropy: Whole Process Domain Generalization with Causal Intervention and PerturbationLuyao Tang, Yuxuan Yuan, Chaoqi Chen, Xinghao Ding, Yue Huang 0001. [doi]
- Topology-preserving Adversarial Training for Alleviating Natural Accuracy DegradationXiaoyue Mi, Fan Tang, Yepeng Weng, Danding Wang, Juan Cao 0001, Sheng Tang, Peng Li 0030, Yang Liu 0005. [doi]
- Separated and Independent Contrastive Learning on Labeled and Unlabeled Samples: Boosting Performance on Long-tail Semi-supervised LearningDongyoung Kim, Jeong-Gun Lee, Wonsook Lee. [doi]
- EIANet: A Novel Domain Adaptation Approach to Maximize Class Distinction with Neural Collapse PrinciplesZicheng Pan, Xiaohan Yu 0001, Yongsheng Gao 0001. [doi]
- PEEKABOO: Hiding Parts of an Image for Unsupervised Object LocalizationHasib Zunair, Abdessamad Ben Hamza. [doi]
- Anomaly Detection Based on Semi-Formula Driven Pre-training Dataset to Represent Subtle Difference and Anomaly ScoreHiroki Kobayashi, Naoki Murakami, Naoto Hiramatsu, Takahiro Suzuki, Manabu Hashimoto. [doi]
- Erasing Concepts from Text-to-Image Diffusion Models with Few-shot UnlearningMasane Fuchi, Tomohiro Takagi. [doi]
- Prompt-guided Multi-modal contrastive learning for Cross-compression-rate Deepfake DetectionChing-Yi Lai, Chiou-Ting Hsu, Chih-Chung Hsu, Chia-Wen Lin. [doi]
- Unsupervised Domain Adaptation for Tubular Structure Segmentation Across Different Anatomical SourcesYuxiang An, Dongnan Liu, Weidong Cai 0001. [doi]
- SciPostLayout: A Dataset for Layout Analysis and Layout Generation of Scientific PostersShohei Tanaka, Hao Wang, Yoshitaka Ushiku. [doi]
- MxT: Mamba x Transformer for Image InpaintingShuang Chen 0010, Amir Atapour Abarghouei, Haozheng Zhang, Hubert P. H. Shum. [doi]
- GLCM-Adapter: Global-Local Content Matching for Few-shot CLIP AdaptationShuo Wang 0008, Xieenlong, Jinda Lu, Jinghan Li, Yanbin Hao. [doi]
- PlainMamba: Improving Non-Hierarchical Mamba in Visual RecognitionChenhongyi Yang, Zehui Chen, Miguel Espinosa, Linus Ericsson, Zhenyu Wang, Jiaming Liu, Elliot J. Crowley. [doi]
- PV-SLAM: Panoptic Visual SLAM with Loop Closure and Online Bundle AdjustmentAshok Bandyopadhyay, Pranjal Baranwal, Arijit Sur, U. P. Rajeev. [doi]
- SuperLoRA: Parameter-Efficient Unified Adaptation of Large Foundation ModelsXiangyu Chen 0008, Jing Liu 0009, Ye Wang 0001, Pu (Perry) Wang, Matthew Brand, Guanghui Wang 0001, Toshiaki Koike-Akino. [doi]
- Semantic Image Synthesis of Anime Characters Based on Conditional Generative Adversarial NetworksXuhui Zhu, Feng Jiang, Jing Wen, Yi Wang, Qiang Gao. [doi]
- Boundary Contrastive Learning for Label-Efficient Medical Image SegmentationSatoshi Kamiya, Kota Yamashita, Kazuhiro Hotta. [doi]
- TalkLoRA: Low-Rank Adaptation for Speech-Driven AnimationJack R. Saunders, Vinay P. Namboodiri. [doi]
- Flexible Graph Convolutional Network for 3D Human Pose EstimationAbu Taib Mohammed Shahjahan, Abdessamad Ben Hamza. [doi]
- Key-point Guided Deformable Image Manipulation Using Diffusion ModelSeokHwan Oh, Guil Jung, Myeong-Gee Kim, Sang-Yun Kim 0003, Young-Min Kim 0004, Hyeon-Jik Lee, Hyuksool Kwon, Hyeon-Min Bae. [doi]
- MV-Match: Multi-View Matching for Domain-Adaptive Identification of Plant Nutrient DeficienciesJinhui Yi, Yanan Luo, Marion Deichmann, Gabriel Schaaf, Juergen Gall. [doi]
- Budget-aware Dynamic Spatially Adaptive InferenceGeorgios Zampokas, Christos-Savvas Bouganis, Dimitrios Tzovaras. [doi]
- JEAN: Joint Expression and Audio-guided NeRF-based Talking Face GenerationSai Tanmay Reddy Chakkera, Aggelina Chatziagapi, Dimitris Samaras. [doi]
- RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance FieldsMihnea Bogdan Jurca, Remco Royen, Ion Giosan, Adrian Munteanu 0001. [doi]
- Motion Tracking with Rotated Bounding Boxes on Overhead Fisheye ImageryJordan Lam. [doi]
- topK dice loss for medical image segmentationSeyed Mohsen Hosseini. [doi]
- ML-2SN: A Hybrid Two-Stream System for Sitting Posture DetectionKehang Jia, Gaorui Zhang, Yixuan Yang, Guangwei Huang, Penghuan Wang, Cheng Cheng. [doi]
- Infrared and Visible Image Fusion Using Multi-level Adaptive Fractional DifferentialKang Zhang, Xinnian Guo. [doi]
- ControlDreamer: Blending Geometry and Style in Text-to-3DYeongtak Oh, Jooyoung Choi, YongSung Kim, MinJun Park, Chaehun Shin, Sungroh Yoon. [doi]
- Self-Supervised Real-World Denoising by Jointly Learning Visible and Invisible NoiseShaoyu Wang, Changze Zhou, Bolin Song, Yiyang Wang. [doi]
- Lightweight Human Pose Estimation with Enhanced Knowledge ReviewHao Xu, Shengye Yan, Wei Zheng. [doi]
- CosFairNet: A Parameter-Space based Approach for Bias Free LearningRajeev Ranjan Dwivedi, Priyadarshini Kumari, Vinod K. Kurmi. [doi]
- Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit QuantizationRóisín Luo, Alexandru Drimbarean, James McDermott, Colm O'Riordan. [doi]
- On Partial Prototype Collapse in the DINO Family of Self-Supervised MethodsHariprasath Govindarajan, Per Sidén, Jacob Roll, Fredrik Lindsten. [doi]
- Few-shot Multispectral Segmentation with Representations Generated by Reinforcement LearningDilith Jayakody, Thanuja D. Ambegoda. [doi]
- From Black-box to Label-only: a Plug-and-Play Attack Network for Model InversionHuan Bao, Kaimin Wei, Yao Chen, Hanting Hou, Jinpeng Chen 0001, Yongdong Wu. [doi]
- Calibration of 2D LiDAR sensors using cylindrical targetTamás Tófalvi, Bandó Kovács, Levente Hajder. [doi]
- Out-Of-Distribution Detection for Audio-visual Generalized Zero-Shot Learning: A General FrameworkLiuyuan Wen. [doi]
- NCA-Morph: Medical Image Registration with Neural Cellular AutomataAmin Ranem, John Kalkhof, Anirban Mukhopadhyay 0003. [doi]
- Prompt Generation Networks for Input-Space Adaptation of Frozen Vision TransformersJochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M. Asano. [doi]
- CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder RefinementYijie Li 0003, Hewei Wang 0001, Aggelos K. Katsaggelos. [doi]
- Transferable Learned Image Compression-Resistant Adversarial PerturbationsYang Sui, Zhuohang Li, Ding Ding 0004, Xiang Pan, Xiaozhong Xu, Shan Liu 0001, Zhenzhong Chen. [doi]
- Detecting Audio-Visual Deepfakes with Fine-Grained InconsistenciesMarcella Astrid, Enjie Ghorbel, Djamila Aouada. [doi]
- FFR-UNet: Feature Filter-Refinement UNet for Medical Image SegmentationWeixin Xu. [doi]
- PatchRot: Self-Supervised Training of Vision Transformers by Rotation PredictionSachin Chhabra, Hemanth Venkateswara, Baoxin Li. [doi]
- Open-Vocabulary Temporal Action Localization using Multimodal GuidanceAkshita Gupta, Aditya Arora, Sanath Narayan, Salman Khan 0001, Fahad Shahbaz Khan, Graham W. Taylor. [doi]
- TransHuPR: Cross-View Fusion Transformer for Human Pose Estimation Using mmWave RadarNiraj Prakash Kini, Ruey-Horng Shiue, Ryan Chandra, Wen-Hsiao Peng, Ching-Wen Ma, Jenq-Neng Hwang. [doi]
- Effective Message Hiding with Order-Preserving MechanismsYu Gao, Xuchong Qiu, Zihan Ye. [doi]
- Interactive Image Segmentation with Temporal Information AugmentedQiaoqiao Wei, Hui Zhang 0013, Jun-Hai Yong. [doi]
- Unified Compositional Query Machine with Multimodal Consistency for Video-based Human Activity RecognitionTuyen Tran, Thao Minh Le, Duy Hung Tran, Truyen Tran 0001. [doi]
- Enhancing 3D Hand Pose Estimation via Dense Ordinal Regression NetworkYamin Mao, Zhihua Liu, Weiming Li, SoonYong Cho, Qiang Wang, Xiaoshuai Hao. [doi]
- Backdoor Defense through Self-Supervised and Generative LearningIvan Sabolic, Ivan Grubisic 0001, Sinisa Segvic. [doi]
- Multi-Scale Semantic Enrichment and Dual Angular Margin Contrast for Few-Shot Class Incremental LearningRiya Verma, Sukhendu Das. [doi]
- Unsupervised Hashing Network with Hyper Quantization TreeSungeun Kim, Jongbin Ryu. [doi]
- NSSR-DIL: Null-Shot Image Super-Resolution Using Deep Identity LearningSree Rama Vamsidhar S., Gorthi Rama Krishna Sai Subrahmanyam. [doi]
- FADE: Few-shot/zero-shot Anomaly Detection Engine using Large Vision-Language ModelYuanwei Li, Elizaveta Ivanova, Martins Bruveris. [doi]
- ACIL: Active Class Incremental Learning for Image ClassificationAditya R. Bhattacharya, Debanjan Goswami, Shayok Chakraborty. [doi]
- Revitalizing Legacy Video Content: Deinterlacing with Bidirectional Information PropagationZhaowei Gao, MingYang Song, Christopher Schroers, Yang Zhang. [doi]
- ICAF-4: An Integrated Framework of Category-level Articulated Object Perception and Manipulation for Embodied IntelligenceWenbo Xu, Li Zhang, Qiankun Li, Qi Wu, Lin Yuanbo Wu, Liu Liu. [doi]
- Feature Splatting for Better Novel View Synthesis with Low OverlapTomás Berriel Martins, Javier Civera 0001. [doi]
- STPose: 6D object pose estimation network based on sparse attention and cross-layer connectionShihao Chen, Xiaobing Li, Keduo Yan, Yong Li 0028, Dongxu Gao. [doi]
- Open-World Semi-Supervised Learning under Compound Distribution ShiftsShijia Xu, Lin Zhao 0003, Jialiang Tang, Guangyu Li, Chen Gong 0002. [doi]
- Synthetic-to-Real Domain Generalized Semantic Segmentation for 3D Indoor Point CloudsYuyang Zhao, Na Zhao 0004, Gim Hee Lee. [doi]
- TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-TrainingLi Li 0092, Tanqiu Qiao, Hubert P. H. Shum, Toby P. Breckon. [doi]
- Privacy-preserving datasets by capturing feature distributions with Conditional VAEsFrancesco Di Salvo, David Tafler, Sebastian Doerrich, Christian Ledig. [doi]
- Spike-SLR: An Energy-efficient Parallel Spiking Transformer for Event-based Sign Language RecognitionXinxu Lin, Mingxuan Liu, Kezhuo Liu, Hong Chen 0002. [doi]
- Frequency Decomposition to Tap the Potential of Single Domain for GeneralizationHongjing Niu, Qingyue Yang, Pengfei Xia, Wei Zhang 0251, Bin Li 0025, Feng Zhao 0004. [doi]
- 3D Point Cloud Network Pruning: When Some Weights Do not MatterAmrijit Biswas, Md. Ismail Hossain, Mirza M. Lutfe Elahi, Ali Cheraghian, Fuad Rahman 0001, Nabeel Mohammed, Shafin Rahman. [doi]
- Hierarchical Prompt Learning for Scene Graph GenerationXuhan Zhu, Yifei Xing 0001, Ruiping Wang 0001, Yaowei Wang 0001, Xiangyuan Lan. [doi]
- FastForensics: Efficient Two-Stream Design for Real-Time Image Manipulation DetectionYangxiang Zhang, Yuezun Li, Ao Luo, Jiaran Zhou, Junyu Dong. [doi]
- Interpretable Long-term Action Quality AssessmentXu Dong, Xinran Liu, Wanqing Li 0001, Anthony Adeyemi-Ejeye, Andrew Gilbert. [doi]
- Optimising Diffusion Models for Histopathology Image SynthesisVictoria Porter, Richard Gault, Stephanie G. Craig, Jacqueline A. James. [doi]
- Layout Free Scene Graph to Image GenerationRameshwar Mishra, A. Venkata Subramanyam. [doi]
- Neural Collapse Inspired Contrastive Continual LearningAntoine Montmaur, Nicolas Larue, Ngoc-Son Vu. [doi]
- A Novel Divide and Merge Approach for Improved Classification of Functional DataWei Zhao, Xiao-Jun Zeng, Chengdong Shi, Ching-Hsun Tseng, Yue Chang. [doi]
- Text-Guided Mixup Towards Long-Tailed Image CategorizationRichard Franklin, Jiawei Yao, Deyang Zhong, Qi Qian 0001, Juhua Hu. [doi]
- A self-supervised and adversarial approach to hyperspectral demosaicking and RGB reconstruction in surgical imagingPeichao Li, Oscar MacCormac, Jonathan Shapey, Tom Vercauteren. [doi]
- CSAD: Unsupervised Component Segmentation for Logical Anomaly DetectionYu-Hsuan Hsieh, Shang-Hong Lai. [doi]
- The Attempt on Combining Three Talents by KD with Enhanced Boundary in Co-Salient Object DetectionZiyi Cao, Shengye Yan, Wei Zheng. [doi]
- A Prototype Unit for Image De-raining using Time-Lapse DataJaeHoon Cho, Minjung Yoo, Jini Yang, Sunok Kim. [doi]
- Align-DETR: Enhancing End-to-end Object Detection with Aligned LossZhi Cai, Songtao Liu, Guodong Wang, Zeming Li, Zheng Ge, Xiangyu Zhang 0005, Di Huang 0001. [doi]
- ControlEdit: A MultiModal Local Clothing Image Editing MethodDi Cheng, Yingjie Shi, Shixin Sun, Jiafu Zhang, Weijing Wang, Yu Liu. [doi]
- Linear Calibration Approach to Knowledge-free Group Robust ClassificationRyota Ishizaki, Shunya Yamagami, Yuta Goto, Go Irie. [doi]
- RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-GuidanceAvideep Mukherjee, Soumya Banerjee, Piyush Rai, Vinay P. Namboodiri. [doi]
- SagaGAN: Style Applied using Gram matrix Attribution based on StarGAN v2Yongseon Yoo, Seonggyu Kim, Jong-Min Lee. [doi]
- Label Smoothing++: Enhanced Label Regularization for Training Neural NetworksSachin Chhabra, Hemanth Venkateswara, Baoxin Li. [doi]
- No Captions, No Problem: Captionless 3D-CLIP Alignment with Hard Negatives via CLIP Knowledge and LLMsCristian Sbrolli, Matteo Matteucci. [doi]
- SR+Codec: a Benchmark of Super-Resolution for Video Compression Bitrate ReductionEvgeney Bogatyrev, Ivan Molodetskikh, Dmitriy S. Vatolin. [doi]
- Alignment-aware Patch-level Routing for Dynamic Video Frame InterpolationBan Chen, Xin Jin, Longhai Wu, Jie Chen, Ilhyun Cho, Cheul-Hee Hahm. [doi]
- Multi-Modal Information Bottleneck Attribution with Cross-Attention GuidancePauline Bourigault, Emmanuelle Bourigault, Danilo P. Mandic. [doi]
- Interpretable Representation Learning from Videos using Nonlinear PriorsMarian Longa, João F. Henriques. [doi]
- Anchor-Based Masked Generative Distillation for Pixel-Level Prediction TasksXie Yu, Wentao Zhang. [doi]
- Trimming the Fat: Efficient Compression of 3D Gaussian Splats through PruningMuhammad Salman Ali, Maryam Qamar, Sung-Ho Bae, Enzo Tartaglione. [doi]
- GeoFormer: A Multi-Polygon Segmentation TransformerMaxim Khomiakov, Michael Riis Andersen, Jes Frellsen. [doi]
- InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular DepthCho-Ying Wu, Quankai Gao, Chin-Cheng Hsu, Te-Lin Wu, Jing-Wen Chen, Ulrich Neumann. [doi]
- LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention MapsAndrey Palaev, Adil Khan 0001, Syed M. Ahsan Kazmi. [doi]
- VLAVAD: Vision-Language Models Assisted Unsupervised Video Anomaly DetectionChangkang Li, Yalong Jiang. [doi]
- Adapting MIMO video restoration networks to low latency constraintsValéry Dewil, Zhe Zheng, Arnaud Barral, Lara Raad, Nao Nicolas, Ioannis Cassagne, Jean-Michel Morel, Gabriele Facciolo, Bruno Galerne, Pablo Arias 0001. [doi]
- Putting the Segment Anything Model to the Test with 3D Knee MRI - A Comparison with State-of-the-Art PerformanceOliver Mills, Nishant Ravikumar, Philip G. Conaghan, Samuel D. Relton. [doi]
- On Evaluating Adversarial Robustness of Volumetric Medical Segmentation ModelsHashmat Shadab Malik, Numan Saeed, Asif Hanif, Muzammal Naseer, Mohammad Yaqub, Salman Khan 0001, Fahad Shahbaz Khan. [doi]
- Kernel Representation for Dynamic NetworksYichen Zhou, Teck Khim Ng. [doi]
- PawFACS: Leveraging Semi-Supervised Learning for Pet Facial Action RecognitionAnandavardhan Hegde, Sudha Velusamy, Narayan Kothari, Aman Bahuguna, Apnesh Rawat, Hema Sathiamurthy, Ankit Raja. [doi]
- A Revisit to the Decoder for Camouflaged Object DetectionSeung-Woo Ko, Joopyo Hong, Suyoung Kim, Seungjai Bang, Sungzoon Cho, Nojun Kwak, Hyung-Sin Kim, Joonseok Lee. [doi]
- Group Activity Recognition via Spatio-Temporal Reasoning of Key InstancesHaoting He, Yaochen Li, Yutong Wang, Gaojie Li, Wei Guo, Runlin Zou. [doi]
- MeTTA: Single-View to 3D Textured Mesh Reconstruction with Test-Time AdaptationKim Yu-Ji, Hyunwoo Ha, Kim Youwang, Jaeheung Surh, Hyowon Ha, Tae Hyun Oh. [doi]
- Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object DetectionYunsong Wang, Na Zhao 0004, Gim Hee Lee. [doi]
- Vision-Language Guidance for LiDAR-based Unsupervised 3D Object DetectionChristian Fruhwirth-Reisinger, Wei Lin 0019, Dusan Malic, Horst Bischof, Horst Possegger. [doi]
- AUPIMO: Redefining Anomaly Localization Benchmarks with High Speed and Low ToleranceJoão P. C. Bertoldo, Dick Ameln, Ashwin Vaidya, Samet Akcay. [doi]
- DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment AnimationRaquel Vidaurre, Elena Garces 0001, Dan Casas. [doi]
- DAVINCI: A Single-Stage Architecture for Constrained CAD Sketch InferenceAhmet Serdar Karadeniz, Dimitrios Mallis, Nesryne Mejri, Kseniya Cherenkova, Anis Kacem 0001, Djamila Aouada. [doi]
- Knowledge Distillation with Global Filters for Efficient Human Pose EstimationKaushik Bhargav Sivangi, Fani Deligianni. [doi]
- Learning Object Placement via Convolution Scoring AttentionYibin Wang, Yuchao Feng, Jianwei Zheng 0001. [doi]
- AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance FieldRong Liu, Rui Xu, Yue Hu, Meida Chen, Andrew Feng. [doi]
- Outlier detection by ensembling uncertainty with negative objectnessAnja Delic, Matej Grcic, Sinisa Segvic. [doi]
- MSA2Net: Multi-scale Adaptive Attention-guided Network for Medical Image SegmentationSina Ghorbani Kolahi, Seyed Kamal Chaharsooghi, Toktam Khatibi, Afshin Bozorgpour, Reza Azad, Moein Heidari, Ilker Hacihaliloglu, Dorit Merhof. [doi]
- COSMo: CLIP Talks on Open-Set Multi-Target Domain AdaptationMunish Monga, Sachin Kumar Giroh, Ankit Jha, Mainak Singha, Biplab Banerjee, Jocelyn Chanussot. [doi]
- Retinex-Inspired Cooperative Game Through Multi-Level Feature Fusion for Robust, Universal Image EnhancementRuiqi Mao, Rongxin Cui. [doi]
- Gaussian Splatting in Mirrors: Reflection-aware Rendering via Virtual Camera OptimizationZihan Wang, Shuzhe Wang, Matias Turkulainen, Junyuan Fang, Juho Kannala. [doi]
- Discovering an Image-Adaptive Coordinate System for Photography ProcessingZiteng Cui, Lin Gu 0003, Tatsuya Harada. [doi]
- Advancing Anomaly Detection: The IDW dataset and MC algorithmAlexander D. J. Taylor, Jonathan James Morrison, Phillip Tregidgo, Neill D. F. Campbell. [doi]
- Sign Stitching: A Novel Approach to Sign Language ProductionHarry Walsh, Ben Saunders, Richard Bowden. [doi]
- Generalizing Teacher Networks for Effective Knowledge Distillation Across Student ArchitecturesKuluhan Binici, Weiming Wu, Tulika Mitra. [doi]
- SAE: Single Architecture Ensemble Neural NetworksMartin Ferianc, Hongxiang Fan, Miguel R. D. Rodrigues. [doi]
- Measuring Physical Plausibility of 3D Human Poses Using Physics SimulationNathan Louis, Mahzad Khoshlessan, Jason J. Corso. [doi]
- GLPI: A Global Layered Prompt Integration approach for Explicit Visual PromptYufei Gao, Bin Fu, Lei Shi 0001, Chengming Liu, Yucheng Shi. [doi]
- Deep Learning for GPS-Denied SAR Image Focusing and Vehicle Trajectory EstimationChristopher Beam, Andrew R. Willis, Kevin M. Brink. [doi]
- Pseudo Labelling for Enhanced Masked Auto EncodersSrinivasa Rao Nandam, Sara Atito 0001, Zhenhua Feng 0001, Josef Kittler, Muhammad Awais 0001. [doi]
- Federated Learning for Face Recognition via Intra-subject Self-supervised LearningHansol Kim, Hoyeol Choi, Youngjun Kwak. [doi]
- Projected Stochastic Gradient Descent with Quantum Annealed Binary GradientsMaximilian Krahn, Michele Sasdelli, Frances Fengyi Yang, Vladislav Golyanik, Juho Kannala, Tat-Jun Chin, Tolga Birdal. [doi]
- Learning conditionally untangled latent spaces using Fixed Point IterationVictor Enescu, Hichem Sahbi. [doi]
- SceneSAM: Integrating 2D Labels for Weakly Supervised 3D Scene UnderstandingJulius Körner, Dogu Tamgac, Dávid Rozenberszki. [doi]
- Enhancing Radiology Report Generation: The Impact of Locally Grounded Vision and Language TrainingSergio Sánchez Santiesteban, Muhammad Awais 0001, Yi-Zhe Song, Josef Kittler. [doi]
- Learning Scene-Goal-Aware Motion Representation for Trajectory PredictionZiyang Ren, Ping Wei 0001, Haowen Tang, Huan Li, Jin Yang. [doi]
- Advancing Medical Image Segmentation: Morphology-Driven Learning with Diffusion TransformerSungmin Kang, Jaeha Song, Jihie Kim. [doi]
- GazeHELL: Gaze Estimation with Hybrid Encoders and Localised Losses with weighingShubham Dokania, Vasudev Singh, Shuaib Ahmed. [doi]
- Multi-Scope Representation Learning for Causal Relation Discovery with new Challenging DatasetsJiageng Zhu, Hanchen Xie, Jianhua Wu, Mohamed E. Hussein 0001, Mahyar Khayatkhoei, Jiazhi Li 0001, Wael AbdAlmageed. [doi]
- Taming the Tail: Leveraging Asymmetric Loss and Padé Approximation to Overcome Long-Tailed Class ImbalancePankhi Kashyap, Pavni Tandon, Sunny Gupta, Abhishek Tiwari, Ritwik Kulkarni, Kshitij Sharad Jadhav. [doi]
- Cost-Sensitive Learning for Long-Tailed Temporal Action SegmentationZhanzhong Pang, Fadime Sener, Shrinivas Ramasubramanian, Angela Yao. [doi]
- iHAST: Integrating Hybrid Attention for Super-Resolution in Spatial TranscriptomicsXi Li, Jing Zhang 0062, Ziheng Duan, Yi Dai, Siwei Xu. [doi]
- GN-FR: Generalizable Neural Radinace Fields for Flare RemovalGopi Raju Matta, Rahul Siddartha, Rongali Simhachala Venkata Girish, Sumit Sharma 0014, Kaushik Mitra. [doi]
- FLARE up your data: Diffusion-based Augmentation Method in Astronomical ImagingMohammed Talha Alam, Raza Imam, Mohsen Guizani, Fakhri Karray. [doi]
- Depth-Guided Privacy-Preserving Visual Localization Using 3D Sphere CloudsHeeJoon Moon, Jongwoo Lee, Jeong Gon Kim, Je Hyeong Hong. [doi]
- Few-Shot Classification of Interactive Activities of Daily Living (InteractADL)Zane Durante, Robathan Harries, Edward Vendrow, Zelun Luo, Yuta Kyuragi, Kazuki Kozuka, Li Fei-Fei 0001, Ehsan Adeli 0001. [doi]
- Explaining Multi-modal Large Language Models by Analyzing their Vision PerceptionLoris Giulivi, Giacomo Boracchi. [doi]
- DRAFT: Direct Radiance Fields Editing with Composable OperationsZhihan Cai, Kailu Wu, Dapeng Cao, Feng Chen, Kaisheng Ma. [doi]
- 3D Blur Kernel on Gaussian SplattingYongchao Lin, Xiangdong Su, Yuhan Yang. [doi]
- Learning to Segment Publicly Accessible Green Spaces with Visual and Semantic DataJian Gao, Niall McLaughlin, Joanna Sara Valson, Neil Anderson, Ruth F. Hunter. [doi]
- Toward Highly Efficient Semantic-Guided Machine Vision for Low-Light Object DetectionXin Feng, Junxian Zeng, Siping Wang, Zhenwei He. [doi]
- VEMIC: View-aware Entropy model for Multi-view Image CompressionSusmija Jabbireddy, Davit Soselia, Max Ehrlich, Christopher A. Metzler, Amitabh Varshney. [doi]
- Guided Attention for Interpretable Motion CaptioningKarim Radouane, Julien Lagarde, Sylvie Ranwez, Andon Tchechmedjiev. [doi]
- A Multimodal Network on Handwritten Chinese Character Error CorrectionHaizhao Sun, Yu Ning, Xu Ji, Chuang Zhang, Ming Wu 0001. [doi]
- Noise-Tolerant Few-Shot Unsupervised Adapter for Vision-Language ModelsEman Ali, Muhammad Haris Khan. [doi]
- A Super-pixel-based Approach to the Stable Interpretation of Neural NetworksShizhan Gong, Jingwei Zhang, Qi Dou 0001, Farzan Farnia. [doi]
- Drawing Insights: Sequential Representation Learning in ComicsSam Titarsolej, Neil Cohn, Nanne van Noord. [doi]
- Horospherical Learning with Smart PrototypesPaul Berg, Björn Michele, Minh-Tan Pham, Laetitia Chapel, Nicolas Courty. [doi]
- Unsupervised Point Cloud Registration with Self-DistillationChristian Löwens, Thorben Funke, André Wagner, Alexandru Paul Condurache. [doi]
- Complete the Feature Space: Diffusion-Based Fictional ID Generation for Face RecognitionMyeong-Yeon Yi, Dongjae Lee, Naeun Ko, Yonghyun Jeong, Sang-goo Lee, Seunggyu Chang. [doi]
- A Deep Belief Network Approach to Scalable Compression of Light Field Data for Auto-Stereoscopic DisplaysSally Khaidem, Mansi Sharma. [doi]
- MMPrune4U: Regularizing Multimodal Feature Distortion in Weight Pruning for Deep Neural Network CompressionSudip Das, Kaixin Xu, Nushrat Hussain, Ziyuan Zhao, Arindam Das, Weisi Lin, Ujjwal Bhattacharya. [doi]
- Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud SequencesRui Yu, Runkai Zhao, Cong Nie, Heng Wang, Siyu Li, Songhao Zhu. [doi]
- S³-Match: Common-View Aligned Image Matching via Self-Supervised Keypoint SelectionShizhen Li, Jingcheng Liu, Jianwu Fang, Dezheng Gao, Jianru Xue. [doi]
- A Learnable Color Correction Matrix for RAW ReconstructionAnqi Liu, Shiyi Mu, Shugong Xu. [doi]
- CLIP with Generative Latent Replay: a Strong Baseline for Incremental LearningEmanuele Frascaroli, Aniello Panariello, Pietro Buzzega, Lorenzo Bonicelli, Angelo Porrello, Simone Calderara. [doi]
- Blocks as Probes: Dissecting Categorization Ability of Large Multimodal ModelsBin Fu, Qiyang Wan, Jialin Li, Ruiping Wang 0001, Xilin Chen 0001. [doi]
- Deep Unfolding Network with Spatial-spectral Perception Enhanced for Pan-sharpeningMengjiao Zhao, Mengting Ma, Xiangdong Li, Ao Gao, Siyang Song, Wei Zhang 0243. [doi]
- Sequential Amodal Segmentation via Cumulative Occlusion LearningJiayang Ao, Qiuhong Ke, Krista A. Ehinger. [doi]
- Revisiting Image Captioning Training Paradigm via Direct CLIP-based OptimizationNicholas Moratelli, Davide Caffagni, Marcella Cornia, Lorenzo Baraldi 0002, Rita Cucchiara. [doi]
- QUD: Unsupervised Knowledge Distillation for Deep Face RecognitionJan Niklas Kolf, Naser Damer, Fadi Boutros. [doi]
- Seg-HGNN: Unsupervised and Light-Weight Image Segmentation with Hyperbolic Graph Neural NetworksDebjyoti Mondal, Rahul Mishra, Chandan Kumar Pandey. [doi]
- Recovering SLAM Tracking Lost by Trifocal Pose Estimation using GPU-HC++Chiang-Heng Chien, Ahmad Abdelfattah, Benjamin B. Kimia. [doi]
- Uni-Mlip: Unified Self-Supervision for Medical Vision Language Pre-trainingAmeera Ali Bawazir, Kebin Wu, Wenbin Li. [doi]
- AutoDOM: Automated Dimension Overlay for Enhanced Measurement-GuidancePushpendu Ghosh, Aniket Joshi, Soumyajit Chowdhury, Promod Yenigalla. [doi]
- Region-based Entropy Separation for One-shot Test-Time AdaptationKodai Kawamura, Shunya Yamagami, Go Irie. [doi]
- Leveraging Inductive Bias in ViT for Medical Image DiagnosisJungmin Ha, Euihyun Yoon, Sungsik Kim, Jinkyu Kim, Jaekoo Lee. [doi]
- Spatio-Temporal Transformer with Rotary Position Embedding and Bone Priors for 3D Human Pose EstimationCheng Chen, Jiang Liu, Liaoyuan Zeng, Fang Duan, Sean McGrath 0001, Tian Dan. [doi]
- D³Nav: Data-Driven Driving Agents for Autonomous Vehicles in Unstructured TrafficAditya Nalgunda Ganesh, Gowri Srinivasa. [doi]
- MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and MotionAngel Villar-Corrales, Moritz Austermann, Sven Behnke. [doi]
- Enhancing Cardiovascular Disease Prediction through Multi-Modal Self-Supervised LearningFrancesco Girlanda, Olga V. Demler, Bjoern H. Menze, Neda Davoudi. [doi]
- IncreLM: Incremental 3D Line MappingXulong Bai, Hainan Cui, Shuhan Shen. [doi]
- Learning to Project for Cross-Task Knowledge DistillationDylan Auty, Roy Miles, Benedikt Kolbeinsson, Krystian Mikolajczyk. [doi]
- Box for Mask and Mask for Box: weak losses for multi-task partially supervised learningHoàng-Ân Lê, Paul Berg, Minh-Tan Pham. [doi]
- Time-conditioned Illumination for Inverse Rendering of Outdoor ScenesXiaoxue Chen, Hao Zhao 0002, Guyue Zhou, Ya-Qin Zhang. [doi]
- ATLANTIS: A Framework for Automated Targeted Language-guided Augmentation Training for Robust Image SearchInderjeet Singh, Roman Vainshtein, Alon Zolfi, Asaf Shabtai, Tu Bui, Jonathan Brokman, Omer Hofman, Fumiyoshi Kasahara, Kentaro Tsuji, Hisashi Kojima. [doi]
- Adaptive Weighted Co-Learning for Cross-Domain Few-Shot LearningAbdullah Alchihabi, Marzi Heidari, Yuhong Guo. [doi]
- RETRO: Reusing teacher projection head for efficient embedding distillation on Lightweight Models via Self-supervised LearningKhanh-Binh Nguyen, Chae Jung Park. [doi]
- Hybrid-CSR: Coupling Explicit and Implicit Reconstruction of Cortical SurfaceShanlin Sun, Tung Le, Pooya Khosravi, Chenyu You, Kun Han, Haoyu Ma, Deying Kong, Xiangyi Yan, Xiaohui Xie. [doi]
- CVAM-Pose: Conditional Variational Autoencoder for Multi-Object Monocular Pose EstimationJianyu Zhao, Wei Quan, Bogdan J. Matuszewski. [doi]
- Guidance-base Diffusion Models for Improving Photoacoustic Image QualityTatsuhiro Eguchi, Shumpei Takezaki, Mihoko Shimano, Takayuki Yagi, Ryoma Bise. [doi]
- Disparity Estimation Using a Quad-Pixel SensorZhuofeng Wu 0003, Doehyung Lee, Zihua Liu, Kazunori Yoshizaki, Yusuke Monno, Masatoshi Okutomi. [doi]
- MixMask: Revisiting Masking Strategy for Siamese ConvNetsKirill Vishniakov, Eric P. Xing, Zhiqiang Shen. [doi]
- Channel-Partitioned Windowed Attention And Frequency Learning for Single Image Super-ResolutionDinh-Phu Tran, Dao Duy Hung, Daeyoung Kim 0001. [doi]
- AggSS: An Aggregated Self-Supervised Approach for Class Incremental LearningJayateja Kalla, Soma Biswas. [doi]
- Benchmarking and Optimizing Federated Learning with Hardware-related MetricsKai Pan, Yapeng Tian, Yinhe Han 0001, Yiming Gan. [doi]
- TrakAthlete4D: Multi-View On-Field Player Position Tracking in SportsNitish Agarwal, Steven Cadavid. [doi]
- Annotation by Clicks: A Point-Supervised Contrastive Variance Method for Medical Semantic SegmentationQing En, Yuhong Guo. [doi]
- APTPose: Anatomy-aware Pre-Training for 3D Human Pose EstimationQing-Wen Yang, Kai-Wen Duan, Ting-Yi Lu, Kevin Lin, Cheng-Yen Yang, Lijuan Wang, Jenq-Neng Hwang, Shang-Hong Lai. [doi]
- Balancing Calibration and Performance: Stochastic Depth in Segmentation BNNsLinghong Yao, Denis Hadjivelichkov, Andromachi Maria Delfaki, Yuanchang Liu, Brooks Paige, Dimitrios Kanoulas. [doi]
- InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-task LearningBabak Ehteshami Bejnordi, Gaurav Kumar, Amelie Royer, Christos Louizos, Tijmen Blankevoort, Mohsen Ghafoorian. [doi]
- SOFI: Multi-Scale Deformable Transformer for Camera Calibration with Enhanced Line QueriesSebastian Janampa, Marios Pattichis. [doi]
- FILS: Self-Supervised Video Feature Prediction In Semantic Language SpaceMona Ahmadian, Frank Guerin, Andrew Gilbert. [doi]
- Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo PairsSadra Safadoust, Fabio Tosi, Fatma Güney, Matteo Poggi. [doi]
- SAM Helps SSL: Mask-guided Attention Bias for Self-supervised LearningKensuke Taguchi, Takehiko Kawai, Wataru Imaeda, Hironobu Fujiyoshi. [doi]
- Spatiotemporal Vision Transformer for Weakly Supervised Dense Prediction of Dynamic Brain MapsBehnam Kazemivash, Armin Iraji, Sergey M. Plis, Vince D. Calhoun. [doi]
- MotionMAE: Self-supervised Video Representation Learning with Motion-Aware Masked AutoencodersHaosen Yang 0003, Deng Huang, Bin Wen, Jiannan Wu, Hongxun Yao, Yi Jiang, Xiatian Zhu, Zehuan Yuan. [doi]
- HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene ReconstructionHaoyu Zhao, Xingyue Zhao, Lingting Zhu, Weixi Zheng, Yongchao Xu. [doi]
- As Firm As Their Foundations: Creating Transferable Adversarial Examples Across Downstream Tasks with CLIPAnjun Hu, Jindong Gu, Francesco Pinto, Konstantinos Kamnitsas, Philip Torr 0001. [doi]
- MoManifold: Learning to Measure 3D Human Motion via Decoupled Joint Acceleration ManifoldsZiqiang Dang, Tianxing Fan, Boming Zhao, Xujie Shen, Lei Wang, Guofeng Zhang 0001, Zhaopeng Cui. [doi]
- Multi-modal Crowd Counting via Modal EmulationChenhao Wang, Xiaopeng Hong, Zhiheng Ma, Yupeng Wei, Yabin Wang, Xiaopeng Fan. [doi]
- PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB ImagesYiheng Xiong, Angela Dai. [doi]
- Painterly Image Harmonization via Bi-Transformation with Dynamic KernelsZhangliang Sun, Hui Zhang. [doi]
- Decoupling Forgery Semantics for Generalizable Deepfake DetectionWei Ye, Xinan He, Feng Ding 0007. [doi]
- A self-supervised cyclic neural-analytic approach for novel view synthesis and 3D reconstructionDragos Costea, Alina Marcu, Marius Leordeanu. [doi]
- Rectifying Shortcut Learning through Cellular Differentiation in Deep Learning NeuronsHongjing Niu, Hanting Li, Guoping Wu, Bin Li 0025, Feng Zhao 0004. [doi]
- AR-TTA: A Simple Method for Real-World Continual Test-Time AdaptationDamian Sójka, Bartlomiej Twardowski, Tomasz Trzcinski, Sebastian Cygert. [doi]
- Training-Free Zero-Shot Semantic Segmentation with LLM RefinementYuantian Huang, Satoshi Iizuka, Kazuhiro Fukui. [doi]
- Textual Attention RPN for Open-Vocabulary Object DetectionTae-Min Choi, Inug Yoon, Jong-Hwan Kim 0001, Juyoun Park. [doi]
- Direct-Sum Approach to Integrate Losses Via Classifier SubspaceTakumi Kobayashi 0001. [doi]
- BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth EstimationKieran Saunders, Luis J. Manso, George Vogiatzis. [doi]
- Efficient Data Source Relevance Quantification for Multi-Source Neural NetworksJakob Gawlikowski, Nina Maria Gottschling. [doi]
- Difflare: Removing Image Lens Flare with Latent Diffusion ModelsTianwen Zhou, Qihao Duan, Zitong Yu. [doi]
- Beyond Face Matching: A Facial Traits based Privacy Score for Synthetic Face DatasetsRobero Leyva, Praveen Selvaraj, Andrew Elliott, Gregory Epiphaniou, Carsten Maple. [doi]
- Improving Depth Gradient Continuity in Transformers: A Comparative Study on Monocular Depth Estimation with CNNJiawei Yao, Tong Wu, Xiaofeng Zhang. [doi]
- Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data RegimesDmitry Demidov, Abduragim Shtanchaev, Mihail Mihaylov, Mohammad Almansoori. [doi]
- SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive LearningHao Chen 0011, Jiaze Wang, Ziyu Guo, Jinpeng Li 0004, Donghao Zhou, Bian Wu, Chenyong Guan, Guangyong Chen, Pheng-Ann Heng. [doi]
- Spatial-Temporal NAS for Fast Surgical SegmentationMatthew Lee, Felix John Samuel Bragman, Ricardo Sanchez-Matilla, Imanol Luengo, Danail Stoyanov. [doi]
- Mumpy: Multilateral Temporal-view Pyramid Transformer for Video Inpainting DetectionYing Zhang, Yuezun Li, Bo Peng, Jiaran Zhou, Huiyu Zhou 0001, Junyu Dong. [doi]
- Are Sparse Neural Networks Better Hard Sample Learners?Qiao Xiao, Boqian Wu, Lu Yin 0006, Christopher Neil Gadzinski, Tianjin Huang, Mykola Pechenizkiy, Decebal Constantin Mocanu. [doi]
- AttEntropy: On the Generalization Ability of Supervised Semantic Segmentation Transformers to New Objects in New DomainsKrzysztof Baron-Lis, Matthias Rottmann, Annika Mütze, Sina Honari, Pascal Fua, Mathieu Salzmann. [doi]
- Layer-wise Learning of CNNs by Self-tuning Learning Rate and Early Stopping at Each LayerMelika Sadeghi Tabrizi, Ali Karimi, Ahmad Kalhor, Babak Nadjar Araabi, Mona Ahmadian. [doi]
- MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAMRenwu Li, Wenjing Ke, Dong Li 0025, Lu Tian, Emad Barsoum. [doi]
- Examining the Threat Landscape: Foundation Models and Model StealingAnkita Raj, Deepankar Varma, Chetan Arora 0001. [doi]
- Recovering Global Data Distribution Locally in Federated LearningZiyu Yao 0001. [doi]
- Improving Object Detection via Local-global Contrastive LearningDanai Triantafyllidou, Sarah Parisot, Ales Leonardis, Steven McDonagh 0001. [doi]
- May the Forgetting Be with You: Alternate Replay for Learning with Noisy LabelsMonica Millunzi, Lorenzo Bonicelli, Angelo Porrello, Jacopo Credi, Petter N. Kolm, Simone Calderara. [doi]
- Into the Fog: Evaluating Robustness of Multiple Object TrackingNadezda Kirillova, Muhammad Jehanzeb Mirza, Horst Bischof, Horst Possegger. [doi]
- DisCoM-KD: Cross-Modal Knowledge Distillation via Disentanglement Representation and Adversarial LearningDino Ienco, Cássio Fraga Dantas. [doi]
- HDRSplat: Gaussian Splatting for High Dynmaic Range 3D Scene Reconstruction from Raw ImagesShreyas Singh, Aryan Garg, Kaushik Mitra. [doi]
- Motion Avatar: Generate Human and Animal Avatars with Arbitrary MotionZeyu Zhang 0006, Yiran Wang, Biao Wu 0006, Shuo Chen, Zhiyuan Zhang, Shiya Huang, Wenbo Zhang 0009, Meng Fang, Ling Chen 0006, Yang Zhao 0019. [doi]
- Towards Better Zero-Shot Anomaly Detection under Distribution Shift with CLIPJiyao Gao, Chengxin He, Lei Duan, Jie Zuo. [doi]
- Prompting Diffusion Representations for Cross-Domain Semantic SegmentationRui Gong, Martin Danelljan, Han Sun, Julio Delgado Mangas, Nikolay Marin, Luc Van Gool. [doi]
- Text Removal In E-Commerce Images: A Comparison Of Inpainting MethodsHiya Roy, Björn Stenger. [doi]
- Rethinking Domain Adaptive Optic Disc and Cup Segmentation in Fundus Image through Dynamic Diffusion FlowCanran Li, Dongnan Liu, Weidong Cai 0001. [doi]
- Beyond Static and Dynamic Quantization - Hybrid Quantization of Vision TransformersPiotr Kluska, Florian Scheidegger, A. Cristiano I. Malossi, Enrique S. Quintana-Ortí. [doi]
- Content and Style Aware Audio-Driven Facial AnimationQingju Liu, Hyeongwoo Kim, Gaurav Bharaj. [doi]
- AISE: Adaptive Input Sampling for Explanation of Black-box ModelsEvgeny Tsykunov, Wonju Lee, Minje Park. [doi]
- SAM-EG: Segment Anything Model with Egde Guidance framework for efficient Polyp SegmentationQuoc-Huy Trinh, Hai Dang Nguyen, Bao-Tram Nguyen Ngoc, Debesh Jha, Ulas Bagci, Minh-Triet Tran. [doi]
- G3FA: Geometry-guided GAN for Face AnimationAlireza Javanmardi, Alain Pagani, Didier Stricker. [doi]
- When Text and Images Don't Mix: Bias-Correcting Language-Image Similarity Scores for Anomaly DetectionAdam Goodge, Bryan Hooi, Wee Siong Ng. [doi]
- Reconstructing Spheres by Fitting PlanesErol Ozgur, Mohammad Alkhatib, Youcef Mezouar, Adrien Bartoli. [doi]
- Task-Related Feature Enhancement Network for Neuronal Morphology ClassificationChunli Sun, Feng Zhao 0004. [doi]
- Drone-assisted Road Gaussian Splatting with Cross-view UncertaintySaining Zhang, Baijun Ye, Xiaoxue Chen, Yuantao Chen, Zongzheng Zhang, Cheng Peng, Yongliang Shi, Hao Zhao 0002. [doi]
- Towards Generative Class Prompt Learning for Fine-grained Visual RecognitionSoumitri Chattopadhyay, Sanket Biswas, Emanuele Vivoli, Josep Lladós 0001. [doi]