Abstract is missing.
- Adaptive Out-of-Distribution Detection with Coarse-to-Fine Grained RepresentationKohei Fukuda, Hiroaki Aizawa. 19-26 [doi]
- Distortion-Aware Adversarial Attacks on Bounding Boxes of Object DetectorsPham Phuc, Son Vuong, Khang Nguyen 0003, Tuan Dang. 27-38 [doi]
- Pose-Centric Motion Synthesis Through Adaptive Instance NormalizationOliver Hixon-Fisher, Jarek Francik, Dimitrios Makris 0001. 39-47 [doi]
- ConvKAN: Towards Robust, High-Performance and Interpretable Image ClassificationAchref Ouni, Chafik Samir, Yousef Bouaziz, Anis Fradi. 48-58 [doi]
- Latent Space Characterization of Autoencoder VariantsAnika Shrivastava, Renu Rameshan, Samar Agnihotri. 59-67 [doi]
- Beyond Labels: Self-Attention-Driven Semantic Separation Using Principal Component Clustering in Latent Diffusion ModelsFelix Stillger, Frederik Hasecke, Lukas Hahn, Tobias Meisen. 68-80 [doi]
- Experience Replay and Zero-Shot Clustering for Continual Learning in Diabetic Retinopathy DetectionGusseppe Bravo Rocca, Peini Liu, Jordi Guitart, Ajay Dholakia, David Ellison, Rodrigo M. Carrillo-Larco. 81-92 [doi]
- Detection of Door-Closing Defects by Learning from Physics-Based SimulationsRyoga Takahashi, Yota Yamamoto, Ryosuke Furuta, Yukinobu Taniguchi. 93-98 [doi]
- Leveraging Vision Language Models for Understanding and Detecting Violence in VideosJose Alejandro Avellaneda Gonzalez, Tetsu Matsukawa, Einoshin Suzuki. 99-113 [doi]
- Adaptive Prompt Tuning: Vision Guided Prompt Tuning with Cross-Attention for Fine-Grained Few-Shot LearningEric Brouwer, Jan Erik van Woerden, Gertjan J. Burghouts, Matias Valdenegro-Toro, Marco Zullich. 114-125 [doi]
- MetaToken: Detecting Hallucination in Image Descriptions by Meta ClassificationLaura Fieback, Jakob Spiegelberg, Hanno Gottschalk. 126-137 [doi]
- ReST: High-Precision Soccer Player Tracking via Motion Vector SegmentationFahad Majeed, Khaled Ahmed Lutf Al Thelaya, Nauman Ullah Gilal, Kamilla Swart-Arries, Marco Agus, Jens Schneider 0002. 138-149 [doi]
- Transformer or Mamba for Temporal Action Localization? Insights from a Comprehensive Experimental Comparison StudyZejian Zhang, Cristina Palmero, Sergio Escalera. 150-162 [doi]
- DeepSpace: Navigating the Frontier of Deepfake Identification Using Attention-Driven Xception and a Task-Specific SubspaceAyush Roy, Sk Mohiuddin, Maxim V. Minenko, Dmitrii I. Kaplun, Ram Sarkar. 163-172 [doi]
- Self-Supervised Iterative Refinement for Anomaly Detection in Industrial Quality ControlMuhammad Aqeel, Shakiba Sharifi, Marco Cristani, Francesco Setti. 173-183 [doi]
- VectorWeaver: Transformers-Based Diffusion Model for Vector Graphics GenerationIvan Jarsky, Maxim Kuzin, Valeria Efimova, Viacheslav Shalamov, Andrey Filchenkov. 184-195 [doi]
- Data-Free Dynamic Compression of CNNs for Tractable EfficiencyLukas Meiner, Jens Mehnert, Alexandru Paul Condurache. 196-208 [doi]
- Enhancing 3D Human Pose Estimation: A Novel Post-Processing MethodElham Iravani, Frederik Hasecke, Lukas Hahn, Tobias Meisen. 209-220 [doi]
- Temporally Accurate Events Detection Through Ball Possessor Recognition in SoccerMarc Peral, Guillem Capellera, Antonio Rubio 0001, Luis Ferraz, Francesc Moreno-Noguer, Antonio Agudo. 221-231 [doi]
- Improving Image Classification Tasks Using Fused Embeddings and Multimodal ModelsArtur A. M. Oliveira, Mateus Espadoto, Roberto Hirata Jr., Roberto M. Cesar Jr.. 232-241 [doi]
- Paint Blob Detection and Decoding for Identification of Honey BeesAndrea P. Gómez-Jaime, Luke Meyers, Josué A. Rodríguez-Cordero, José L. Agosto-Rivera, Tugrul Giray, Rémi Mégret. 242-250 [doi]
- Vision-Language In-Context Learning Driven Few-Shot Visual Inspection ModelShiryu Ueno, Yoshikazu Hayashi, Shunsuke Nakatsuka, Yusei Yamada, Hiroaki Aizawa, Kunihito Kato. 253-260 [doi]
- Action Tube Generation by Person Query Matching for Spatio-Temporal Action DetectionKazuki Omi, Jion Oshima, Toru Tamaki. 261-268 [doi]
- CodeSCAN: ScreenCast ANalysis for Video Programming TutorialsAlexander Naumann, Felix Hertlein, Jacqueline Höllig, Lucas Cazzonelli, Steffen Thoma. 269-277 [doi]
- Spiideo SoccerNet SynLoc: Single Frame World Coordinate Athlete Detection and Localization with Synthetic DataHåkan Ardö, Mikael G. Nilsson, Anthony Cioppa, Floriane Magera, Silvio Giancola, Haochen Liu, Bernard Ghanem, Marc Van Droogenbroeck. 278-285 [doi]
- Deep Image Clustering with Model-Agnostic Meta-LearningKim Bjerge, Paul Bodesheim, Henrik Karstoft. 286-297 [doi]
- Improving Periocular Recognition Accuracy: Opposite Side Learning Suppression and Vertical Image InversionMasakazu Fujio, Yosuke Kaga, Kenta Takahashi. 298-305 [doi]
- Segment-Level Road Obstacle Detection Using Visual Foundation Model Priors and Likelihood RatiosYoussef Shoeb, Nazir Nayal, Azarm Nowzad, Fatma Güney, Hanno Gottschalk. 306-315 [doi]
- Classifier Ensemble for Efficient Uncertainty Calibration of Deep Neural Networks for Image ClassificationMichael Schulze, Nikolas Ebert, Laurenz Reichardt, Oliver Wasenmüller. 316-323 [doi]
- Conditioned Generative AI for Synthetic Training of 6D Object Pose DetectionMathijs Lens, Aaron Van Campenhout, Toon Goedemé. 324-331 [doi]
- Deep Local Feature Matching Image Anomaly Detection with Patch Adaptive Average Pooling TechniqueAfshin Dini, Esa Rahtu. 332-339 [doi]
- CTypiClust: Confidence-Aware Typical Clustering for Budget-Agnostic Active Learning with Confidence CalibrationTakuya Okano, Yohei Minekawa, Miki Hayakawa. 340-347 [doi]
- Neural Network Meta Classifier: Improving the Reliability of Anomaly SegmentationJurica Runtas, Tomislav Petkovic. 348-355 [doi]
- New Paths in Document Data Augmentation Using Templates and Language ModelsLucas Wojcik, Luiz Coelho, Roger Granada, David Menotti. 356-366 [doi]
- Uncertainty Estimation for Super-Resolution Using ESRGANManiraj Sai Adapa, Marco Zullich, Matias Valdenegro-Toro. 367-374 [doi]
- Herbicide Efficacy Prediction Based on Object Segmentation of Glasshouse ImageryMajedaldein Almahasneh, Baihua Li, Haibin Cai, Nasir Rajabi, Laura Davies, Qinggang Meng. 375-382 [doi]
- Inductive Self-Supervised Dimensionality Reduction for Image RetrievalDeryk Willyan Biotto, Guilherme Henrique Jardim, Vinicius Atsushi Sato Kawai, Bionda Rozin, Denis Henrique Pinheiro Salvadeo, Daniel Carlos Guimarães Pedronette. 383-391 [doi]
- A Method for Detecting Hands Moving Objects from VideosRikuto Konishi, Toru Abe, Takuo Suganuma. 392-399 [doi]
- Rescuing Easy Samples in Self-Supervised PretrainingQin Wang, Kai Krajsek, Hanno Scharr. 400-409 [doi]
- Knowledge Amalgamation for Single-Shot Context-Aware Emotion RecognitionTristan Cladière, Olivier Alata, Christophe Ducottet, Hubert Konik, Anne-Claire Legrand. 410-419 [doi]
- Handling Drift in Industrial Defect Detection Through MMD-Based Domain AdaptationXuban Barberena, Fátima A. Saiz, Iñigo Barandiaran. 420-429 [doi]
- Beyond Data Augmentations: Generalization Abilities of Few-Shot Segmentation ModelsMuhammad Ahsan, Guy Ben-Yosef, Gemma Roig. 430-438 [doi]
- Membership Inference Attacks for Face Images Against Fine-Tuned Latent Diffusion ModelsLauritz Christian Holme, Anton Mosquera Storgaard, Siavash Arjomand Bigdeli. 439-446 [doi]
- Minimizing Number of Distinct Poses for Pose-Invariant Face RecognitionCarter Ung, Pranav Mantini, Shishir K. Shah. 447-455 [doi]
- VLLM Guided Human-Like Guidance Navigation GenerationMasaki Nambata, Tsubasa Hirakawa, Takayoshi Yamashita, Hirobobu Fujiyoshi, Takehito Teraguchi, Shota Okubo, Takuya Nanri. 456-463 [doi]
- CLIP-MDGAN: Multi-Discriminator GAN Using CLIP Task AllocationShonosuke Gonda, Fumihiko Sakaue, Jun Sato. 464-470 [doi]
- Simultaneous Estimation of Driving Intentions for Multiple Vehicles Using Video TransformerJunya Isogawa, Fumihiko Sakaue, Jun Sato. 471-477 [doi]
- Human Pose Estimation from an Extremely Low-Resolution Image Sequence by Pose Transition Embedding NetworkYasutomo Kawanishi, Hitoshi Nishimura, Hiroshi Murase. 478-485 [doi]
- Multi-Scale Foreground-Background Confidence for Out-of-Distribution SegmentationSamuel Marschall, Kira Maag. 486-496 [doi]
- Accuracy Improvement of Neuron Concept Discovery Using CLIP with Grad-CAM-Based Attention RegionsTakahiro Sannomiya, Kazuhiro Hotta. 497-502 [doi]
- Expanding Domain Coverage in Injection Molding Quality Inspection with Physically-Based Synthetic DataDominik Schraml, Gunther Notni. 503-510 [doi]
- Neighbor Embedding Projection and Graph Convolutional Networks for Image ClassificationGustavo Rosseto Leticio, Vinicius Atsushi Sato Kawai, Lucas Pascotti Valem, Daniel Carlos Guimarães Pedronette. 511-518 [doi]
- Graph Convolutional Networks and Particle Competition and Cooperation for Semi-Supervised LearningGustavo Rosseto Leticio, Matheus Henrique Jacob dos Santos, Lucas Pascotti Valem, Vinicius Atsushi Sato Kawai, Fabricio Aparecido Breve, Daniel Carlos Guimarães Pedronette. 519-526 [doi]
- Exploration and Validation of Specialized Loss Functions for Generative Visual-Thermal Image Domain TransferSimon Fischer, Benedikt Kottler, Eva Strauß, Dimitri Bulatov. 527-534 [doi]
- Semi-Supervised Anomaly Detection in Skin Lesion ImagesAlina Burgert, Babette Dellen, Uwe Jaekel, Dietrich Paulus. 535-541 [doi]
- Automatic Detection of the Driver Distractions Based on the Analysis of Face VideosArtur Urzedowski, Kazimierz Choros. 542-549 [doi]
- HandMvNet: Real-Time 3D Hand Pose Estimation Using Multi-View Cross-Attention FusionMuhammad Asad Ali, Nadia Robertini, Didier Stricker. 555-562 [doi]
- MuSt-NeRF: A Multi-Stage NeRF Pipeline to Enhance Novel View SynthesisSudarshan Raghavan Iyengar, Subash Sharma, Patrick Vandewalle. 563-573 [doi]
- Urban Re-Identification: Fusing Local and Global Features with Residual Masked Maps for Enhanced Vehicle Monitoring in Small DatasetsWilliam A. Ramirez, César A. Sierra Franco, Thiago Motta, Alberto Raposo 0001. 574-581 [doi]
- 2D Motion Generation Using Joint Spatial Information with 2CM-GPTRyota Inoue, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi. 582-590 [doi]
- Segmentation-Guided Neural Radiance Fields for Novel Street View SynthesisYizhou Li, Yusuke Monno, Masatoshi Okutomi, Yuuichi Tanaka, Seiichi Kataoka, Teruaki Kosiba. 591-597 [doi]
- ConMax3D: Frame Selection for 3D Reconstruction Through Concept MaximizationAkash Malhotra, Nacéra Seghouani, Gilbert Badaro, Christophe Blaya. 598-609 [doi]
- Improving Adaptive Density Control for 3D Gaussian SplattingGlenn Grubert, Florian Barthel, Anna Hilsmann, Peter Eisert. 610-621 [doi]
- Sensor Calibration and Data Analysis of the MuFoRa DatasetValentino Behret, Regina Kushtanova, Islam Fadl, Simon Weber, Thomas Helmer, Frank Palme. 622-631 [doi]
- Uncertainty and Feature-Based Weighted Loss for 3D Wheat Part SegmentationReena, John H. Doonan, Kevin Williams, Fiona M. K. Corke, Huaizhong Zhang, Yonghuai Liu. 632-641 [doi]
- D-PLS: Decoupled Semantic Segmentation for 4D-Panoptic-LiDAR-SegmentationMaik Steinhauser, Laurenz Reichardt, Nikolas Ebert, Oliver Wasenmüller. 645-650 [doi]
- Adaptable Distributed Vision System for Robot Manipulation TasksMarko Pavlic, Darius Burschka. 651-658 [doi]
- Real-Time Kinematic Positioning and Optical See-Through Head-Mounted Display for Outdoor Tracking: Hybrid System and Preliminary AssessmentMuhannad Ismael, Maël Cornil. 659-666 [doi]
- Noisemaker 3D: Comprehensive Framework for Mesh Noise GenerationVladimir Mashurov, Vasilii Latonov, Anastasia Martynova, Natalia Semenova. 667-674 [doi]
- Evaluating Homography Error for Accurate Multi-Camera Multi-Object Tracking of Dairy CowsShunpei Aou, Yota Yamamoto, Kazuaki Nakamura, Yukinobu Taniguchi. 675-682 [doi]
- FiDaSS: A Novel Dataset for Firearm Threat Detection in Real-World ScenesMurilo Santos Regio, Isabel H. Manssour. 683-690 [doi]
- Comparative Analysis of Deep Learning-Based Multi-Object Tracking Approaches Applied to Sports User-Generated VideosElton Alencar, Larissa Pessoa, Fernanda Costa, Guilherme Souza, Rosiane de Freitas. 691-698 [doi]
- Learning Neural Velocity Fields from Dynamic 3D Scenes via Edge-Aware Ray SamplingSota Ito, Yoshikazu Hayashi, Hiroaki Aizawa, Kunihito Kato. 699-706 [doi]
- 3DSES: An Indoor Lidar Point Cloud Segmentation Dataset with Real and Pseudo-Labels from a 3D ModelMaxime Mérizette, Nicolas Audebert, Pierre Kervella, Jérôme Verdun. 707-716 [doi]
- MAESTRO: A Full Point Cloud Approach for 3D Anomaly Detection Based on ReconstructionRemi Lhoste, Antoine Vacavant, Damien Delhay. 717-724 [doi]
- Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor ScenariosIryna Repinetska, Anna Hilsmann, Peter Eisert. 725-734 [doi]
- Efficient 3D Human Pose and Shape Estimation Using Group-Mix Attention in Transformer ModelsYushan Wang, Shuhei Tarashima, Norio Tagawa. 735-742 [doi]
- Leveraging Unreal Engine for UAV Object Tracking: The AirTrackSynth Synthetic DatasetMingyang Zhang, Kristof Van Beeck, Toon Goedemé. 743-750 [doi]
- Recovery of Detailed Posture and Shape from Motion Video Images by Deforming SMPLYumi Ando, Fumihiko Sakaue, Jun Sato. 751-757 [doi]
- Deep Learning-Powered Visual SLAM Aimed at Assisting Visually Impaired NavigationMarziyeh Bamdad, Hans-Peter Hutter, Alireza Darvishy. 758-765 [doi]
- Benchmarking Neural Rendering Approaches for 3D Reconstruction of Underwater EnvironmentsSalvatore Mario Carota, Alessandro Privitera, Daniele Di Mauro, Antonino Furnari, Giovanni Maria Farinella, Francesco Ragusa. 766-773 [doi]
- An Event Camera Simulator for Arbitrary Viewpoints Based on Neural Radiance FieldsDiego Hernández Rodríguez, Motoharu Sonogashira, Kazuya Kitano, Yuki Fujimura, Takuya Funatomi, Yasuhiro Mukaigawa, Yasutomo Kawanishi. 774-780 [doi]
- A Computer Vision Approach to Counting Farmed Fish in Flowing WaterMasanori Nishiguchi, Hitoshi Habe, Koji Abe, Masayuki Otani, Nobukazu Iguchi. 781-789 [doi]
- Shape from Mirrored Polarimetric Light FieldShunsuke Nakagawa, Takahiro Okabe, Ryo Kawahara. 790-796 [doi]
- PrIcosa: High-Precision 3D Camera Calibration with Non-Overlapping Field of ViewsOguz Kedilioglu, Tasnim Tabassum Nova, Martin Landesberger, Lijiu Wang, Michael Hofmann, Jörg Franke, Sebastian Reitelshöfer. 801-809 [doi]
- Fine-Grained Self-Localization from Coarse Egocentric Topological MapsDaiki Iwata, Kanji Tanaka 0003, Mitsuki Yoshida, Ryogo Yamamoto, Morishita Yuudai, Hiroki Tomoe. 810-819 [doi]
- GIFF: Graph Iterative Attention Based Feature Fusion for Collaborative PerceptionAhmed N. Ahmed, Siegfried Mercelis, Ali Anwar 0002. 820-829 [doi]
- SSGA: Synthetic Scene Graph Augmentation via Multiple Pipeline VariantsKenta Tsukahara, Ryogo Yamamoto, Kanji Tanaka 0003, Hiroki Tomoe. 833-840 [doi]
- Low Latency Pedestrian Detection Based on Dynamic Vision Sensor and RGB Camera FusionBingyu Huang, Gianni Allebosch, Peter Veelaert, Tim Willems, Wilfried Philips, Jan Aelterman. 841-850 [doi]
- Automated Individualization of Object Detectors for the Semantic Environment Perception of Mobile RobotsChristian Hofmann, Christopher May, Patrick Ziegler, Iliya Ghotbiravandi, Jörg Franke, Sebastian Reitelshöfer. 851-862 [doi]
- Online Detection of End of Take and Release Actions from Egocentric VideosAlessandro Sebastiano Catinello, Giovanni Maria Farinella, Antonino Furnari. 863-870
- Robotic Visual Attention Architecture for ADAS in Critical Embedded Systems for Smart VehiclesDiego Renan Bruno, William D'Abruzzo Martins, Rafael Alceste Berri, Fernando Santos Osório. 871-878 [doi]
- Defense Against Model Inversion Attacks Using a Dummy Recognition Model Trained with Synthetic SamplesYuta Kotsuji, Kazuaki Nakamura. 883-892 [doi]
- Optimum-Path Forest Ensembles to Estimate the Internal Decay in Urban TreesGiovani Candido, Luis Henrique Morelli, Danilo Samuel Jodas, Giuliana Del Nero Velasco, Reinaldo Araújo de Lima, Kelton Augusto Pontara da Costa, João Paulo Papa. 895-902 [doi]
- Coloring 3D Avatars with Single-ImagePin-Yuan Yang, Yu-Shan Deng, Chieh-Shan Lin, An-Chun Luo, Shih-Chieh Chang. 903-910 [doi]
- Internal State Estimation Based on Facial Images with Individual Feature Separation and Mixup AugmentationAyaka Asaeda, Noriko Takemura. 911-918 [doi]
- Disease Estimation Using Gait Videos by Separating Individual Features Based on Disentangled Representation LearningShiori Furukawa, Noriko Takemura. 919-925 [doi]
- Towards a Dataset for Paleographic Details in Historical Torah ScrollsLaura Frank, Germaine Götzelmann, Danah Tonne. 926-933 [doi]
- Efficient CNN-Based System for Automated Beetle Elytra Coordinates PredictionHojin Yoo, Dhanyapriya Somasundaram, Hyunju Oh. 934-941 [doi]
- Effectiveness of Cross-Model Learning Through View-Model Ensemble on Detection of Spatiotemporal EEG PatternsÖmer Muhammet Soysal, Iphy Emeka Kelvin, Muhammed Esad Oztemel. 942-949 [doi]
- A Multimodal Approach to Research Paper SummarizationPranav Bookanakere, Syeda Saniya, Syed Munzer Nouman, S. Pramath, Jayashree Rangareddy. 950-957 [doi]
- Differential Diagnosis of Brain Diseases Using Ensemble Learning and Explainable AINighat Bibi, Kathleen M. Curran, Jane Courtney. 958-964 [doi]
- Leveraging Affordable Solutions for Stereo Video Capture in Virtual Reality ApplicationsLeina Yoshida, Gustavo Camargo Domingues, Fabiana F. F. Peres, Claudio R. M. Mauricio, João Marcelo X. N. Teixeira. 965-971 [doi]
- Sleep-Stage Efficient Classification Using a Lightweight Self-Supervised ModelEldiane Borges dos Santos Durães, João Batista Florindo. 972-979 [doi]
- DeepCellCount: Cell Counting Using Two-Step Deep LearningSara Tesfamariam, Isah Abdullahi Lawal, Arda Durmaz, Jacob G. Scott. 980-985 [doi]
- Towards Safe Self-Stimulatory Behaviors in Autistic Children: HarmAlert4AutisticChildren (HA4AC)Aleenah Khan, Hassan Foroosh. 986-994 [doi]