Abstract is missing.
- A Framework for Vision-Based 3D Inspections for Maintenance Activities and Digital Twin IntegrationPanagiotis Vrachnos, Carlos Ramonell, Ilias Koulalis, Konstantinos Ioannidis, Irina Stipanovic, Stefanos Vrochidis. 1-7 [doi]
- Finding Video Shots for Immersive Journalism Through Text-to-Video SearchLyndon J. B. Nixon, Damianos Galanopoulos, Vasileios Mezaris. 1-6 [doi]
- Modeling Musical Knowledge With Quantum Bayesian NetworksFlorian Krebs, Hermann Fürntratt, Roland Unterberger, Franz Graf 0002. 1-6 [doi]
- Divexplore at Ivr4b 2024Mario Leopold, Klaus Schoeffmann. 1-7 [doi]
- From Controlled to Chaotic: Disparities in Laboratory vs Real-World Stress DetectionSimão Ferreira, Fátima Rodrigues 0001, Johanna Kallio, Filipe Coelho, Vesa Kyllönen, Nuno Rocha, Matilde A. Rodrigues, Elena Vildjiounaite. 1-7 [doi]
- Visione 5.0: Toward Evaluation With Novice UsersGiuseppe Amato 0001, Paolo Bolettieri, Fabio Carrara, Fabrizio Falchi, Claudio Gennaro, Nicola Messina. 1-6 [doi]
- SAM in the Pipeline: Transforming Axis-Aligned to Oriented Bounding Boxes for Superior Sperm DetectionPål Andreas Hoven Bentsen, Steven Hicks, Eric Jul, Pål Halvorsen, Vajira Thambawita. 1-7 [doi]
- A Behavior and Emotion Recognition Framework for Emotion-Aware Services in Physical SpacesSari Järvinen, Johanna Kallio, Johannes Peltola, Satu-Marja Mäkelä. 1-6 [doi]
- Video Shot Discovery Through Text2Video Embeddings in a News Analytics DashboardLyndon J. B. Nixon, Damianos Galanopoulos, Vasileios Mezaris, Alexander Hubmann-Haidvogel, Daniel Fischl, Arno Scharl. 1-5 [doi]
- Combining Image and Region Uncertainty-Based Active Learning for Melanoma SegmentationNicolas Martin, Jean-Pierre Chevallet, Philippe Mulhem, Georges Quénot. 1-7 [doi]
- Improving the Flexibility of Video Events Retrieval Through Dynamic Conditional Refinement With Multilingual CapabilitiesThang-Long Nguyen-Ho, Van-Tu Ninh, Minh-Triet Tran, Graham Healy, Cathal Gurrin. 1-4 [doi]
- A Survey on Graph Deep Representation Learning for Facial Expression RecognitionThéo Gueuret, Akrem Sellami, Chaabane Djeraba. 1-7 [doi]
- Taxonomap: an Interactive System for the Exploration and Explanation of Unsupervised Large-Scale News ClassificationSimon Ott, Daria Liakhovets, Mina Schütz, Medina Andresel, Moritz W. Rothmund-Burgwall, Armin Vogl, Heidi Scheichenbauer, Michael Suker, Alexander Schindler. 1-4 [doi]
- Fine-Grained Rebalancing of Datasets for Correct Demographic ClassificationAndrea Bozzitelli, Pia Cavasinni di Benedetto, Maria De Marsico, Xing Di, Vishal M. Patel. 1-7 [doi]
- A Quest Through Interconnected Datasets: Lessons From Highly-Cited ICASSP PapersCynthia C. S. Liem, Doga Tascilar, Andrew M. Demetriou. 1-8 [doi]
- Motion Consistency Constraint Map for Facial Expression SpottingOuala Ben Jemaa, Amel Aissaoui, Benjamin Allaert, Ioan Marius Bilasco. 1-7 [doi]
- A Concept Design for a Positive Mood Supporting ApplicationAurora Saibene, Riccardo Giussani, Claudia Rabaioli, Nicolò Dozio, Francesca Gasparini, Francesco Ferrise. 1-7 [doi]
- A Multi-Instance Learning Approach for Improving Knee Osteoarthritis Diagnosis From Mri DataMohamed Berrimi, Yun Xin Teoh, Aladine Chetouani, Lotfi Houam, Rachid Jennane. 1-6 [doi]
- MeshConv3D: Efficient Convolution and Pooling Operators for Triangular 3D MeshesGermain Bregeon, Marius Preda, Radu Ispas, Titus Zaharia. 1-6 [doi]
- Emvd Dataset: a Dataset of Extreme Vocal Distortion Techniques Used in Heavy MetalModan Tailleur, Julien Pinquier, Laurent Millot, Corsin Vogel, Mathieu Lagrange. 1-5 [doi]
- VidBasys: A User-Friendly Interactive Video Retrieval System for Novice Users in IVR4BThao-Nhu Nguyen, Quang-Linh Tran, Hoang Bao Le, Binh T. Nguyen 0001, Liting Zhou, Gareth J. F. Jones, Cathal Gurrin. 1-5 [doi]
- Leveraging Query Expansion and Reformulation for Image Retrieval With Large Language and Vision-Language ModelsSandrina Frunza, Stevan Rudinac, Cees Diks. 1-7 [doi]
- Weakly-Supervised Autism Severity Assessment in Long VideosAbid Ali 0002, Mahmoud Ali, Camilla Barbini, Séverine Dubuisson, Jean-Marc Odobez, François Brémond, Susanne Thümmler. 1-7 [doi]
- Fire Detection for Emergency Responders using XDimitrios Stefanopoulos, Aristeidis Bozas, Georgia Christodoulou, Maria I. Maslioukova, Yiannis Kouloglou, Maria Pegia, Anastasia Moumtzidou, Ilias Gialampoukidis, Konstantinos Avgerinakis, Stefanos Vrochidis, Ioannis Kompatsiaris. 1-7 [doi]
- Pgnn-Based Approach for Robust 3D Light Direction Estimation in Outdoor ImagesMarcello Zanardelli, Mahyar Gohari, Riccardo Leonardi, Sergio Benini, Nicola Adami. 1-7 [doi]
- Enabling Domain Experts to Train Efficient Few-Shot Incremental Landmark RecognitionHelmut Neuschmied, Werner Bailer. 1-4 [doi]
- Predicting Multiple Reading Tasks Using Eye Movement MeasuresOnanong Kongmeesub, Cathal Gurrin, Prapaporn Rattanatamrong. 1-7 [doi]
- Wseseg: Introducing a Dataset for the Segmentation of Winter Sports Equipment With a Baseline for Interactive SegmentationRobin Schön, Daniel Kienzle, Rainer Lienhart. 1-7 [doi]
- Enhanced Defect Detection in Airport Runway Infrastructure Using Image-Text PairingMarios Krestenitis, Eftichia Badeka, Ilias Koulalis, Konstantinos Ioannidis, Stefanos Vrochidis. 1-7 [doi]
- Demo: Creating Player-Specific Soccer Highlight Clips with PlayerTVHåkon Maric Solberg, Mehdi Houshmand Sarkhoosh, Sushant Gautam, Saeed Shafiee Sabet, Pål Halvorsen, Cise Midoglu. 1-5 [doi]
- Verge: Simplifying Video Search for Novice UsersNick Pantelidis, Maria Pegia, Damianos Galanopoulos, Konstantinos Apostolidis, Dimitris Georgalis, Klearchos Stavrothanasopoulos, Anastasia Moumtzidou, Konstantinos Gkountakos, Ilias Gialampoukidis, Stefanos Vrochidis, Vasileios Mezaris, Ioannis Kompatsiaris. 1-6 [doi]
- Expressive Multimedia Query Formulation for Novices in Virtual Reality with Vitrivr-VRFlorian Spiess 0001, Heiko Schuldt. 1-4 [doi]
- Exploring the Plausibility of Hate and Counter Speech Detectors With Explainable AiAdrian Jaques Böck, Djordje Slijepcevic, Matthias Zeppelzauer. 1-8 [doi]
- Descriptor Impact on Multimodal 3D RetrievalMaria Eirini Pegia, Björn Þór Jónsson 0001, Anastasia Moumtzidou, Sotiris Diplaris, Ilias Gialampoukidis, Stefanos Vrochidis, Ioannis Kompatsiaris. 1-7 [doi]
- Exquisitor: Studying the Interplay Between Conversational Search and Relevance FeedbackOmar Shahbaz Khan, Ujjwal Sharma 0001, Stevan Rudinac, Björn Þór Jónsson 0001. 1-5 [doi]
- Elevating Video Retrieval Capabilities: A Cross-Modal Approach Utilizing Text and Image Generative ModelsKazuya Ueki, Yuma Suzuki, Haruki Sato, Takayuki Hori, Takumi Takada, Hiroki Takushima, Hayato Tanoue, Aiswariya Manoj Kumar, Hiroki Nishihara. 1-7 [doi]
- IMSearch: An Interactive Multimedia Video-Moment Search SystemDuc-Tuan Luu, Duy-Ngoc Nguyen, Khanh-Linh Bui-Le, Vinh-Tiep Nguyen, Minh-Triet Tran. 1-7 [doi]
- A Comparison of Late-Fusion Training Strategies for Quad-Modal Joint EmbeddingsDomenic Luca Fürer, Abraham Bernstein, Luca Rossetto. 1-7 [doi]
- Query Refinement for Non-Existing Items in Image RetrievalNaoto Naka, Shin'ichi Satoh 0001. 1-7 [doi]
- Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach: Use Case of Riot or Violent Context DetectionLam Pham, Tin Nguyen 0007, Phat Lam, Hieu Tang, Alexander Schindler. 1-4 [doi]
- Towards Training Music Taggers on Synthetic DataNadine Kroher, Steven Manangu, Aggelos Pikrakis. 1-6 [doi]
- SoccerRAG: Multimodal Soccer Information Retrieval via Natural QueriesAleksander Theo Strand, Sushant Gautam, Cise Midoglu, Pål Halvorsen. 1-7 [doi]
- Predicting 3D Projectile Motion in Table Tennis Using Computer Vision and Physics-Informed Neural NetworkZaineb Chiha, Renaud Péteri, Laurent Mascarilla. 1-7 [doi]
- Learning Scene Semantics From Vehicle-Centric Data for City-Scale Digital TwinsHermann Fürntratt, Stefanie Onsori-Wechtitsch, Werner Bailer, Isaac Agustí Ventura, Carles Sala Navarro, Aleksandar Jevtic, Jawad Haidar. 1-6 [doi]
- Latent Space Exploration for Drum SamplesJake Drysdale, Jason Hockman. 1-7 [doi]
- Evaluation of Deep Audio Representations for Semantic Sound SimilarityRecep Oguz Araz, Dmitry Bogdanov, Pablo Alonso-Jiménez, Frederic Font. 1-7 [doi]
- Leveraging Latent Diffusion Models for Training-Free in-Distribution Data Augmentation for Surface Defect DetectionFederico Girella, Ziyue Liu, Franco Fummi, Francesco Setti, Marco Cristani, Luigi Capogrosso. 1-7 [doi]
- Music Scope Pad: Video Selecting Application by Natural Movement in VR SpaceMasatoshi Hamanaka. 1-4 [doi]
- Is Clip the Main Roadblock for Fine-Grained Open-World Perception?Lorenzo Bianchi 0001, Fabio Carrara, Nicola Messina, Fabrizio Falchi. 1-8 [doi]
- Real-Time Musical Collaboration With a Probabilistic ModelKarl Johannsson, Victor Shepardson, Enrique Hurtado, Thor Magnusson, Hannes Högni Vilhjálmsson. 1-4 [doi]
- HRV Based Stress Detection Using Convolutional Neural Networks (CNNs)Salomé Quoy, Dan Istrate, Mouna Benchekroun, Vincent Zalc. 1-5 [doi]
- Invariant Audio Prints for Music Indexing and AlignmentRémi Mignot, Geoffroy Peeters. 1-7 [doi]
- XAIface: A Framework and Toolkit for Explainable Face RecognitionNélida Mirabet Herranz, Martin Winter, Yuhang Lu, Naima Bousnina, Jonas Pfister, Chiara Galdi, Jean-Luc Dugelay, Werner Bailer, Touradj Ebrahimi, Paulo Lobato Correia, Fernando Pereira 0001, Felix Schmautzer, Erich Schweighofer. 1-7 [doi]
- Lowering Barriers to Entry for Fully-Integrated Custom Payloads on a DJI MatriceJoshua Springer, Gylfi Þór Guðmundsson, Marcel Kyas. 1-5 [doi]
- Data-Efficient Domain Transfer for Instance Segmentation for AR ScenesStefanie Onsori-Wechtitsch, Hermann Fürntratt, Hannes Fassold, Werner Bailer. 1-7 [doi]
- Towards Advanced Wildfire Analysis: A Siamese Network-Based Change Detection Approach Through Self-Supervised LearningDimitris Valsamis, Alexandros Oikonomidis, Chrysoula Chatzichristaki, Anastasia Moumtzidou, Ilias Gialampoukidis, Stefanos Vrochidis, Ioannis Kompatsiaris. 1-7 [doi]
- Demo: Soccer Information Retrieval Via Natural Queries using SoccerRAGAleksander Theo Strand, Sushant Gautam, Cise Midoglu, Pål Halvorsen. 1-5 [doi]
- A Hybrid AI System for Fusion of Object and Context Information: Application to the Rail Line Defect DetectionAlexey Zhukov, Jenny Benois-Pineau, Alain Rivero, Akka Zemmari, Mohamed Mosbah 0001, Danilo Crispiani. 1-7 [doi]
- Coarse-To-Fine Pruning of Graph Convolutional Networks for Skeleton-Based RecognitionHichem Sahbi. 1-7 [doi]