Abstract is missing.
- Retrieval-Augmented Transformer for Image CaptioningSara Sarto, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara. 1-7 [doi]
- Hybrid Transformer Network for Deepfake DetectionSohail Ahmed Khan, Duc-Tien Dang-Nguyen. 8-14 [doi]
- An Exploration into the Benefits of the CLIP model for Lifelog RetrievalLy-Duyen Tran, Naushad Alam, Yvette Graham, Linh Khanh Vo, Nghiem Tuong Diep, Binh Nguyen, Liting Zhou, Cathal Gurrin. 15-22 [doi]
- An Audio-Visual Dataset and Deep Learning Frameworks for Crowded Scene ClassificationLam Pham, Dat Ngo, Tho Nguyen, Phu X. Nguyen, Truong Van Hoang, Alexander Schindler. 23-28 [doi]
- A Fine Grained Quality Assessment of Video Anomaly DetectionJiang Zhou, Kevin McGuinness, Joseph Antony, Noel E. O'Connor. 29-35 [doi]
- Learning Co-occurrence Features Across Spatial and Temporal Domains for Hand Gesture RecognitionMohammad Rehan, Hazem Wannous, Jafar Alkheir, Kinda Aboukassem. 36-42 [doi]
- Sentiment analysis on 2D images of urban and indoor spaces using deep learning architecturesKonstantinos Chatzistavros, Theodora Pistola, Sotiris Diplaris, Konstantinos Ioannidis, Stefanos Vrochidis, Ioannis Kompatsiaris. 43-49 [doi]
- Urban Image Geo-Localization Using Open Data on Public SpacesMathias Glistrup, Stevan Rudinac, Björn Þór Jónsson 0001. 50-56 [doi]
- A domain adaptive deep learning solution for scanpath prediction of paintingsMohamed Amine Kerkouri, Marouane Tliba, Aladine Chetouani, Alessandro Bruno. 57-63 [doi]
- ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and RetrievalNicola Messina, Matteo Stefanini, Marcella Cornia, Lorenzo Baraldi, Fabrizio Falchi, Giuseppe Amato, Rita Cucchiara. 64-70 [doi]
- Improving Nearest Neighbor Indexing by Multitask LearningAmorntip Prayoonwong, Ke-Long Zeng, Chih-Yi Chiu. 71-76 [doi]
- Towards Human Performance on Sketch-Based Image RetrievalOmar Seddati, Stéphane Dupont, Saïd Mahmoudi, Thierry Dutoit. 77-83 [doi]
- Analysis of the Complementarity of Latent and Concept Spaces for Cross-Modal Video SearchVarsha Devi, Philippe Mulhem, Georges Quénot. 84-90 [doi]
- Real-time deblurring network for face AR applicationsJuhwan Lee, Jongha Lee, Sangwook Yoo. 91-96 [doi]
- Hyperspectral Image Reconstruction of Heritage Artwork Using RGB Images and Deep Neural NetworksAilin Chen, Rui Jesus, Márcia Vilarigues. 97-102 [doi]
- A survey for image based methods in construction: from images to digital twinsIlias Koulalis, Nikolaos I. Dourvas, Theocharis Triantafyllidis, Konstantinos Ioannidis, Stefanos Vrochidis, Ioannis Kompatsiaris. 103-110 [doi]
- Segmenting partially annotated medical imagesNicolas Martin, Jean-Pierre Chevallet, Georges Quénot. 111-115 [doi]
- Chest Diseases Classification Using CXR and Deep Ensemble LearningAdnane Ait Nasser, Moulay A. Akhloufi. 116-120 [doi]
- Skin Cancer Detection using Ensemble Learning and Grouping of Deep ModelsTakfarines Guergueb, Moulay A. Akhloufi. 121-125 [doi]
- Learning to Detect Fallen People in Virtual WorldsFabio Carrara, Lorenzo Pasco, Claudio Gennaro, Fabrizio Falchi. 126-130 [doi]
- Few-shot Object Detection as a Semi-supervised Learning ProblemWerner Bailer, Hannes Fassold. 131-135 [doi]
- Deep Features for CBIR with Scarce Data using Hebbian LearningGabriele Lagani, Davide Bacciu, Claudio Gallicchio, Fabrizio Falchi, Claudio Gennaro, Giuseppe Amato. 136-141 [doi]
- BiasUNet: Learning Change Detection over Sentinel-2 Image PairsMaria Eirini Pegia, Anastasia Moumtzidou, Ilias Gialampoukidis, Björn Þór Jónsson 0001, Stefanos Vrochidis, Ioannis Kompatsiaris. 142-148 [doi]
- Wildfire Segmentation using Deep-RegSeg Semantic Segmentation ArchitectureRafik Ghali, Moulay A. Akhloufi, Wided Souidène Mseddi, Marwa Jmal. 149-154 [doi]
- Ecological Impact Assessment Framework for areas affected by Natural DisastersArief Setyanto, Kusrini Kusrini, Gardyas Bidari Adninda, Renindya Kartikakirana, Rhisa Aidilla Suprapto, Arif Laksito, I Made Artha Agastya, Krishna Chandramouli, Andrea Majlingova, Yvonne Brodrechtová, Konstantinos Demestichas, Ebroul Izquierdo. 155-161 [doi]
- StyleGAN-based CLIP-guided Image Shape ManipulationYuchen Qian, Kohei Yamamoto, Keiji Yanai. 162-166 [doi]
- Streaming learning with Move-to-Data approach for image classificationAbel Kahsay Gebreslassie, Jenny Benois-Pineau, Akka Zemmari. 167-173 [doi]
- Analysing the Memorability of a Procedural Crime-Drama TV Series, CSISeán Cummins, Lorin Sweeney, Alan F. Smeaton. 174-180 [doi]
- A large-scale TV video and metadata database for French political content analysis and fact-checkingFrédéric Rayar, Mathieu Delalandre, Van-Hao Le. 181-185 [doi]
- Relational Database Performance for Multimedia: A Case StudyBjörn Þór Jónsson 0001, Aaron Duane, Nikolaj Mertz. 186-190 [doi]
- The Potential of Webcam Based Real Time Eye-Tracking to Reduce Rendering CostIsabel Kütemeyer, Mathias Lux. 191-195 [doi]
- Self-Supervised Spiking Neural Networks applied to Digit ClassificationBenjamin Chamand, Philippe Joly. 196-200 [doi]
- A Virtual Reality Talking Avatar for Investigative Interviews of Maltreat ChildrenSyed Zohaib Hassan, Pegah Salehi, Michael Alexander Riegler, Miriam Sinkerud Johnson, Gunn Astrid Baugerud, Pål Halvorsen, Saeed Shafiee Sabet. 201-204 [doi]
- A Toolchain for Extracting and Visualising Road Traffic DataHelmut Neuschmied, Florian Krebs, Stefan Ladstätter, Elisabeth Eder, Mohamed Redouane Berrazouane, Georg Thallinger. 205-208 [doi]