Abstract is missing.
- The MUSTI challenge@MediaEval 2023 - Multimodal Understanding of Smells in Texts and Images with Zero-shot EvaluationAli Hürriyetoglu, Inna Novalija, Mathias Zinnen, Vincent Christlein, Pasquale Lisena, Stefano Menini, Marieke van Erp, Raphaël Troncy. [doi]
- Automated Recognition of Sports Scores using PyTessaract OCR and CNN: SportsVideo Task at MediaEval 2023Bhuvana Jayaraman, Mirnalinee T. T, Harshida Sujatha Palaniraj, Mohith Adluru, Sanjjit Sounderrajan. [doi]
- SELAB-HCMUS at MediaEval 2023: A cross-domain and subject-centric approach towards the memorability prediction taskMinh-Quang Nguyen, Minh-Huy Trinh, Huy-Giap Bui, Khac-Trieu Vo, Minh-Triet Tran, Thien Phuc Tran, Hai Dang Nguyen. [doi]
- Cross-modal Networks, Fine-Tuning, Data Augmentation and Dual Softmax Operation for MediaEval NewsImages 2023Antonios Leventakis, Damianos Galanopoulos, Vasileios Mezaris. [doi]
- The Impact of Transformers Ensemble on Model Memorability and GeneralizabilityMuhammad Mustafa Ali Usmani, Humna Faisal, Muhammad Atif Tahir. [doi]
- ConvLSTM for Table Tennis Stroke ClassificationJansi Rani Sella Veluswami, Ananth Narayanan P, Bhuvan S, Shobith Kumar R. [doi]
- Handle the problem of ample label space by using the Image-guided Feature Extractor on the MUSTI datasetLe Ngoc Duc, Le Minh Hung, Dinh Quang Vinh. [doi]
- A Text-Image Olfactory Matching Method Based on the Distribution of Real-World DataYi Shao, Yulong Sun, Wenbo Wan, Jing Li, Jiande Sun. [doi]
- Multimodal Learning for Image-Text Matching: A Blip-Based ApproachDhanya Srinivasan, Subhashree M, Mirunalini P, Jaisakthi S. M. [doi]
- Beyond Keywords: ChatGPT's Semantic Understanding for Enhanced Media SearchHoang Chau Truong Vinh, Doan Khai Ta, Duc Duy Nguyen, Le Thanh Nguyen, Quang Vinh Nguyen. [doi]
- Multimodal Fusion in NewsImages 2023: Evaluating Translators, Keyphrase Extraction, and CLIP Pre-TrainingTien-Huy Nguyen, Hoang-Long Nguyen-Huu, Thien-Doanh Le, Huu-Loc Tran, Quoc-Khanh Le-Tran, Hoang-Bach Ngo, Minh-Hung An, Quang Vinh Dinh. [doi]
- A Hybrid Approach To Stroke Detection In SwimmingAnkitha Reddy A, Pranav Moorthi, Samyuktaa Sivakumar, Shwetha S, Prabavathy Balasundaram. [doi]
- The Relation between Texts and Images in News: News Images in MediaEval 2023Andreas Lommatzsch, Benjamin Kille, Özlem Özgöbek, Mehdi Elahi, Duc-Tien Dang-Nguyen. [doi]
- MUSTI - Multimodal Understanding of Smells in Texts and Images Using CLIPMirunalini P, Sanjhay V, Rohitram S, Rohith M. [doi]
- Investigating the Performance of the CLIP Model and Concept Matching in Text-Image Retrieval SystemsXiaomeng Wang, Mingliang Liang, Martha A. Larson. [doi]
- Two-Stream Network and Attention Mechanism for Sports Video Classification in Table TennisPengcheng Dong, Hongxin Xie, Fuqiang Zheng, Jiande Sun 0001. [doi]
- Connecting Text and Images in News Articles using VSE++Abbhinav Elliah, Mirunalini Palaniappan, Keerthick V, Haricharan Bharathi, Anirudh Bhaskar, Vithula S. [doi]
- An Empirical Exploration of Perceived Similarity between News Article Texts and ImagesLucien Heitz, Abraham Bernstein, Luca Rossetto. [doi]
- Medico Multimedia Task at MediaEval 2023: Transparent Tracking of SpermatozoaVajira Thambawita, Andrea M. Storås, Tuan-Luc Huynh, Hai Dang Nguyen, Minh-Triet Tran, Trung-Nghia Le, Pål Halvorsen, Michael Riegler 0001, Steven Hicks, Thien Phuc Tran. [doi]
- Overview of The MediaEval 2023 Predicting Video Memorability TaskMihai Gabriel Constantin, Claire-Hélène Demarty, Camilo Fosco, Alba García Seco de Herrera, Sebastian Halder 0001, Graham Healy, Bogdan Ionescu, Ana Matran-Fernandez, Rukiye Savran Kiziltepe, Alan F. Smeaton, Lorin Sweeney. [doi]
- Prompt-based Alignment of Headlines and Images Using OpenCLIPLucien Heitz, Yuin Kwan Chan, Hongji Li, Kerui Zeng, Luca Rossetto, Abraham Bernstein. [doi]
- SportsVideo: A Multimedia Dataset for Sports Event and Position Detection in Table Tennis and SwimmingAymeric Erades, Pierre-Etienne Martin, Romain Vuillemot, Boris Mansencal, Renaud Péteri, Julien Morlier, Stefan Duffner, Jenny Benois-Pineau. [doi]
- Enhancing Multimodal Language Models with Olfactory InformationMurathan Kurfali, Jonas K. Olofsson, Thomas Hörberg. [doi]
- NewsImages Fusion: Bridging Textual Context and Visual Content in Media RepresentationArvind V, Vettri Chezhiyan, Harish J, Dr. Priyadharshini R, Mohanapriya E. [doi]
- Exploring Video Transformers and Automatic Segment Selection for Memorability PredictionIván Martín-Fernández, Sergio Esteban Romero, Jaime Bellver-Soler, Manuel Gil-Martín, Fernando Fernández-Martínez. [doi]
- Transparent Tracking of Spermatozoa with YOLOv8Bao-Tin Nguyen, Van Loc Nguyen, Minh-Triet Tran. [doi]
- Ensemble Pre-trained Multimodal Models for Image-text Retrieval in the NewsImages MediaEval 2023Taihang Wang, Jianxiang Tian, Xiangrun Li, Xiaoman Xu, Ye Jiang. [doi]
- AIMultimediaLab at MediaEval 2023: Studying the generalization of media memorability prediction methodsMihai Gabriel Constantin, Bogdan Ionescu. [doi]
- Integrated Multi-stage Contextual Attention Network for Text-Image MatchingJiande Sun 0001, Yi Shao, Yawen Chen, Yang Zhang, Tianlin Zhang, Xuan Zhang, Ye Jiang, Jing Li. [doi]
- Baseline Method for the Sport Task of MediaEval 2023 3D CNNs using Attention Mechanisms for Table Tennis Stoke Detection and ClassificationPierre-Etienne Martin. [doi]
- Optimizing Visual Pairings: A CLIP Framework for Precision News Image RematchingPooja Premnath, Venkatasai Ojus Yenumulapalli, Rajalakshmi Sivanaiah, Angel Deborah Suseelan. [doi]
- Multimodal and Multilingual Olfactory Matching based on Contrastive LearningSergio Esteban Romero, Iván Martín-Fernández, Jaime Bellver-Soler, Manuel Gil-Martín, Fernando Fernández-Martínez. [doi]
- Optimizing Sperm Detection and Tracking in Fluids with Equalize Class Representation AugmentationTrong-Hieu Nguyen Mau, Quoc-Huy Trinh, Ngoc-Linh Nguyen-Ha, Tuong-Vy Truong-Thuy, Tuan-Anh Yang, Hai Dang Nguyen, Ngoc-Thao Nguyen, Minh-Triet Tran. [doi]
- Tracking and Prediction Of Human Spermatozoa Motility Using Yolov8n and Greedy Shape Geometry TechniqueMuhammad Osaid, Abdul-Samad, Omer Qureshi, Muhammad Atif Tahir, Muhammad Nouman Durrani. [doi]