Abstract is missing.
- S2MGen: A synthetic skin mask generator for improving segmentationSubhadra Gopalakrishnan, Trisha Mittal, Jaclyn Pytlarz, Yuheng Zhao. 1-8 [doi]
- Generating and Evaluating Cursive Chinese Calligraphy by Semi-Classifying Style: A Case Study Using a Diffusion ModelYi-Chieh Wu, Yu-Jung Hsu. 9-16 [doi]
- StegoFusion-Net: Fusion of Convolutional Neural Networks for Spatial Image SteganalysisYassine Belkhouche, AlaaIdin Dwaik. 17-23 [doi]
- Disparity Correction Method of the Monocular Omnidirectional Stereo CameraHisayoshi Kaneda, Ryota Kawamata, Kazuyoshi Yamazaki, Kazuya Shimizu. 24-25 [doi]
- Unveiling the Potential of SSL-Generated Audio Embeddings for Cross-Lingual Speaker RecognitionWen-Hung Liao, Po-Han Chen, Yi-Chieh Wu. 26-32 [doi]
- Two-stage instrument timbre transfer method using RAVEDi Hu, Katunobu Ito. 33-40 [doi]
- Speaker Pseudonymization for Japanese Speech Using Duration EmbeddingsAoi Ito, Katunobu Itou. 41-48 [doi]
- Modeling User Quality of Experience in Adaptive Point Cloud Video StreamingDuc V. Nguyen 0001, Quang Long Nguyen, Tran Thuy Hien, Nguyen Ngoc Huyen, Truong Thu Huong, Pham Ngoc Nam 0001. 49-54 [doi]
- Appeal prediction for AI up-scaled ImagesSteve Göring, Rasmus Merten, Alexander Raake. 55-62 [doi]
- Modelling Concurrent RTP Flows for End-to-end Predictions of QoS in Real Time CommunicationsTailai Song, Paolo Garza, Michela Meo, Maurizio Matteo Munafò. 63-70 [doi]
- SoccerNet-Echoes: A Soccer Game Audio Commentary DatasetSushant Gautam, Mehdi Houshmand Sarkhoosh, Jan Held, Cise Midoglu, Anthony Cioppa, Silvio Giancola, Vajira Thambawita, Michael A. Riegler, Pål Halvorsen, Mubarak Shah. 71-78 [doi]
- Ensuring Color Consistency in RGB-D Multi-Camera SetupPeter O. Fasogbon. 79-84 [doi]
- Low Complexity Learning-based Lossless Event-based CompressionAhmadreza Sezavar, Catarina Brites, João Ascenso. 85-92 [doi]
- PlayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight ClipsHåkon Maric Solberg, Mehdi Houshmand Sarkhoosh, Sushant Gautam, Saeed Shafiee Sabet, Pål Halvorsen, Cise Midoglu. 93-97 [doi]
- Flexible And Faithful Data Insights GenerationWei Zhang, Victor Soares Bursztyn. 98-105 [doi]
- Holistic Visualization of Contextual Knowledge in Hotel Customer Reviews Using Self-AttentionShuntaro Masuda, Toshihiko Yamasaki. 106-109 [doi]
- Investigation of Feature Distribution and Network Weight Updates in the Machine Unlearning ProcessWen-Hung Liao, Yang-Jing Lin. 110-113 [doi]
- Platform for Endangered Language EducationGreeshma Sree Parimi, Gurkirat Singh Guliani, Min Chen. 114-115 [doi]
- Homophonic Music Composition Using a GAN and LSTM Pipeline for Melody and Harmony GenerationClément Saint-Marc, Katunobu Itou. 116-119 [doi]
- *Yuhuan Wang, Katunobu Itou. 120-123 [doi]
- Generating Bass Phrases from Guitar Chord Backing with NMFTomoo Kouzai, Junya Koguchi, Tetsuro Kitahara. 124-125 [doi]
- Watch your back! Dynamic thumbnails for a 360-degree video player to enhance viewing experience on 2D displaysJakub Kovác, Wolfgang Hürst. 126-132 [doi]
- Influence of Display Devices and Field of View on Subjective Quality of Experience Evaluation of 8K 360° VideosDaichi Arai, Yuichi Kondo, Yasuko Sugito, Yuichi Kusakabe. 133-136 [doi]
- VEMOCLAP: A video emotion classification web applicationSerkan Sulun, Paula Viana, Matthew E. P. Davies. 137-140 [doi]
- A Power-Law Transformation Approach for Template-Based Cross-Component PredictionZhikai Liu, Kun Zhang, Xin-Yi Cui, Wei Sun, Fan Liang. 141-142 [doi]
- Investigating the Impact of High Frame Rate on Video Quality: A SAMVIQ ApproachDominik Keller, Paul Rudi Frank, Steve Göring, Alexander Raake. 143-144 [doi]
- A Server-driven View-aware Point Cloud Video Streaming FrameworkTran Gia Minh, Truong Thu Huong, Duc V. Nguyen 0001. 145-148 [doi]
- Evaluation of strategies for efficient rate-distortion NeRF streamingPedro Martin, António Rodrigues, João Ascenso, Maria Paula Queluz. 149-153 [doi]
- Perceptual Quality Driven Point Cloud Compression for 6DoF 3D Point Cloud StreamingYumeka Chujo, Yusuke Tagashira, Yukiko Harada, Kenji Kanai, Jiro Katto. 154-157 [doi]
- On Multi-CDN Delivery Costs Optimization ProblemYuriy A. Reznik, Guillem Cabrera. 158-161 [doi]
- Sliding Window Check: Repairing Object IdentitiesGeerthan Srikantharajah, Naimul Khan 0001. 162-169 [doi]
- Data Augmentation with Diffusion Model for Hand DetectionGenta Matsukawa, Atsuo Yoshitaka. 170-173 [doi]
- AI Maintenance Techniques by Detecting Performance Degradation in Domain Shift Using Model EnsemblesKeita Yamane, Akira Kitayama, Keigo Hasegawa, Yusuke Obonai, Hiroto Sasao. 174-175 [doi]
- Cross-Modal 3D Model RetrievalRaphael Waltenspül, Florian Spiess 0001, Heiko Schuldt. 176-180 [doi]
- Prevention of Unexpected Object Generation in Diffusion Model-Based InpaintingTakumi Komori, Takahiro Hayashi. 181-184 [doi]
- LMM-Regularized CLIP Embeddings for Image ClassificationMaria Tzelepi, Vasileios Mezaris. 185-188 [doi]
- Evaluation Framework for Novel View SynthesisKolja Kieslich, Louay Bassbouss, Stephan Steglich, Stefan Arbanowski. 189-192 [doi]
- A Simulation for the Evaluation of the Mean Opinion Score (MOS) for EVS-WB and AMR-WB Audio Codecs for 5G Mobile NetworksJussif J. Abularach Arnez, Cassio A. Tavares Alves, Wederson Medeiros Silva, Isaac Barros Gomes, Carla Lapa Nogueira, Maria G. Lima Damasceno. 193-196 [doi]
- FrameCorr: Adaptive, Autoencoder-based Neural Compression for Video Reconstruction in Resource and Timing Constrained Network SettingsJohn Li, Deepak Nair, Klara Nahrstedt, Indranil Gupta, Shehab Sarar Ahmed. 197-200 [doi]
- Ultra-low-latency 8K120p-video-transmission System Parallelizing SMPTE ST 2110Yasuhiro Mochida, Takuro Yamaguchi, Hirokazu Takahashi, Koichi Takasugi. 201-202 [doi]
- Low-latency Software-based Uncompressed Video TransmissionTakuro Yamaguchi, Yasuhiro Mochida, Hirokazu Takahashi. 203-204 [doi]
- Visual Speech Recognition with Surrounding and Emotional InformationPengcheng Zeng, Atsuo Yoshitaka. 205-212 [doi]
- Synchronized Object Sharing for Augmented Reality Virtual ConferencingJohn Murray, Michael Zink. 213-218 [doi]
- Fusion-Based Human Pose Estimation Using RGB and IR Images with Transformer-Based DecodingViviana Crescitelli, Takashi Oshima. 219-220 [doi]
- Occlusion-Aware Real-Time Tiny Facial Alignment Model for Makeup Virtual Try-OnKin Ching Lydia Chau, Zhi Yu, Ruowei Jiang. 221-224 [doi]
- A Study on Mental Stress Test using Cybersickness caused by Virtual Reality ContentsNan Bu, Kakeru Nakano. 225-226 [doi]
- Exploring Augmented Table Setup and Lighting Customization in a Simulated Restaurant to Improve the User ExperienceJana Motowilowa, Maurizio Vergari, Tanja Kojic, Maximilian Warsinke, Sebastian Möller 0001, Jan-Niklas Voigt-Antons. 227-231 [doi]
- Human-in-the-loop knowledge base upkeep for retrieval augmented generation applicationsPedro Baptista de Castro, Hiroko Sukeda, Soichi Takashige. 232-233 [doi]
- LiveSkeleton: High-Quality Real-Time Human Tracking and Pose EstimationHannes Fassold. 234-235 [doi]
- A technical Concept for enhancing the Student Experience in Hybrid Lecture ScenariosFlorian Schimanke, Robert Mertens 0002, Felix Prankel. 236-241 [doi]
- SpotiView: Partial Face Display Method for Smooth Communication While Protecting PrivacyRyota Kishimoto, Shuhei Tsuchida, Tsutomu Terada, Masahiko Tsukamoto. 242-249 [doi]
- Characterizing students behavior in multi-user multi-computer testing environmentsRajini Chittimalla, Sujung Choi, Madhu Sai Vineel Reka, Yassine Belkhouche. 250-254 [doi]
- Evaluating Interactive Concept Maps Produced from E-PortfoliosAlexander Gantikow, Andreas Isking, Wolfgang Müller 0004, Paul Libbrecht, Sandra Rebholz. 255-260 [doi]
- Gender Stereotypes in the Creation of Educational Cases with ChatGPTGabriel Valerio-Ureña, Giomara Sevilla-Campoverde, Soledad Ortúzar, Christian Lazcano. 261-266 [doi]
- Multi-View Gesture Recognition in Conflict SituationsKaram Dawoud, Birgit Nierula, Farelle Toumaleu Siewe, Thomas Koch, Daniel Johannes Meyer, Andreas Bock, Marianne Heinze, Daniela Knuth, Denis Martin, Julia Schander, Anna Hilsmann, Peter Eisert, Sebastian Bosse. 267-268 [doi]
- PanoramaViewer - A Framework for Educational Collaborative Virtual Field TripsMario Wolf, Sebastian Hartwig, Gregor Steinhöfel, Heinrich Söbke, Eckhard Kraft. 269-274 [doi]
- Real-time Multi-modal Highlight Prediction for Simultaneous Viewing of Multiple Live StreamsYusuke Maeda, Takahiro Hayashi. 275-278 [doi]
- Slide Analysis Method for Editing Lecture Materials based on Hierarchical Structures of Subject TerminologiesItsuki Sano, Yuanyuan Wang 0003, Yukiko Kawai, Kazutoshi Sumiya. 279-284 [doi]
- The ≪Huh?≫ Button: Improving Understanding in Educational Videos with Large Language ModelsBoris Ruf, Marcin Detyniecki. 285-289 [doi]