Abstract is missing.
- GHOI: A Green Human-Object-Interaction DetectorTsung-Shan Yang, Yun-Cheng Wang, Chengwei Wei, C. C. Jay Kuo. 1-7 [doi]
- Real-Time Lane-Wise Traffic Monitoring in Optimal ROIsMei Qiu, Wei Lin, Lauren Ann Christopher, Stanley Y. P. Chien, Yaobin Chen, Shu Hu. 8-14 [doi]
- SkyDataNet: An Object Detection Algorithm with 2D Gaussian Loss for UAV-Based Aerial ImagesMehmet Akif Özkanoglu, Ali C. Begen, Sedat Ozer. 21-27 [doi]
- Target-Aware Siamese Networks Based on Masked Attention Mechanism for Visual Object TrackingYao-Hui Su, Ming-Der Shieh, Chia-Chi Tsai. 28-34 [doi]
- A Framework for Generating Images and Hashtags for Social Media Posts for Artificial InfluencersRaju Shrestha, Hanne Korneliussen. 42-48 [doi]
- Automatic Visual Citation Generation for Text-to-Image GenerationNing Xu, Serhad Doken. 49-54 [doi]
- Enhancing Local LLM Performance Through Heterogeneous Multi-Device ComputingRyan Metcalfe, Garth Long, Charlie L. Wang, Iole Moccagatta. 55-60 [doi]
- Text-Driven Synchronized Diffusion Video and Audio Talking Head GenerationZhenfei Zhang, Tsung-Wei Huang, Guan-Ming Su, Ming-Ching Chang, Xin Li. 61-67 [doi]
- 10x Future of Filmmaking Empowered by AIGCHaohong Wang, Daniel Smith, Malgorzata Kudelska. 68-74 [doi]
- Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic SegmentationDaniel Kienzle, Marco Kantonis, Robin Schön, Rainer Lienhart. 75-81 [doi]
- Green Image Label TransferHaiyi Li, Xuejing Lei, Xinyu Wang, C. C. Jay Kuo. 82-87 [doi]
- GenCheck: A LoRA-Adapted Multimodal Large Language Model for Check AnalysisFei Zhao, Jiawen Chen, Bin Huang, Chengcui Zhang, Gary Warner, Rushi Chen, Shaorou Tang, Yuanfei Ma, Zixi Nan. 88-94 [doi]
- Leveraging Semantic Segmentation for Image Manipulation Detection and LocalizationYuwei Chen, Ming-Ching Chang, Xin Li. 95-101 [doi]
- GeoVQA: A Comprehensive Multimodal Geometry Dataset for Secondary EducationAvinash Anand, Raj Jaiswal, Abhishek Dharmadhikari, Atharva Marathe, Harsh Popat, Harshil Mital, Ashwin R. Nair, Kritarth Prasad, Sidharth Kumar, Astha Verma, Rajiv Ratn Shah, Roger Zimmermann. 102-108 [doi]
- ProxeGraph: Scene Graph Generation Utilizing Proxemics for Smart HomesDebaleen Das Spandan, Razib Iqbal. 109-115 [doi]
- HOI as Embeddings: Advancements of Model Representation Capability in Human-Object Interaction DetectionJunwen Chen, Yingcheng Wang, Keiji Yanai. 116-122 [doi]
- Lightweight Schemes Fusion for Heatmap-based Human Pose EstimationSheng-Jhou Lu, Hung-Wei Lee, Yu-Ming Han, Ji-Min Zhou, Ying Liu, Huang-Chia Shih. 123-126 [doi]
- Anomaly Detection in Video Using CompressionMichael R. Smith, Renee Gooding, Jonathan Bisila, Christina L. Ting. 127-133 [doi]
- SSLCT: A Convolutional Transformer for Synthetic Speech LocalizationKratika Bhagtani, Amit Kumar Singh Yadav, Paolo Bestagini, Edward J. Delp. 134-140 [doi]
- Playlist Continuation of Cold-Start SongsChun-Han Cheng, Ting-Yu Wei, Homer H. Chen. 141-147 [doi]
- Improved Standard-Based Motion Parallax Measurement in Mixed RealityHung-Jui Guo, Balakrishnan Prabhakaran 0001. 148-154 [doi]
- Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based RetrieversKunal Sawarkar, Abhilasha Mangal, Shivam Raj Solanki. 155-161 [doi]
- Automated Thematic Composer Classification Using Segment RetrievalJacob Edward Galajda, Kien A. Hua. 162-168 [doi]
- DRM-SN: Detecting Reused Multimedia Content on Social NetworksWen-Shiang Li, Yao-Cheng Lu, Wen-Kai Hsiao, Yu-Yao Tseng, Ming-Hung Wang. 169-175 [doi]
- Uncertainty-Guided Appearance-Motion Association Network for Out-of-Distribution Action DetectionXiang Fang, Arvind Easwaran, Blaise Genest. 176-182 [doi]
- FastLearn: A Rapid Learning Agent for Chat Models to Acquire Latest KnowledgeChenhan Fu, Guoming Wang, Rongxing Lu, Siliang Tang. 183-189 [doi]
- Enhancement of Neonatal Lung Pathology Classification Using Multi-view Feature RepresentationRyan Tan, Thanh Hong-Phuoc, Lei Gao 0001, Randy Tan, Sagarjit Aujla, Adel Mohamed, Ling Guan, Karthikeyan Umapathy, Naimul Mefraz Khan. 190-195 [doi]
- Holistic Visual-Textual Sentiment Analysis with Prior ModelsJunyu Chen, Jie An 0002, Hanjia Lyu, Christopher Kanan, Jiebo Luo. 196-202 [doi]
- Radio Map Estimation (RME) with Deep Progressive NetworkJashia Mitayeegiri, Shaohua Dong, Chenxi Qiu, Qing Yang, Xinrong Li, Heng Fan 0001, Yan Huang. 203-206 [doi]
- CM-ASAP: Cross-Modality Adaptive Sensing and Perception for Efficient Hand Gesture RecognitionSoheil Hor, Mostafa El-Khamy, Yanlin Zhou, Amin Arbabian, SukHwan Lim. 207-213 [doi]
- Mitigating Privacy Threats Without Degrading Visual Quality of VR Applications: Using Re-Identification Attack as a Case StudyYu-Szu Wei, Yuan-Chun Sun, Shin-Yi Zheng, Hsun-Fu Hsu, Chun-Ying Huang, Cheng-Hsin Hsu. 214-220 [doi]
- Device-Agnostic Remote Range-of-motion Assessment using Data AbstractionOmeed Ashtiani, Meghana Spurthi Maadugundu, Minhas Kamal, Balakrishnan Prabhakaran. 221-226 [doi]
- Retrieval Augmented Structured Generation: Business Document Information Extraction as Tool UseFranz Louis Cesista, Rui Aguiar, Jason Kim 0010, Paolo Acilo. 227-230 [doi]
- Will Neural 3D Object Representations be the Silver Bullet for Improving VR Experience in HMDs?Charlie Hsu, Yuan-Chun Sun, Kuan-Yu Lee, Chun-Ying Huang. 231-234 [doi]
- Frame-Level Latent Embedding Using Weak Labels for Multi-View Action RecognitionVijay John, Yasutomo Kawanishi. 235-238 [doi]
- A Deep Features Based Approach Using Modified ResNet50 and Gradient Boosting for Visual Sentiments ClassificationMuhammad Arslan, Muhammad Mubeen, Arslan Akram, Saadullah Farooq Abbasi, Muhammad Salman Ali, Muhammad Usman Tariq. 239-242 [doi]
- Dental X-ray Segmentation and Auto Implant Design Based on Convolutional Neural NetworkYang Xing, Peixi Liao, Reem AwdhE Alasleh, Vissuta Khampatee, Farshid Alizadeh-Shabdiz. 243-246 [doi]
- Joint HDR Denoising and Fusion on Mobile DevicesJie Cai, Yuan Lin, Jiang Li, Jiaming Ding, Ling Ouyang, Chiu Man Ho, Zibo Meng. 247-252 [doi]
- MU-MAE: Multimodal Masked Autoencoders-Based One-Shot LearningRex Liu, Xin Liu. 253-259 [doi]
- Structured Pruning for Multi-Task Deep Neural NetworksSiddhant Garg, Lijun Zhang, Hui Guan 0001. 260-266 [doi]
- UU-Mamba: Uncertainty-aware U-Mamba for Cardiac Image SegmentationTing-Yu Tsai, Li Lin, Shu Hu, Ming-Ching Chang, Hongtu Zhu, Xin Wang. 267-273 [doi]
- Unveiling Statistical Significance of Online Regression Over Multiple DatasetsMohammad Abu-Shaira, Weishi Shi. 274-279 [doi]
- Macro-AUC-Driven Active Learning Strategy for Multi-Label Classification EnhancementMinghao Li, Junjie Qiu, Weishi Shi. 280-286 [doi]
- Viewing Comfort Enhancement on Head-Mounted Displays Using Stereo Disparity ControlDae Yeol Lee, Geonsun Lee, Guan-Ming Su. 287-293 [doi]
- Multi-Task Decision-Making for Multi-User $360^{\circ}$ Video Processing over Wireless NetworksBabak Badnava, Jacob Chakareski, Morteza Hashemi. 294-300 [doi]
- ExCEDA: Unlocking Attention Paradigms in Extended Duration E-Classrooms by Leveraging Attention-Mechanism ModelsAvinash Anand, Avni Mittal, Laavanaya Dhawan, Juhi Krishnamurthy, Mahisha Ramesh, Naman Lal, Astha Verma, Pijush Bhuyan, Himani, Rajiv Ratn Shah, Roger Zimmermann, Shin'ichi Satoh 0001. 301-307 [doi]
- Pulse of the Crowd: Quantifying Crowd Energy through Audio and Video AnalysisAvinash Anand, Sarthak Jain, Shashank Sharma, Akhil P. Dominic, Aman Gupta, Ashta Verma, Raj Jaiswal, Naman Lal, Rajiv Ratn Shah, Roger Zimmermann. 308-314 [doi]
- Plastic Surgery Image Classification and GenerationYiwei Han, Kaiyi Qi, Jiebo Luo. 315-320 [doi]
- CU-Mamba: Selective State Space Models with Channel Learning for Image RestorationRui Deng, Tianpei Gu. 328-334 [doi]
- OmniDet: Omnidirectional Object Detection via Fisheye Camera AdaptationChih-Chung Hsu, Wei-Hao Huang, Wen-Hai Tseng, Ming-Hsuan Wu, Ren-Jung Xu, Chia-Ming Lee. 335-341 [doi]
- GESA: Exploring Loss-based Adversarial Attacks in Volumetric Media StreamingMost Husne Jahan, Abdelhak Bentaleb. 342-348 [doi]
- LivePics-24: A Multi-person, Multi-camera, Multi-settings Live Photos DatasetOmkar N. Kulkarni, Aryan Mishra, Shashank Arora, Vivek K. Singh 0001, Pradeep K. Atrey. 349-354 [doi]
- TextSleuth: A New Dataset and Baseline for Scene Text Manipulation DetectionAbhineet Kumar Pandey, Ming-Ching Chang, Xin Li. 362-368 [doi]
- MambaTab: A Plug-and-Play Model for Learning Tabular DataMd Atik Ahamed, Qiang Shawn Cheng. 369-375 [doi]
- Deep Learning-based Text-in-Image WatermarkingBishwa Karki, Chun-Hua Tsai, Pei-Chi Huang, Xin Zhong. 376-382 [doi]
- Single-frame Supervised Action Temporal Localization Based on Multi-view Contrastive LearningHaoran Tong, Xu Cui, Laiyun Qing. 383-389 [doi]
- Mutual Information Analysis in Multimodal Learning SystemsHadi Hadizadeh, S. Faegheh Yeganli, Bahador Rashidi, Ivan V. Bajic. 390-395 [doi]
- Mathematics-Inspired Learning: A Green Learning Model with Interpretable PropertiesLing Guan, Lei Gao, Kai Liu, Zheng Guo. 396-402 [doi]
- Learning to Switch off, Switch on, and Integrate Modalities in Large Pre-trained TransformersTejas Duseja, K. M. Annervaz, Jeevithiesh Duggani, Shyam Zacharia, Michael Free, Ambedkar Dukkipati. 403-409 [doi]
- Cultural Relevance Index: Measuring Cultural Relevance in AI-Generated ImagesWala Elsharif, Marco Agus, Mahmood Alzubaidi, James She. 410-416 [doi]
- Parameter-Efficient Adaptation of Foundation Models for Damaged Building AssessmentFei Zhao, Chengcui Zhang. 417-422 [doi]
- Exploring the Impact of Hand Pose and Shadow on Hand-Washing Action RecognitionShengtai Ju, Amy R. Reibman. 423-429 [doi]
- Enabling Paper-Based Surface Authentication via Digital Twin and Experimental VerificationPrasun Datta, Chau-Wai Wong, Min Wu 0001. 430-438 [doi]
- DeepFake-o-meter v2.0: An Open Platform for DeepFake DetectionYan Ju, Chengzhe Sun, Shan Jia, Shuwei Hou, Zhaofeng Si, Soumyya Kanti Datta, Lipeng Ke, Riky Zhou, Anita Nikolich, Siwei Lyu. 439-445 [doi]
- GeoSecure-B: A Method for Secure Bearing CalculationVikram Patil, Sharmilee Rajkumar Rajan, Pradeep K. Atrey. 446-451 [doi]
- Clearing Text Images: A Non-blind Deblurring with Convex Total Variation Regularization ModelNarendra Kumar, Gaurav Bhatnagar. 452-457 [doi]
- Algorithmic Stock Trading StrategiesCraig Rainey, Min Chen. 458-464 [doi]
- Benchmarking the Robustness of UAV Tracking Against Common CorruptionsXiaoqiong Liu, Yunhe Feng, Shu Hu 0001, Xiaohui Yuan 0001, Heng Fan 0001. 465-470 [doi]
- LitAI: Enhancing Multimodal Literature Understanding and Mining with Generative AIGowtham Medisetti, Zacchaeus Compson, Heng Fan 0001, Huaxiao Yang, Yunhe Feng. 471-476 [doi]
- GaugeTracker: AI - Powered Cost-Effective Analog Gauge Monitoring SystemBeitong Tian, Mingyuan Wu, Ruixiao Zhang, Haozhen Zheng, Bo Chen, Yaohui Wang, Shiv Trivedi, Shanbo Zhang, Robert Bruce Kaufman, Leah Espenhahn, Gianni Pezzarossi, Mauro Sardela, John Dallesasse, Klara Nahrstedt. 477-483 [doi]
- Balancing Explanations and Adaptation in Offline Continual Learning Systems Using Active Augmented ReplyMd. Abdullah Al Forhad, Weishi Shi. 484-490 [doi]
- Controllable Universal Edge-Preserving Image FilteringShijun Liang, Dongdong Fu. 491-494 [doi]
- Attenuation-Aware Weighted Optical Flow with Medium Transmission Map for Learning-Based Visual Odometry in Underwater TerrainNguyen Gia Bach, Chanh Minh Tran, Eiji Kamioka, Phan Xuan Tan. 495-498 [doi]
- Harmful Brain Activity Classification of Spectrograms with Transfer Deep LearningShanker Ram, Sambhu Ganesan, Yajat Nagaraj Kiran. 499-502 [doi]
- TQCompressor: Improving Tensor Decomposition Methods in Neural Networks Via PermutationsVadim Abronin, Aleksei Naumov, Denis Mazur, Dmitriy Bystrov, Katerina Tsarova, Artem Melnikov, Sergey Dolgov, Reuben Brasher, Michael Perelshtein. 503-506 [doi]
- BubbleSig: Same-Hand Ballot Stuffing DetectionFei Zhao, Chengcui Zhang, Maya Shah, Nitesh Saxena. 507-510 [doi]
- Towards a Novel Blob Detection Approach for Concealed Object Detection in Passive Terahertz ImagingSushmita Chandel, Preeti Dwivedi, Gaurav Bhatnagar, Marcin Kowalski. 511-514 [doi]
- A VR 360°-Video Encoding Framework with Differentiated Tile Compression Based on Digital-Twin TechnologyAndrea Caruso, Giovanni Schembra. 515-521 [doi]
- Trustworthy and Robust Machine Learning for Multimedia: Challenges and PerspectivesKatsuaki Nakano, Michael Zuzak, Cory E. Merkel, Alexander C. Loui. 522-528 [doi]
- Counterfactual Gradients-based Quantification of Prediction Trust in Neural NetworksMohit Prabhushankar, Ghassan Alregib. 529-535 [doi]
- A Framework for Single-View Multi-Plane Image InpaintingZachary Mcbride Lazri, Dae Yeol Lee, Guan-Ming Su. 536-541 [doi]
- GPSR: A Green Point Cloud Surface Reconstruction MethodQingyang Zhou, Jiawei Yu, Shan Liu 0001, C. C. Jay Kuo. 542-548 [doi]
- Behavioral Emotion Analysis Model for Large Language ModelsEdward Y. Chang. 549-556 [doi]
- MISS: Memory-efficient Instance Segmentation for Sport-Scenes with Visual Inductive PriorsChih-Chung Hsu, Chia-Ming Lee. 557-561 [doi]
- Simultaneous Classification and Segmentation of Subretinal Lesions on ICGA ImagesMing-Wen Kuan, Wei-Yang Lin, Chia-Ling Tsai, Shih-Jen Chen, Paisan Ruamviboonsuk, Dong-Jie Jiang. 562-565 [doi]
- Automatic Clipping and Text Logging for Baseball Game Videos Using Deep LearningChen-Wei Wang, Hwai-jung Hsu. 566-571 [doi]
- Advancing Retinal Image Segmentation: A Denoising Diffusion Probabilistic Model PerspectiveAlnur Alimanov, Md Baharul Islam. 572-578 [doi]
- Automated Recognition of Optic Disc and Blood Vessels in Diabetic Fundoscopy Images Using Real-Time Image AnalysisKaixuan Li 0006, Wei-bang Chen, Yongjin Lu, Xiaoliang Wang 0003, He Gao. 579-585 [doi]
- Robust COVID-19 Detection in CT Images with CLIPLi Lin, Yamini Sri Krubha, Zhenhuan Yang, Cheng Ren, Thuc Duy Le, Irene Amerini, Xin Wang, Shu Hu. 586-592 [doi]
- Enhancing Video Stability with Object-Centric StabilizationAparna Tiwari, Hitika Tiwari, K. S. Venkatesh, Anuj Kumar Sharma. 593-599 [doi]
- CIA: Controllable Image Augmentation Framework Based on Stable DiffusionMohamed Benkedadra, Dany Rimez, Tiffanie Godelaine, Natarajan Chidambaram, Hamed Razavi Khosroshahi, Horacio Tellez, Matei Mancas, Benoît Macq, Sidi Ahmed Mahmoudi. 600-606 [doi]
- Robust Light-Weight Facial Affective Behavior Recognition with CLIPLi Lin, Sarah Papabathini, Xin Wang, Shu Hu 0001. 607-611 [doi]
- Guarding Against ChatGPT Threats: Identifying and Addressing VulnerabilitiesDingzong Zhang, Khushi Jain, Priyanka Singh. 612-615 [doi]
- Advection-Diffusion for Feature-based Cancer DiagnosisFayadh Alenezi. 616-621 [doi]
- Exploiting Correlation Between Facial Action Units for Detecting Deepfake VideosQuoc Hoan Vu, Priyanka Singh. 622-625 [doi]
- Perceptual Image Compression via Stable Diffusion at Low BitrateLuoxu Jin, Hiroshi Watanabe. 626-629 [doi]
- Building a Generative AI Showroom for Foundation Models with Different ModalitiesBenny Stein, Niklas Beck, Daniel Becker 0010, Dennis Wegener. 630-633 [doi]
- Where You Look Matters in Group Photos: A Demo of GARGI iOS AppOmkar N. Kulkarni, Thomas Lloyd-Jones, My Tran, Gregory Vincent, Vivek K. Singh 0001, Pradeep K. Atrey. 634-637 [doi]
- Early Alzheimer's Detection: The Promise of AI-Powered MRI AnalysisDominic Baker, Wei-bang Chen, He Gao. 638-641 [doi]
- ProSchedule: A Comprehensive Mobile Solution for Seamless Academic SchedulingHe Gao, Wei-bang Chen. 642-645 [doi]
- A Clustering-based Sequence Variants Analysis Method for Electronic Medical Records of Multimedical InstitutionsHieu Hanh Le, Yuki Yasumitsu, Ryosuke Matsuo, Tomoyoshi Yamazaki, Haruo Yokota. 653-659 [doi]
- Privacy-Preserving Disease Prediction with Secure Data Deduplication on Untrusted Cloud ServersKhushi Jain, Priyanka Singh, Xue Li. 660-666 [doi]
- Understanding eGFR Trajectories and Kidney Function Decline via Large Multimodal ModelsChih-Yuan Li, Jun-Ting Wu, Chan Hsu, Ming-Yen Lin, Yihuang Kang. 667-673 [doi]
- Self-Monitoring the Mental-Health State of a Focused Population with Multiple Self-Questionnaires and Sentiment DescriptionsSukhan Lee, Soojin Lee, Yaejin Lee. 674-680 [doi]
- Big Data and Bigger Dilemmas: Ethical Concerns of Data in HealthcareNisha Daga, George Kodimattam Joseph. 681-684 [doi]
- Patient 3D Data Visualisation with AR-based Interactive Technology for Brain MRIVishakha Pareek, Shreyansh Sharma, Vibhor Singh, Shashwat Singh. 685-690 [doi]