Speech and Computer - 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13-15, 2025, Proceedings, Part I

Alexey Karpov, Gábor Gosztolya, editors, Speech and Computer - 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13-15, 2025, Proceedings, Part I. Volume 16187 of Lecture Notes in Computer Science, Springer, 2026.

Conference: specom2026

Towards Responsible Multimodal Modeling for Mental Healthcare. Heysem Kaya, Gizem Sogancioglu. 3-22

When Voice Matters: Evidence of Gender Disparity in Positional Bias of SpeechLLMs. Shree Harsha Bokkahalli Satish, Gustav Eje Henter, Éva Székely. 25-38

WhiSQA: Non-intrusive Speech Quality Prediction Using Whisper Encoder Features. George Close, Kris Y. Hong, Thomas Hain, Stefan Goetze. 39-51

Prompting the Mind: EEG-to-Text Translation with Multimodal LLMs and Semantic Control. Mohammed Salah Al-Radhi, Sadi Mahmud Shurid, Géza Németh. 52-66

Effectiveness of Tacotron2 for Intonation Model Synthesis in Russian. Anastasiia Sherban, Uliana E. Kochetkova. 67-82

Enhancing Sinhala Text-to-Speech with End-to-End VITS Architecture. Sasangi Nayanathara, Inuri Harischandra, Thamira Weerakoon, Randil Pushpananda. 83-98

Spoken Emotion Recognition Using Soft Labels. Dániel Halmai, Gábor Gosztolya. 101-112

NAMTalk: From Muscle Vibrations to Emotional Speech. Kunjan Gajre, Rajnidhi Gupta, Ravindrakumar M. Purohit, Hemant A. Patil. 113-128

What Do LLMs Know About Human Emotions? The Russian Case Study. Olga Mitrofanova, Polina Iurevtseva, Maxim Bakaev. 129-144

Emotions Manifestation by Adolescents with Intellectual Disabilities. Egor Kleshnev, Elena E. Lyakso. 145-156

Retention-Augmented Voice Assistant: A Lightweight Architecture for Stateful Interaction with Comprehensive Evaluation and Privacy-Preserving Design. Abdelkader Seif El Islem Rahmani, Yasser Yahiaoui, Abdelghani Bouziane. 157-169

Investigation of Explainable Multimodal Methods for Detecting Mental Disorders. Mikhail Dolgushin, Daria Guseva, Alexey Karpov. 173-187

Attention Deficit Hyperactivity Disorder: Identifying Approaches for Early Diagnosis, a Pilot Study. Elena E. Lyakso, Olga V. Frolova, Anton Matveev, Petr Shabanov, Andrei Lebedev, Aleksandr Nikolaev, Egor Kleshnev, Severin Grechanyi, Ruban Nersisson. 188-202

Text-to-Dysarthric-Speech Generation for Dysarthric Automatic Speech Recognition: Is Purely Synthetic Data Enough? Wing-Zin Leung, Heidi Christensen, Stefan Goetze. 203-216

Colour Preferences in Schizophrenic Speech. Anna V. Shevlyakova, Vladimir V. Bochkarev, Stanislav Khristoforov. 217-227

Automated Assessment of Phrase Intelligibility for Russian Speech Based on Esophageal Voice. Evgeny Kostyuchenko. 228-237

Subtle Changes in L1 Stops of Late Salento Italian-French Bilinguals: An Acoustic Study Using AutoVOT Adapted for Italian and French. Marie Fongaro, Barbara Gili Fivela, Maud Pélissier, Gabriel Hévr. 241-255

Sound and Colour in Phonosemantics: Perceptual and Acoustic Correlates of Mongolian Vowels. Rodmonga Potapova, Vsevolod Potapov, Tsend-Ayush Ganbaatar, Leonid Motovskikh, Nikolay Bobrov. 256-266

Rhythmic Diglossia Based on Discourse Types and Dialects of English: Australian and New Zealand Corpora. Anna Borzykh, Tatiana Shevchenko. 267-277

Automatic Annotation of Discourse and Speech Formulas in Internet Communication: A Telegram Comment Corpus. Aleksandra S. Maslenikova, Tatiana I. Popova. 278-292

Effect of Spoof Speech on Forensic Voice Comparison Using Deep Speaker Embeddings. Mohammed Hamzah Alsalihi, Dávid Sztahó. 295-306

Source Vendor Tracing of Audio Deepfakes. Marina Volkova, Artem Chirkovskiy, Egor Ausev, Ekaterina Shangina. 307-321

Language-Specific Adaptation Strategies for Speaker Recognition Using MobileNet. Anton Yakovenko, Evgeny Bessonnitsyn, Valeria Efimova, Mark Zaslavskiy. 322-332

Enhancing Audio Replay Attack Detection with Silence-Based Blind Channel Impulse Response Estimation. Sule Bekiryazici, Cemal Hanilçi, Neyir Ozcan. 333-344
