Abstract is missing.
- Towards Responsible Multimodal Modeling for Mental HealthcareHeysem Kaya, Gizem Sogancioglu. 3-22 [doi]
- When Voice Matters: Evidence of Gender Disparity in Positional Bias of SpeechLLMsShree Harsha Bokkahalli Satish, Gustav Eje Henter, Éva Székely. 25-38 [doi]
- WhiSQA: Non-intrusive Speech Quality Prediction Using Whisper Encoder FeaturesGeorge Close, Kris Y. Hong, Thomas Hain, Stefan Goetze. 39-51 [doi]
- Prompting the Mind: EEG-to-Text Translation with Multimodal LLMs and Semantic ControlMohammed Salah Al-Radhi, Sadi Mahmud Shurid, Géza Németh. 52-66 [doi]
- Effectiveness of Tacotron2 for Intonation Model Synthesis in RussianAnastasiia Sherban, Uliana E. Kochetkova. 67-82 [doi]
- Enhancing Sinhala Text-to-Speech with End-to-End VITS ArchitectureSasangi Nayanathara, Inuri Harischandra, Thamira Weerakoon, Randil Pushpananda. 83-98 [doi]
- Spoken Emotion Recognition Using Soft LabelsDániel Halmai, Gábor Gosztolya. 101-112 [doi]
- NAMTalk: From Muscle Vibrations to Emotional SpeechKunjan Gajre, Rajnidhi Gupta, Ravindrakumar M. Purohit, Hemant A. Patil. 113-128 [doi]
- What Do LLMs Know About Human Emotions? The Russian Case StudyOlga Mitrofanova, Polina Iurevtseva, Maxim Bakaev. 129-144 [doi]
- Emotions Manifestation by Adolescents with Intellectual DisabilitiesEgor Kleshnev, Elena E. Lyakso. 145-156 [doi]
- Retention-Augmented Voice Assistant: A Lightweight Architecture for Stateful Interaction with Comprehensive Evaluation and Privacy-Preserving DesignAbdelkader Seif El Islem Rahmani, Yasser Yahiaoui, Abdelghani Bouziane. 157-169 [doi]
- Investigation of Explainable Multimodal Methods for Detecting Mental DisordersMikhail Dolgushin, Daria Guseva, Alexey Karpov 0001. 173-187 [doi]
- Attention Deficit Hyperactivity Disorder: Identifying Approaches for Early Diagnosis, a Pilot StudyElena E. Lyakso, Olga V. Frolova, Anton Matveev, Petr Shabanov, Andrei Lebedev, Aleksandr Nikolaev, Egor Kleshnev, Severin Grechanyi, Ruban Nersisson. 188-202 [doi]
- Text-to-Dysarthric-Speech Generation for Dysarthric Automatic Speech Recognition: Is Purely Synthetic Data Enough?Wing-Zin Leung, Heidi Christensen, Stefan Goetze. 203-216 [doi]
- Colour Preferences in Schizophrenic SpeechAnna V. Shevlyakova, Vladimir V. Bochkarev, Stanislav Khristoforov. 217-227 [doi]
- Automated Assessment of Phrase Intelligibility for Russian Speech Based on Esophageal VoiceEvgeny Kostyuchenko. 228-237 [doi]
- Subtle Changes in L1 Stops of Late Salento Italian-French Bilinguals: An Acoustic Study Using AutoVOT Adapted for Italian and FrenchMarie Fongaro, Barbara Gili Fivela, Maud Pélissier, Gabriel Hévr. 241-255 [doi]
- Sound and Colour in Phonosemantics: Perceptual and Acoustic Correlates of Mongolian VowelsRodmonga Potapova, Vsevolod Potapov, Tsend-Ayush Ganbaatar, Leonid Motovskikh, Nikolay Bobrov. 256-266 [doi]
- Rhythmic Diglossia Based on Discourse Types and Dialects of English: Australian and New Zealand CorporaAnna Borzykh, Tatiana Shevchenko. 267-277 [doi]
- Automatic Annotation of Discourse and Speech Formulas in Internet Communication: A Telegram Comment CorpusAleksandra S. Maslenikova, Tatiana I. Popova. 278-292 [doi]
- Effect of Spoof Speech on Forensic Voice Comparison Using Deep Speaker EmbeddingsMohammed Hamzah Alsalihi, Dávid Sztahó. 295-306 [doi]
- Source Vendor Tracing of Audio DeepfakesMarina Volkova, Artem Chirkovskiy, Egor Ausev, Ekaterina Shangina. 307-321 [doi]
- Language-Specific Adaptation Strategies for Speaker Recognition Using MobileNetAnton Yakovenko, Evgeny Bessonnitsyn, Valeria Efimova, Mark Zaslavskiy. 322-332 [doi]
- Enhancing Audio Replay Attack Detection with Silence-Based Blind Channel Impulse Response EstimationSule Bekiryazici, Cemal Hanilçi, Neyir Ozcan. 333-344 [doi]