Abstract is missing.
- In-Domain SSL Pre-training and Streaming ASR: Application to Air Traffic Control CommunicationsJarod Duret, Salima Mdhaffar, Gaëlle Laperrière, Ryan Whetten, Audrey Galametz, Catherine Kobus, Marion-Cécile Martin, Jo Oleiwan, Yannick Estève. 3-12 [doi]
- Evaluating the Performance of Several ASR Systems in Environmental and Industrial NoiseSara M. Pearsell, Oliver Niebuhr, Samuel Schmück. 13-28 [doi]
- Ground Truth-Free WER Prediction for ASR via Audio Quality and Model Confidence FeaturesAnton Polevoi, Alexander Kragin, Natalia V. Loukachevitch. 29-44 [doi]
- Enhancing Speech Recognition Through Text-to-Speech and Voice Conversion AugmentationYunus Emre Ozkose, Ali Haznedaroglu. 45-59 [doi]
- Best Data is more Supervised Data - Even for Hungarian ASRGergely Dobsinszki, Péter Mihajlik, Mate S. Kadar, Tibor Fegyó, Katalin Mády. 60-69 [doi]
- Arabic ASR on the SADA Large-Scale Arabic Speech Corpus with Transformer-Based ModelsBranislav Gerazov, Marcello Politi, Sébastien Bratières. 70-84 [doi]
- Effect of Increased Temporal Resolution on Speech Recognition for French Quebec Using Features from Speech Self-supervised Learning ModelsVishwa Gupta, Gilles Boulianne. 87-103 [doi]
- Modeling Intra-word Code-Switching for Karelian ASRIrina S. Kipyatkova, Kseniia Kiseleva, Mikhail Dolgushin, Ildar Kagirov. 104-117 [doi]
- Improving Whisper-Based Serbian ASR Using Synthetic SpeechVuk Stanojev, Tijana V. Nosek, Sinisa Suzic, Darko Pekar, Vlado Delic, Milan Secujski. 118-129 [doi]
- Domain Knowledge and Language Embeddings for Low-Resource Multilingual Phoneme ASRAnton Legchenko, Ivan Bondarenko. 130-143 [doi]
- Whistler Identification in Whistled Spanish (Silbo): A Case StudyAlejandro López-García, María Alfaro-Contreras, Julien Meyer, Jose J. Valero-Mas. 144-158 [doi]
- PinkVocalTransformer: Neural Acoustic-to-Articulatory Inversion Based on the Pink TromboneZhiyuan Xu, Joshua D. Reiss. 161-173 [doi]
- CrossMP-SENet: Transformer-Based Cross-Attention for Joint Magnitude-Phase Speech EnhancementAlexander Zaburdaev, Denis Ivanko, Dmitry Ryumin. 174-188 [doi]
- Adaptive Singing Voice Enhancement for Live StagesJia-Lien Hsu, Pei-Wen Chien. 189-202 [doi]
- Revealing the Hidden Temporal Structure of HubertSoft Embeddings Based on the Russian Phonetic CorpusAnastasia Ananeva, Anton Tomilov, Marina Volkova. 203-215 [doi]
- Analyzing Web-Scraped and Generated Inputs for Automatic and Scalable Intent ClassificationPhiline Kowol, Stefan Hillmann. 219-230 [doi]
- Enhancing Retrieval Performance via LLM Hard-Negative FilteringDanil Tirskikh, Olesia Koroteeva, Yuri Matveev, Ekaterina Brovkina, Larisa Gonchar. 231-241 [doi]
- Sector-Wise Backpropagation for Low-Resource Text Classification in Deep ModelsJosé Luis Vázquez Noguera, Carlos U. Valdez, Marvin M. Agüero, Julio César Mello Román, José D. Colbes, Sebastián A. Grillo. 242-256 [doi]
- High-Frequency Multiword Units and the Typological Distribution of Multiword Units in Spoken RussianNatalia Bogdanova-Beglarian, Olga Blinova, Mariya Khokhlova, Tatiana Y. Sherstinova, Tatiana I. Popova. 257-270 [doi]
- Estimation of the Genre Composition of the English Subcorpus of the Google Books NgramVladimir V. Bochkarev, Andrey Achkeev, Anna V. Shevlyakova. 271-285 [doi]
- Ensembling Synchronisation-Based and Face-Voice Association Paradigms for Robust Active Speaker Detection in Egocentric RecordingsJason Clarke, Yoshihiko Gotoh, Stefan Goetze. 289-301 [doi]
- Phonetic and Visual Characteristics of Cognitive LoadVera Evdokimova, Maria Maksimova. 302-317 [doi]
- Cognitive Humor Processing in the Russian and English Internet Meme Chatting: EEG StudyRodmonga Potapova, Vsevolod Potapov, Ekaterina Karimova, Diana Smolskaya, Nikolay Bobrov, Leonid Motovskikh, Iurii Pozhilov. 318-330 [doi]
- Saudi Sign Language Translation Using T5Ali Alhejab, Tomas Zelezný, Lamya Alkanhal, Ivan Gruber, Yazeed Alharbi, Jakub Straka, Vaclav Javorek, Marek Hrúz, Badriah Alkalifah, Ahmed Ali. 331-343 [doi]