Abstract is missing.
- Multi-Encoder Sequential Attention Network for Context-Aware Speech Recognition in Japanese Dialog ConversationNobuya Tachimori, Sakriani Sakti, Satoshi Nakamura 0001. 1-6 [doi]
- Investigation of a Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-to-End Bengali Automatic Speech Recognition Under Unseen Noisy ConditionsMd Mahbub E. Noor, Yen-Ju Lu, Syu-Siang Wang, Supratip Ghose, Chia-Yu Chang, Ryandhimas E. Zezario, Shafique Ahmed, Wei-Ho Chung, Yu Tsao 0001, Hsin-Min Wang. 7-12 [doi]
- A Study on Native American English Speech Recognition by Indian Listeners with Varying Word Familiarity LevelAbhayjeet Singh, Achuth Rao MV, Rakesh Vaideeswaran, Chiranjeevi Yarra, Prasanta Kumar Ghosh. 13-18 [doi]
- Speech Recognition System for Writing Dentist Medical RecordsDinda Yora Islami, Dessi Puji Lestari. 19-25 [doi]
- A Multi-Genre Urdu Broadcast Speech Recognition SystemErbaz Khan, Sahar Rauf, Farah Adeeba, Sarmad Hussain. 25-30 [doi]
- An Empirical Study of Speaker Identification System for Mono and Traverse Linguistic Background Using EM and SMEMAmita Dev, Shweta A. Bansal, Shyam S. Agrawal. 31-36 [doi]
- Self-Supervised Spoken Question Understanding and Speaking with Automatic Vocabulary LearningKeisuke Toyoda, Yusuke Kimura, Mingxin Zhang, Kent Hino, Kosuke Mori, Takahiro Shinozaki. 37-42 [doi]
- Design and Basic Analysis of the TUT Emotional Storytelling CorpusHikaru Oishi, Mika Enomoto, Keiko Ochi, Yasunari Obuchi. 43-48 [doi]
- Construction and Analysis of Tibetan AMDO Dialect Speech Dataset for Speech SynthesisXinyi Zhang, Wenhuan Lu, Xinyue Zhao, Yi Zhu, Jianguo Wei. 49-52 [doi]
- Into-Cass: A Corpus for the Study of Intonation and Prosody in Chinese Dialects and Ethnic LanguagesAijun Li, Ziyu Xiong. 53-58 [doi]
- Discourse Timing in Children's Rhyme Speech Produced by Prelingually Deaf Mandarin-Speaking Children with Cochlear ImplantsJue Yu, Qianwen Jin. 59-64 [doi]
- Towards the Development of Segment Level Speech Overlap Detection Using Convolutional Neural NetworkRonald John Cabatic, Angelica H. De La Cruz. 65-69 [doi]
- On The Use of Gestures in Dialogue Breakdown DetectionTaiga Mori, Kristiina Jokinen, Yasuharu Den. 70-75 [doi]
- Aspect-Based Sentiment Analysis of User Created Game ReviewsIan Michael Urriza, Maria Art Antonette D. ClariƱo. 76-81 [doi]
- L2 Accent and Intelligibility by Chinese L2 Speakers of EnglishYizhou Lan, Tongtong Xie. 82-87 [doi]
- A Study on English Word-Final Coronal Stop Deletion by Chinese EFL LearnersTong Li, Hui Feng. 88-93 [doi]
- The Role of High Variability Phonetic Training on Chinese EFL Learners' Perception of English Vowels in Noisy EnvironmentQianxi Yu, Ping Tang. 94-99 [doi]
- The Effect of Overnight Consolidation on English Vowel Perception by Chinese Learners After High Speaker Variability Phonetic TrainingYanan Shen, Ping Tang. 100-105 [doi]
- Mandarin Speakers' Acquisitions and Representations of Flapping in American English in An ESL Context: A Perception and Production StudyChia-Wei Chuang. 106-110 [doi]
- Tonal Patterns of Tri-Syllabic Words in the Production of Standard Chinese of Bilingual TeachersYuan Jia, Bin Li. 111-115 [doi]
- SPIRE VCV: An Acoustic-Articulatory Corpus with Three Different Speaking RatesTilak Purohit, Tejas Umesh, Shankar Narayanan, Minulakshmi S, Prasanta Kumar Ghosh. 116-121 [doi]
- Khmer Speech Translation Corpus of the Extraordinary Chambers in the Courts of Cambodia (ECCC)Soky Kak, Masato Mimura, Tatsuya Kawahara, Sheng Li 0010, Chenchen Ding, Chenhui Chu, Sethserey Sam. 122-127 [doi]
- SLoClas: A Database for Joint Sound Localization and ClassificationXinyuan Qian, Bidisha Sharma, Amine El Abridi, Haizhou Li 0001. 128-133 [doi]
- GAMVA: A Japanese Audio-Visual Multi-Angle Speech CorpusShinnosuke Isobe, Ryuichi Hirose, Takumi Nishiwaki, Tomohiro Hattori, Satoshi Tamura, Yuuto Gotoh, Masaki Nose. 134-139 [doi]
- M2ASR-MONGO: A Free Mongolian Speech Database and Accompanied BaselinesTiankai Zhi, Ying Shi, Wenqiang Du, Guanyu Li, Dong Wang. 140-145 [doi]
- wSPIRE: A Parallel Multi-Device Corpus in Neutral and Whispered SpeechBhavuk Singhal, Abinay Reddy Naini, Prasanta Kumar Ghosh. 146-151 [doi]
- Which Phonemes Will Distinguish the Different Regions Within the Same Dialect?Xuefei Liu, Jianhua Tao, Yurong Han, Chenglong Wang, Xueying Zheng, Zhengqi Wen. 152-157 [doi]
- Comparison of Static and Time-Sequential Features in Automatic Fluency Detection of Spontaneous SpeechHuaijin Deng, Takehito Utsuro, Akio Kobayashi, Hiromitsu Nishizaki. 158-163 [doi]
- How Do Speakers Pause and Hesitate in English and Japanese? - A Comparison Using Parallel Corpora of English and Japanese Presentation Speeches -Michiko Watanabe, Yuma Shirahata, Ralph Rose, Kikuo Maekawa. 164-167 [doi]
- Korean Dialect Identification Based on Intonation ModelingJooyoung Lee, Kyungwha Kim, Minhwa Chung. 168-173 [doi]
- Development of Accent Recognition Systems for Vietnamese SpeechQuang Tien Duong, Van Hai Do. 174-179 [doi]
- A Blind Method for Phone Segmentation and Its Evaluation on Vietnamese Speech CorpusDac-Thang Hoang, Tat Thang Vu. 180-185 [doi]
- Simultaneous Speech-to-Speech Translation System with Transformer-Based Incremental ASR, MT, and TTSRyo Fukuda, Sashi Novitasari, Yui Oka, Yasumasa Kano, Yuki Yano, Yuka Ko, Hirotaka Tokuyama, Kosuke Doi, Tomoya Yanagita, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura 0001. 186-192 [doi]
- Using Speech Enhancement to Realize Speech Synthesis of Low-Resource Dungan LanguagesRui Jiang, Chengsi Chen, Xin Shan, Hongwu Yang. 193-198 [doi]
- A Study on Neural-Network-Based Text-to-Speech Adaptation Techniques for VietnamesePham Ngoc Phuong, Chung Tran Quang, Quoc Truong Do, Mai Chi Luong. 199-205 [doi]
- Using Local Phrase Dependency Structure Information in Neural Sequence-to-Sequence Speech SynthesisNobuyoshi Kaiki, Sakriani Sakti, Satoshi Nakamura 0001. 206-211 [doi]
- Text-to-Speech Systems for Filipino Using Unit Selection and Deep LearningEdsel Jedd Renovalles, Crisron Rudolf Lucas, Franz A. de Leon, Angelina Aquino, Izza Jalandoni. 212-217 [doi]
- Investigation of an Input Sequence on Thai Neural Sequence-to-Sequence Speech SynthesisPongsathon Janyoi, Ausdang Thangthai. 218-223 [doi]