Journal: Speech Communication

Volume 54, Issue 9

975 -- 997 Okko Räsänen. Computational modeling of phonetic and lexical learning in early language acquisition: Existing models and future directions
998 -- 1013 Toshio Irino, Yoshie Aoki, Hideki Kawahara, Roy D. Patterson. Comparison of performance with voiced and whispered speech in word recognition and mean-formant-frequency discrimination
1014 -- 1028 Atsunori Ogawa, Atsushi Nakamura. Joint estimation of confidence and error causes in speech recognition
1029 -- 1048 Irene Ayllón Clemente, Martin Heckmann, Britta Wrede. Incremental word learning: Efficient HMM initialization and large margin discriminative adaptation
1049 -- 1063 Khiet P. Truong, David A. van Leeuwen, Franciska M. G. de Jong. Speech-based recognition of self-reported and observed emotion in a dimensional space

Volume 54, Issue 8

923 -- 931 Yana Yunusova, Melanie Baljko, Grigore Pintilie, Krista Rudy, Petros Faloutsos, John Daskalogiannakis. Acquisition of the 3D surface of the palate by in-vivo digitization with Wave
932 -- 945 Qinghua Sun, Keikichi Hirose, Nobuaki Minematsu. A method for generation of Mandarin F0 contours based on tone nucleus model and superpositional model
946 -- 956 Peggy P. K. Mok. Effects of consonant cluster syllabification on vowel-to-vowel coarticulation in English
957 -- 974 Zhongbo Li, Shenghui Zhao, Stefan Bruhn, Jing Wang, Jingming Kuang. Comparison and optimization of packet loss recovery methods based on AMR-WB for VoIP

Volume 54, Issue 7

845 -- 856 Lan Wang, Hui Chen, Sheng Li, Helen M. Meng. Phoneme-level articulatory animation in pronunciation training
857 -- 866 Kei Hashimoto, Junichi Yamagishi, William Byrne, Simon King, Keiichi Tokuda. Impacts of machine translation and speech synthesis on speech-to-speech translation
867 -- 880 Shajith Ikbal, Hemant Misra, Hynek Hermansky, Mathew Magimai-Doss. Phase AutoCorrelation (PAC) features for noise robust speech recognition
881 -- 892 Ronan Flynn, Edward Jones. Reducing bandwidth for robust distributed speech recognition in conditions of packet loss
893 -- 902 Thorsten Smit, Friedrich Türckheim, Robert Mores. Fast and robust formant detection from LP data
903 -- 916 Ali Hassan, Robert I. Damper. Classification of emotional speech using 3DEC hierarchical classifier
917 -- 922 Hugo Quené, Gün R. Semin, Francesco Foroni. Audible smiles and frowns affect speech comprehension

Volume 54, Issue 6

681 -- 702 Pilar Prieto, María Vanrell, Lluïsa Astruc, Elinor Payne, Brechtje Post. Phonotactic and phrasal properties of speech rhythm. Evidence from Catalan, English, and Spanish
703 -- 714 Keiichiro Oura, Junichi Yamagishi, Mirjam Wester, Simon King, Keiichi Tokuda. Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping
715 -- 731 Tobias Kaufmann, Beat Pfister. Syntactic language modeling with formal grammars
732 -- 742 Petr Zelinka, Milan Sigmund, Jiri Schimmel. Impact of vocal effort variability on automatic speech recognition
743 -- 762 Rigas Kotsakis, George Kalliris, Charalampos Dimoulas. Investigation of broadcast-audio semantic analysis scenarios employing radio-programme-adaptive pattern classification
763 -- 780 Mohammad H. Moattar, Mohammad M. Homayounpour. Variational conditional random fields for online speaker detection and tracking
781 -- 790 Mirjam Wester. Talker discrimination across languages
791 -- 800 Takanobu Oba, Takaaki Hori, Atsushi Nakamura. Efficient training of discriminative language models by sample selection
801 -- 813 Herman Kamper, Félicien Jeje Muamba Mukanya, Thomas Niesler. Multi-accent acoustic modelling of South African English
814 -- 835 Eduardo Pavez, Jorge F. Silva. Analysis and design of Wavelet-Packet Cepstral coefficients for automatic speech recognition
836 -- 843 Ronan Flynn, Edward Jones. Feature selection for reduced-bandwidth distributed speech recognition
844 David M. Howard, Evelyn Abberton, Adrian Fourcin. Erratum to "Disordered voice measurement and auditory analysis" [Speech Comm. 54(2012) 611-621]

Volume 54, Issue 5

583 -- 600 William Ricardo Rodríguez, Oscar Saz, Eduardo Lleida. A prelingual tool for the education of altered voices
601 -- 610 Evaldas Vaiciukynas, Antanas Verikas, Adas Gelzinis, Marija Bacauskiene, Virgilijus Uloza. Exploring similarity-based classification of larynx disorders from human voice
611 -- 621 David M. Howard, Evelyn Abberton, Adrian Fourcin. Disordered voice measurement and auditory analysis
622 -- 631 Tiago H. Falk, Wai-Yip Chan, Fraser Shein. Characterization of atypical vocal source excitation, temporal dynamics and prosody for objective measurement of dysarthric word intelligibility
632 -- 640 Marieke de Bruijn, Louis ten Bosch, Dirk J. Kuik, Birgit I. Witte, Johannes A. Langendijk, C. René Leemans, Irma Verdonck-de Leeuw. Acoustic-phonetic and artificial neural network feature analysis to assess speech quality of stop consonants produced by patients treated for oral or oropharyngeal cancer
641 -- 654 Sevasti-Zoi Karakozoglou, Nathalie Henrich, Christophe d'Alessandro, Yannis Stylianou. Automatic glottal segmentation using local-based active contours and application to glottovibrography
655 -- 663 Ali Alpan, Jean Schoentgen, Youri Maryn, Francis Grenez, P. Murphy. Assessment of disordered voice via the first rahmonic
664 -- 679 Alain Ghio, Gilles Pouchoulin, Bernard Teston, Serge Pinto, Corinne Fredouille, Céline De Looze, D. Robert, François Viallet, A. Giovanni. How to manage sound, physiological and clinical data of 2500 dysphonic and dysarthric speakers?

Volume 54, Issue 4

517 -- 528 Anis Ben Aicha, Sofia Ben Jebara. Perceptual speech quality measures separating speech distortion and additive noise degradations
529 -- 542 Meihong Wu, Huahui Li, Zhiling Hong, Xinchi Xian, Jingyu Li, Xihong Wu, Liang Li. Effects of aging on the ability to benefit from prior knowledge of message content in masked speech recognition
543 -- 565 Md. Sahidullah, Goutam Saha. Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition
566 -- 582 David Escudero Mancebo, Lourdes Aguilar, María Vanrell, Pilar Prieto. Analysis of inter-transcriber consistency in the Cat_ToBI prosodic labeling system

Volume 54, Issue 3

321 -- 340 Bert Réveil, Jean-Pierre Martens, Henk van den Heuvel. Improving proper name recognition by means of automatically learned pronunciation variants
341 -- 350 Pandurangarao N. Kulkarni, Prem C. Pandey, Dakshayani S. Jangamashetti. Multi-band frequency compression for improving speech perception by listeners with moderate sensorineural hearing loss
351 -- 367 Antonio Moreno-Daniel, Jay G. Wilpon, Biing-Hwang Juang. Index-based incremental language model for scalable directory assistance
368 -- 383 Daniel Recasens. A cross-language acoustic study of initial and final allophones of /l/
384 -- 392 Takashi Nose, Takao Kobayashi. Very low bit-rate F0 coding for phonetic vocoders using MSD-HMM with quantized F0 symbols
393 -- 401 Amaro A. de Lima, Thiago de M. Prego, Sergio L. Netto, Bowon Lee, Amir Said, Ronald W. Schafer, Ton Kalker, Majid Fozunbal. On the quality-assessment of reverberated speech
402 -- 413 Peng Dai, Ing Yann Soon. A temporal frequency warped (TFW) 2D psychoacoustic filter for robust speech recognition system
414 -- 429 Ioulia Grichkovtsova, Michel Morel, Anne Lacheret. The role of voice quality and prosodic contour in affective speech perception
430 -- 444 Frank Rudzicz. Using articulatory likelihoods in the recognition of dysarthric speech
445 -- 458 Je Hun Jeon, Yang Liu. Automatic prosodic event detection using a novel labeling and selection method in co-training
459 -- 476 Jordi Adell, David Escudero Mancebo, Antonio Bonafonte. Production of filled pauses in concatenative speech synthesis based on the underlying fluent sentence
477 -- 490 Jae Hun Choi, Joon-Hyuk Chang. On using acoustic environment classification for statistical model-based speech enhancement
491 -- 502 Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura, Abhinav Sethy, Bhuvana Ramabhadran. Leveraging word confusion networks for named entity modeling and detection from conversational telephone speech
503 -- 515 Angel M. Gomez, Belinda Schwerin, Kuldip K. Paliwal. Improving objective intelligibility prediction by combining correlation and coherence based methods with a measure based on the negative distortion ratio

Volume 54, Issue 2

161 -- 174 Nigel G. Ward, Alejandro Vega, Timo Baumann. Prosodic and temporal features for language modeling for dialog
175 -- 188 J. Sebastian Andersson, Junichi Yamagishi, Robert A. J. Clark. Synthesis and evaluation of conversational characteristics in HMM-based speech synthesis
189 -- 198 Sophie Bouton, Pascale Colé, Willy Serniclaes. The influence of lexical knowledge on phoneme discrimination in deaf children with cochlear implants
199 -- 211 Jon Gudnason, Mark R. P. Thomas, Daniel P. W. Ellis, Patrick A. Naylor. Data-driven voice source waveform analysis and synthesis
212 -- 218 George Saon, Hagen Soltau. Boosting systems for large vocabulary continuous speech recognition
219 -- 228 Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran, Ariya Rastrow, Nobuyasu Itoh, Masafumi Nishimura. Acoustically discriminative language model training with pseudo-hypothesis
229 -- 244 Masakiyo Fujimoto, Shinji Watanabe, Tomohiro Nakatani. Frame-wise model re-estimation method based on Gaussian pruning with weight normalization for noise robust voice activity detection
245 -- 255 Vataya Chunwijitra, Takashi Nose, Takao Kobayashi. A tone-modeling technique using a quantized F0 context to improve tone correctness in average-voice-based speech synthesis
256 -- 271 Hamid Reza Tohidypour, Seyyed Ali Seyyedsalehi, Hossein Behbood, Hossein Roshandel. A new representation for speech frame recognition based on redundant wavelet filter banks
272 -- 281 Fei Chen, Philipos C. Loizou. Impact of SNR and gain-function over- and under-estimation on speech intelligibility
282 -- 305 Kuldip K. Paliwal, Belinda Schwerin, Kamil K. Wójcicki. Speech enhancement using a minimum mean-square error short-time spectral modulation magnitude estimator
306 -- 320 Andrew Hines, Naomi Harte. Speech intelligibility prediction using a Neurogram Similarity Index Measure

Volume 54, Issue 10

1065 -- 1103 Mohammad H. Moattar, Mohammad M. Homayounpour. A review on speaker diarization systems and approaches
1104 -- 1120 Veena Karjigi, Preeti Rao. Classification of place of articulation in unvoiced stops with spectro-temporal surface modeling
1121 -- 1131 Edward Ozimek, Dariusz Kutzner, Pawel Libiszewski. Speech intelligibility tested by the Pediatric Matrix Sentence test in 3-6 year old children
1132 -- 1142 Doris Baum. Recognising speakers from the topics they talk about

Volume 54, Issue 1

1 -- 10 Abhishek Jaywant, Marc D. Pell. Categorical processing of negative emotions from speech prosody
11 -- 22 Elisabetta Fersini, Enza Messina, Francesco Archetti. Emotional states in judicial courtrooms: An experimental investigation
23 -- 39 Mouloud Djamah, Douglas D. O'Shaughnessy. Fine granularity scalable speech coding using embedded tree-structured vector quantization
40 -- 54 Abhijeet Sangwan, John H. L. Hansen. Automatic analysis of Mandarin accented English using phonological features
55 -- 67 Deepu Vijayasenan, Fabio Valente, Hervé Bourlard. Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features
68 -- 91 Máire Ní Chiosáin, Pauline Welby, Robert Espesser. Is the syllabification of Irish a typological exception? An experimental study
92 -- 107 Silke Paulmann, Debra Titone, Marc D. Pell. How emotional prosody guides your way: Evidence from eye movements
108 -- 118 Peter Jancovic, Xin Zou, Münevver Köküer. Speech enhancement based on Sparse Code Shrinkage employing multiple speech models
119 -- 133 Cong-Thanh Do, Dominique Pastor, André Goalic. A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech
134 -- 146 Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano. Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech
147 -- 160 Ying-Yee Kong, Ala Mullangi. On the development of a frequency-lowering system that enhances place-of-articulation perception