Speech Communication - researchr journal

researchr

You are not signed in
Sign in
Sign up

975	--	997	Okko Räsänen. Computational modeling of phonetic and lexical learning in early language acquisition: Existing models and future directions
998	--	1013	Toshio Irino, Yoshie Aoki, Hideki Kawahara, Roy D. Patterson. Comparison of performance with voiced and whispered speech in word recognition and mean-formant-frequency discrimination
1014	--	1028	Atsunori Ogawa, Atsushi Nakamura. Joint estimation of confidence and error causes in speech recognition
1029	--	1048	Irene Ayllón Clemente, Martin Heckmann, Britta Wrede. Incremental word learning: Efficient HMM initialization and large margin discriminative adaptation
1049	--	1063	Khiet P. Truong, David A. van Leeuwen, Franciska M. G. de Jong. Speech-based recognition of self-reported and observed emotion in a dimensional space

923	--	931	Yana Yunusova, Melanie Baljko, Grigore Pintilie, Krista Rudy, Petros Faloutsos, John Daskalogiannakis. Acquisition of the 3D surface of the palate by in-vivo digitization with Wave
932	--	945	Qinghua Sun, Keikichi Hirose, Nobuaki Minematsu. 0 contours based on tone nucleus model and superpositional model
946	--	956	Peggy P. K. Mok. Effects of consonant cluster syllabification on vowel-to-vowel coarticulation in English
957	--	974	Zhongbo Li, Shenghui Zhao, Stefan Bruhn, Jing Wang, Jingming Kuang. Comparison and optimization of packet loss recovery methods based on AMR-WB for VoIP

845	--	856	Lan Wang, Hui Chen, Sheng Li, Helen M. Meng. Phoneme-level articulatory animation in pronunciation training
857	--	866	Kei Hashimoto, Junichi Yamagishi, William Byrne, Simon King, Keiichi Tokuda. Impacts of machine translation and speech synthesis on speech-to-speech translation
867	--	880	Shajith Ikbal, Hemant Misra, Hynek Hermansky, Mathew Magimai-Doss. Phase AutoCorrelation (PAC) features for noise robust speech recognition
881	--	892	Ronan Flynn, Edward Jones. Reducing bandwidth for robust distributed speech recognition in conditions of packet loss
893	--	902	Thorsten Smit, Friedrich Türckheim, Robert Mores. Fast and robust formant detection from LP data
903	--	916	Ali Hassan, Robert I. Damper. Classification of emotional speech using 3DEC hierarchical classifier
917	--	922	Hugo Quené, Gün R. Semin, Francesco Foroni. Audible smiles and frowns affect speech comprehension

681	--	702	Pilar Prieto, María Vanrell, Lluïsa Astruc, Elinor Payne, Brechtje Post. Phonotactic and phrasal properties of speech rhythm. Evidence from Catalan, English, and Spanish
703	--	714	Keiichiro Oura, Junichi Yamagishi, Mirjam Wester, Simon King, Keiichi Tokuda. Analysis of unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis using KLD-based transform mapping
715	--	731	Tobias Kaufmann, Beat Pfister. Syntactic language modeling with formal grammars
732	--	742	Petr Zelinka, Milan Sigmund, Jiri Schimmel. Impact of vocal effort variability on automatic speech recognition
743	--	762	Rigas Kotsakis, George Kalliris, Charalampos Dimoulas. Investigation of broadcast-audio semantic analysis scenarios employing radio-programme-adaptive pattern classification
763	--	780	Mohammad H. Moattar, Mohammad M. Homayounpour. Variational conditional random fields for online speaker detection and tracking
781	--	790	Mirjam Wester. Talker discrimination across languages
791	--	800	Takanobu Oba, Takaaki Hori, Atsushi Nakamura. Efficient training of discriminative language models by sample selection
801	--	813	Herman Kamper, Félicien Jeje Muamba Mukanya, Thomas Niesler. Multi-accent acoustic modelling of South African English
814	--	835	Eduardo Pavez, Jorge F. Silva. Analysis and design of Wavelet-Packet Cepstral coefficients for automatic speech recognition
836	--	843	Ronan Flynn, Edward Jones. Feature selection for reduced-bandwidth distributed speech recognition
844	--	0	David M. Howard, Evelyn Abberton, Adrian Fourcin. Erratum to "Disordered voice measurement and auditory analysis" [Speech Comm. 54(2012) 611-621]

583	--	600	William Ricardo Rodríguez, Oscar Saz, Eduardo Lleida. A prelingual tool for the education of altered voices
601	--	610	Evaldas Vaiciukynas, Antanas Verikas, Adas Gelzinis, Marija Bacauskiene, Virgilijus Uloza. Exploring similarity-based classification of larynx disorders from human voice
611	--	621	David M. Howard, Evelyn Abberton, Adrian Fourcin. Disordered voice measurement and auditory analysis
622	--	631	Tiago H. Falk, Wai-Yip Chan, Fraser Shein. Characterization of atypical vocal source excitation, temporal dynamics and prosody for objective measurement of dysarthric word intelligibility
632	--	640	Marieke de Bruijn, Louis ten Bosch, Dirk J. Kuik, Birgit I. Witte, Johannes A. Langendijk, C. René Leemans, Irma Verdonck-de Leeuw. Acoustic-phonetic and artificial neural network feature analysis to assess speech quality of stop consonants produced by patients treated for oral or oropharyngeal cancer
641	--	654	Sevasti-Zoi Karakozoglou, Nathalie Henrich, Christophe d'Alessandro, Yannis Stylianou. Automatic glottal segmentation using local-based active contours and application to glottovibrography
655	--	663	Ali Alpan, Jean Schoentgen, Youri Maryn, Francis Grenez, P. Murphy. Assessment of disordered voice via the first rahmonic
664	--	679	Alain Ghio, Gilles Pouchoulin, Bernard Teston, Serge Pinto, Corinne Fredouille, Céline De Looze, D. Robert, François Viallet, A. Giovanni. How to manage sound, physiological and clinical data of 2500 dysphonic and dysarthric speakers?

517	--	528	Anis Ben Aicha, Sofia Ben Jebara. Perceptual speech quality measures separating speech distortion and additive noise degradations
529	--	542	Meihong Wu, Huahui Li, Zhiling Hong, Xinchi Xian, Jingyu Li, Xihong Wu, Liang Li. Effects of aging on the ability to benefit from prior knowledge of message content in masked speech recognition
543	--	565	Md. Sahidullah, Goutam Saha. Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition
566	--	582	David Escudero Mancebo, Lourdes Aguilar, María Vanrell, Pilar Prieto. Analysis of inter-transcriber consistency in the Cat_ToBI prosodic labeling system

321	--	340	Bert Réveil, Jean-Pierre Martens, Henk van den Heuvel. Improving proper name recognition by means of automatically learned pronunciation variants
341	--	350	Pandurangarao N. Kulkarni, Prem C. Pandey, Dakshayani S. Jangamashetti. Multi-band frequency compression for improving speech perception by listeners with moderate sensorineural hearing loss
351	--	367	Antonio Moreno-Daniel, Jay G. Wilpon, Biing-Hwang Juang. Index-based incremental language model for scalable directory assistance
368	--	383	Daniel Recasens. A cross-language acoustic study of initial and final allophones of /l/
384	--	392	Takashi Nose, Takao Kobayashi. Very low bit-rate F0 coding for phonetic vocoders using MSD-HMM with quantized F0 symbols
393	--	401	Amaro A. de Lima, Thiago de M. Prego, Sergio L. Netto, Bowon Lee, Amir Said, Ronald W. Schafer, Ton Kalker, Majid Fozunbal. On the quality-assessment of reverberated speech
402	--	413	Peng Dai, Ing Yann Soon. A temporal frequency warped (TFW) 2D psychoacoustic filter for robust speech recognition system
414	--	429	Ioulia Grichkovtsova, Michel Morel, Anne Lacheret. The role of voice quality and prosodic contour in affective speech perception
430	--	444	Frank Rudzicz. Using articulatory likelihoods in the recognition of dysarthric speech
445	--	458	Je Hun Jeon, Yang Liu. Automatic prosodic event detection using a novel labeling and selection method in co-training
459	--	476	Jordi Adell, David Escudero Mancebo, Antonio Bonafonte. Production of filled pauses in concatenative speech synthesis based on the underlying fluent sentence
477	--	490	Jae Hun Choi, Joon-Hyuk Chang. On using acoustic environment classification for statistical model-based speech enhancement
491	--	502	Gakuto Kurata, Nobuyasu Itoh, Masafumi Nishimura, Abhinav Sethy, Bhuvana Ramabhadran. Leveraging word confusion networks for named entity modeling and detection from conversational telephone speech
503	--	515	Angel M. Gomez, Belinda Schwerin, Kuldip K. Paliwal. Improving objective intelligibility prediction by combining correlation and coherence based methods with a measure based on the negative distortion ratio

161	--	174	Nigel G. Ward, Alejandro Vega, Timo Baumann. Prosodic and temporal features for language modeling for dialog
175	--	188	J. Sebastian Andersson, Junichi Yamagishi, Robert A. J. Clark. Synthesis and evaluation of conversational characteristics in HMM-based speech synthesis
189	--	198	Sophie Bouton, Pascale Colé, Willy Serniclaes. The influence of lexical knowledge on phoneme discrimination in deaf children with cochlear implants
199	--	211	Jon Gudnason, Mark R. P. Thomas, Daniel P. W. Ellis, Patrick A. Naylor. Data-driven voice source waveform analysis and synthesis
212	--	218	George Saon, Hagen Soltau. Boosting systems for large vocabulary continuous speech recognition
219	--	228	Gakuto Kurata, Abhinav Sethy, Bhuvana Ramabhadran, Ariya Rastrow, Nobuyasu Itoh, Masafumi Nishimura. Acoustically discriminative language model training with pseudo-hypothesis
229	--	244	Masakiyo Fujimoto, Shinji Watanabe, Tomohiro Nakatani. Frame-wise model re-estimation method based on Gaussian pruning with weight normalization for noise robust voice activity detection
245	--	255	Vataya Chunwijitra, Takashi Nose, Takao Kobayashi. A tone-modeling technique using a quantized F0 context to improve tone correctness in average-voice-based speech synthesis
256	--	271	Hamid Reza Tohidypour, Seyyed Ali Seyyedsalehi, Hossein Behbood, Hossein Roshandel. A new representation for speech frame recognition based on redundant wavelet filter banks
272	--	281	Fei Chen, Philipos C. Loizou. Impact of SNR and gain-function over- and under-estimation on speech intelligibility
282	--	305	Kuldip K. Paliwal, Belinda Schwerin, Kamil K. Wójcicki. Speech enhancement using a minimum mean-square error short-time spectral modulation magnitude estimator
306	--	320	Andrew Hines, Naomi Harte. Speech intelligibility prediction using a Neurogram Similarity Index Measure

1065	--	1103	Mohammad H. Moattar, Mohammad M. Homayounpour. A review on speaker diarization systems and approaches
1104	--	1120	Veena Karjigi, Preeti Rao. Classification of place of articulation in unvoiced stops with spectro-temporal surface modeling
1121	--	1131	Edward Ozimek, Dariusz Kutzner, Pawel Libiszewski. Speech intelligibility tested by the Pediatric Matrix Sentence test in 3-6 year old children
1132	--	1142	Doris Baum. Recognising speakers from the topics they talk about

1	--	10	Abhishek Jaywant, Marc D. Pell. Categorical processing of negative emotions from speech prosody
11	--	22	Elisabetta Fersini, Enza Messina, Francesco Archetti. Emotional states in judicial courtrooms: An experimental investigation
23	--	39	Mouloud Djamah, Douglas D. O'Shaughnessy. Fine granularity scalable speech coding using embedded tree-structured vector quantization
40	--	54	Abhijeet Sangwan, John H. L. Hansen. Automatic analysis of Mandarin accented English using phonological features
55	--	67	Deepu Vijayasenan, Fabio Valente, Hervé Bourlard. Multistream speaker diarization of meetings recordings beyond MFCC and TDOA features
68	--	91	Máire Ní Chiosáin, Pauline Welby, Robert Espesser. Is the syllabification of Irish a typological exception? An experimental study
92	--	107	Silke Paulmann, Debra Titone, Marc D. Pell. How emotional prosody guides your way: Evidence from eye movements
108	--	118	Peter Jancovic, Xin Zou, Münevver Köküer. Speech enhancement based on Sparse Code Shrinkage employing multiple speech models
119	--	133	Cong-Thanh Do, Dominique Pastor, André Goalic. A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech
134	--	146	Keigo Nakamura, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano. Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech
147	--	160	Ying-Yee Kong, Ala Mullangi. On the development of a frequency-lowering system that enhances place-of-articulation perception

runs on WebDSL