The 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November - 4th December 1998

researchr

You are not signed in
Sign in
Sign up

The 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November - 4th December 1998. ISCA, 1998.

Conference: interspeech1998

Abstract is missing.

Can we hear smile?Marc Schröder, Véronique Aubergé, Marie-Agnès Cathiard. [doi]

Effects of shapes of radiational aperture on radiation characteristicsHiroki Matsuzaki, Kunitoshi Motoki, Nobuhiro Miki. [doi]

Effects of using speech in timetable information systems for WWWPernilla Qvarfordt, Arne Jönsson. [doi]

Measuring the dynamic encoding of speaker identity and dialect in prosodic parametersMichael Barlow, Michael Wagner. [doi]

Suprasegmental cues for the segmentation of identical vowel sequences in JapaneseKazuhiko Kakehi, Yuki Hirose. [doi]

Noise robust two-stream auditory feature extraction method for speech recognitionJilei Tian, Ramalingam Hariharan, Kari Laurila. [doi]

Language development after extreme childhood deprivation: a case studyLisa-Jane Brown, John Locke, Peter Jones, Sandra P. Whiteside. [doi]

An algorithm for automatic generation of Mandarin phonetic balanced corpusJyh-Shing Shyuu, Jhing-Fa Wang. [doi]

Analysis and treatment of esophageal speech for the enhancement of its comprehensionJorge Miquélez, Rocio Sesma, Yolanda Blanco. [doi]

The relation between perceptual and production categories in acquisitionIan Watson. [doi]

Acoustic-articulatory evaluation of the upper vowel-formant region and its presumed speaker-specific potencyFrantz Clermont, Parham Mokhtari. [doi]

SNR-dependent flooring and noise overestimation for joint application of spectral subtraction and model combinationVolker Schless, Fritz Class. [doi]

Information extraction and text generation of news reports for a Swedish-English bilingual spoken dialogue systemBarbara Gawronska, David House. [doi]

BTH: an efficient parsing algorithm for word-spottingYasuyuki Kono, Takehide Yano, Munehiko Sasajima. [doi]

A nonlinear unit selection strategy for concatenative speech synthesis based on syllable level featuresMartin Holzapfel, Nick Campbell. [doi]

The acquisition of Japanese compound accent ruleAyako Shirose, Haruo Kubozono, Shigeru Kiritani. [doi]

Robust automatic continuous-speech recognition based on a voiced-unvoiced decisionHesham Tolba, Douglas D. O Shaughnessy. [doi]

Two-pass utterance verification algorithm for long natural numbers recognitionJavier Caminero, Eduardo López, Luis A. Hernández Gómez. [doi]

Control of larynx height in vowel productionPhilip Hoole, Christian Kroos. [doi]

Dealing with out-of-vocabulary words and speech disfluencies in an n-gram based speech understanding systemAtsuhiko Kai, Yoshifumi Hirose, Seiichi Nakagawa. [doi]

Word sequence pair spotting for synchronization of speech and text in production of closed-caption TV programs for the hearing impairedIchiro Maruyama, Yoshiharu Abe, Takahiro Wakao, Eiji Sawamura, Terumasa Ehara, Katsuhiko Shirai. [doi]

An asymmetric stochastic language model based on multi-tagged wordsJulio Pastor, José Colás, Rubén San Segundo, José Manuel Pardo. [doi]

Syllable-onset acoustic properties associated with syllable-coda voicingNoël Nguyen, Sarah Hawkins. [doi]

MIMIC : a voice-adaptive phonetic-tree speech synthesiserAimin Chen, Saeed Vaseghi, Charles Ho. [doi]

Phoneme recognition with statistical modeling of the prediction error of neural networksFelix Freitag, Enric Monte. [doi]

Acoustic indicators of topic segmentationJulia Hirschberg, Christine H. Nakatani. [doi]

The IBM trainable speech synthesis systemRobert E. Donovan, Ellen Eide. [doi]

Signal extraction from noisy signal based on auditory scene analysisMasashi Unoki, Masato Akagi. [doi]

The design of a multi-domain Mandarin Chinese spoken dialogue systemYi-Chung Lin, Tung-Hui Chiang, Huei-Ming Wang, Chung-Ming Peng, Chao-Huang Chang. [doi]

Fuzzy-integration based normalization for speaker verificationTuan Pham, Michael Wagner. [doi]

Robust automatic speech recognition by the application of a temporal-correlation-based recurrent multilayer neural network to the mel-based cepstral coefficientsMichel Héon, Hesham Tolba, Douglas D. O Shaughnessy. [doi]

Vector quantizer acceleration for an automatic speech recognition applicationAntonio J. Araujo, Vitor C. Pera, Márcio N. de Souza. [doi]

Hierarchical neural networks (HNN) for Chinese continuous speech recognitionYing Jia, Limin Du, Ziqiang Hou. [doi]

Tonal complexity as a dialectal feature: 25 different citation tones from four zhejiang wu dialectsSean Zhu, Phil Rose. [doi]

Text-to-speech voice adaptation from sparse training dataAlexander Kain, Michael W. Macon. [doi]

Using the multi-stream approach for continuous audio-visual speech recognition: experiments on the M2VTS databaseStéphane Dupont, Juergen Luettin. [doi]

Factors affecting speech retrievalCorinna Ng, Ross Wilkinson, Justin Zobel. [doi]

Sub-band based speaker verification using dynamic recombination weightsPerasiriyan Sivakumaran, Aladdin M. Ariyaeeinia, Jill A. Hewitt. [doi]

A four layer sharing HMM system for very large vocabulary isolated word recognitionRuxin Chen, Miyuki Tanaka, Duanpei Wu, Lex Olorenshaw, Mariscela Amador. [doi]

Common patterns in word level prosodyFrode Holm, Kazue Hata. [doi]

Improving speech recognizer by broader acoustic-phonetic group classificationYoungjoo Suh, Kyuwoong Hwang, Oh-Wook Kwon, Jun Park. [doi]

HMM-based visual speech recognition using intensity and location normalizationOscar Vanegas, Akiji Tanaka, Keiichi Tokuda, Tadashi Kitamura. [doi]

Confidence measures for HMM-based speech recognitionDaniel Willett, Andreas Worm, Christoph Neukirchen, Gerhard Rigoll. [doi]

Incremental on-line speaker adaptation in adverse conditionsOlli Viikki, Kari Laurila. [doi]

Controlling a HIFI with a continuous speech understanding systemJavier Ferreiros, José Colás, Javier Macías Guarasa, Alejandro Ruiz, José Manuel Pardo. [doi]

Sharable software repository for Japanese large vocabulary continuous speech recognitionTatsuya Kawahara, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Katsunobu Itou, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano. [doi]

Robust speaker verification insensitive to session-dependent utterance variation and handset-dependent distortionTomoko Matsui, Kiyoaki Aikawa. [doi]

A computational memory and processing model for prosodyJanet E. Cahn. [doi]

Robust HMM estimation with Gaussian merging-splitting and tied-transform HMMsAnanth Sankar. [doi]

Techniques for capturing temporal variations in speech signals with fixed-rate processingSatya Dharanipragada, Ramesh A. Gopinath, Bhaskar D. Rao. [doi]

Recovering vocal tract shapes from MFCC parametersSorin Dusan, Li Deng. [doi]

Incorporating linguistic knowledge into automatic dialect identification of SpanishLisa Yanguas, Gerald C. O Leary, Marc A. Zissman. [doi]

Improvements in speech understanding accuracy through the integration of hierarchical linguistic, prosodic, and phonological constraints in the jupiter domainGrace Chung, Stephanie Seneff. [doi]

Towards a minimal standard for dialogue transcripts: a new SGML architecture for the HCRC map task corpusAmy Isard, David McKelvie, Henry S. Thompson. [doi]

Designing a multimodal dialogue system for information retrievalSadaoki Furui, Koh ichiro Yamaguchi. [doi]

Using combined decisions and confidence measures for name recognition in automatic directory assistance systemsAndreas Kellner, Bernhard Rueber, Hauke Schramm. [doi]

Experiments on the meaning of two pitch accent types: the pointed hat versus the accent-lending fall in dutchJohanneke Caspers. [doi]

A VQ based speaker recognition system based in histogram distances. text independent and for noisy environmentsEnric Monte, Ramon Arqué, Xavier Miró. [doi]

Missing data reconstruction for robust automatic speech recognition in the framework of hybrid HMM/ANN systemsStéphane Dupont. [doi]

Korean prosodic break index labelling by a new mixed method of LDA and VQPyungsu Kang, Jiyoung Kang, Jinyoung Kim. [doi]

A phonetic and acoustic study of babbling in an Italian childClaudio Zmarich, Roberta Lanni. [doi]

A new linear predictive method for compression of speech signalsPaavo Alku, Susanna Varho. [doi]

Trajectory formation of articulatory movements for a given sequence of phonemesTakeshi Okadome, Tokihiko Kaburagi, Masaaki Honda. [doi]

On the amount and domain of focal lengthening in SwedishEva Strangert, Mattias Heldner. [doi]

An event driven model for dialogue systemsKuansan Wang. [doi]

Representing the environments for phonological processes in an accent-independent lexicon for synthesis of EnglishSusan Fitt, Stephen Isard. [doi]

An integrated dialogue system for the automation of call centre servicesKallirroi Georgila, Anastasios Tsopanoglou, Nikos Fakotakis, George Kokkinakis. [doi]

A comparison of Thai speech recognition systems using hidden Markov model, neural network, and fuzzy-neural networkVisarut Ahkuputra, Somchai Jitapunkul, Nutthacha Jittiwarangkul, Ekkarit Maneenoi, Sawit Kasuriya. [doi]

Capturing discriminative information using multiple modeling techniquesJi Ming, Philip Hanna, Darryl Stewart, Saeed Vaseghi, F. Jack Smith. [doi]

A statistical study of pitch target points in five languagesEstelle Campione, Jean Véronis. [doi]

Spoken L2 teaching with contrastive visual and auditory feedbackAnne-Marie Öster. [doi]

Using automatic speech recognition and its possible effects on the voiceChristel G. de Bruijn, Sandra P. Whiteside, P. A. Cudd, D. Syder, K. M. Rosen, L. Nord. [doi]

Reducing peak search effort using two-tier pruningMark Wright, Simon Hovell, Simon Ringland. [doi]

Discriminative weighting of multi-resolution sub-band cepstral features for speech recognitionPhilip McMahon, Paul M. McCourt, Saeed Vaseghi. [doi]

Natural language call routing: a robust, self-organizing approachBob Carpenter, Jennifer Chu-Carroll. [doi]

An effective quality evaluation protocol for speech enhancement algorithmsJohn H. L. Hansen, Bryan L. Pellom. [doi]

Dynamic features in children s vowelsSteve Cassidy, Catherine Watson. [doi]

Statistical integration of temporal filter banks for robust speech recognition using linear discriminant analysis (LDA)Jia-lin Shen, Wen-Liang Hwang. [doi]

Phonetic and phonological markers of contrastive focus in KoreanSun-Ah Jun, Hyuck-Joon Lee. [doi]

Improving speaker identification performance in reverberant conditions using lip informationTim Wark, Sridha Sridharan. [doi]

Suprasegmental duration modelling with elastic constraints in automatic speech recognitionLaurence Molloy, Stephen Isard. [doi]

Prosodic analysis of fillers and self-repair in Japanese speechFelix C. M. Quimbo, Tatsuya Kawahara, Shuji Doshita. [doi]

Modeling pronunciation variation for a dutch CSR: testing three methodsMirjam Wester, Judith M. Kessens, Helmer Strik. [doi]

Wavelet transform domain blind equalization and its application to speech analysisMunehiro Namba, Yoshihisa Ishida. [doi]

Combining connectionist multi-band and full-band probability streams for speech recognition of natural numbersNikki Mirghafori, Nelson Morgan. [doi]

Recognizing emotions in speech using short-term and long-term featuresYang Li, Yunxin Zhao. [doi]

Extraction of the dialog act and the topic from utterances in a spoken dialog systemYasuhisa Niimi, Noboru Takinaga, Takuya Nishimoto. [doi]

Heterogeneous measurements and multiple classifiers for speech recognitionAndrew K. Halberstadt, James R. Glass. [doi]

Duration compensation in non-adjacent consonant and temporal regularityHee-Sun Kim. [doi]

ITU-t g.729 extension at 6.4 kbpsE. Ekudden, R. Hagen, B. Johansson, Shinji Hayashi, Akitoshi Kataoka, Sachiko Kurihara. [doi]

Efficient lexical retrieval for English text-to-speech synthesisDaniel Faulkner, Charles Bryant. [doi]

Energy contour generation for a sentence using a neural network learning methodJungchul Lee, Donggyu Kang, Sanghoon Kim, Koengmo Sung. [doi]

The efficiency of multimodal interaction: a case studyPhilip R. Cohen, Michael Johnston, David McGee, Sharon L. Oviatt, Josh Clow, Ira A. Smith. [doi]

A discourse coding scheme for conversational SpanishLori S. Levin, Ann E. Thymé-Gobbel, Alon Lavie, Klaus Ries, Klaus Zechner. [doi]

SABLE: a standard for TTS markupRichard Sproat, Andrew Hunt, Mari Ostendorf, Paul Taylor, Alan W. Black, Kevin A. Lenzo, Mike Eddington. [doi]

Nonlinear interpolation of topic models for language model adaptationKristie Seymore, Stanley F. Chen, Ronald Rosenfeld. [doi]

Feature decorrelation methods in speech recognition. a comparative studyEloi Batlle, Climent Nadeu, José A. R. Fonollosa. [doi]

Coarticulation and degrees of freedom in the elaboration of a new articulatory plant: GENTIANEAnne Vilain, Christian Abry, Pierre Badin. [doi]

Duration modeling using cumulative duration probability and speaking rate compensationTae Young Yang, Ji-Sung Kim, Chungyong Lee, Dae Hee Youn, Il Whan Cha. [doi]

An adaptive beamforming microphone array system using a blind deconvolutionJin-Nam Park, Tsuyoshi Usagawa, Masanao Ebata. [doi]

Low bit rate coding for speech and audio using mel linear predictive coding (MLPC) analysisYoshihisa Nakatoh, Takeshi Norimatsu, Ah Heng Low, Hiroshi Matsumoto. [doi]

A new method to achieve fast acoustic matching for speech recognitionClark Z. Lee, Douglas D. O Shaughnessy. [doi]

A mixed-excitation frequency domain model for time-scale pitch-scale modification of speechAlex Acero. [doi]

Global optimisation of neural network models via sequential sampling-importance resamplingJoão F. G. de Freitas, Sue E. Johnson, Mahesan Niranjan, Andrew H. Gee. [doi]

On the use of F0 features in automatic segmentation for speech synthesisTakashi Saito. [doi]

Topic recognition for news speech based on keyword spottingYoichi Yamashita, Toshikatsu Tsunekawa, Riichiro Mizoguchi. [doi]

Vocabulary-independent word confidence measure using subword featuresLi Jiang, Xuedong Huang. [doi]

Human vs. machine speaker identification with telephone speechAstrid Schmidt-Nielsen, Thomas H. Crystal. [doi]

Voice conversion based on parameter transformationJuana M. Gutiérrez-Arriola, Yung-Sheng Hsiao, Juan Manuel Montero, José Manuel Pardo, Donald G. Childers. [doi]

Pragmatic characteristics of infant directed speechSudaporn Luksaneeyanawin, Chayada Thanavisuth, Suthasinee Sittigasorn, Onwadee Rukkarangsarit. [doi]

Correspondence between the glottal gesture overlap pattern and vowel devoicing in JapaneseMasako Fujimoto, Emi Murano, Seiji Niimi, Shigeru Kiritani. [doi]

Automatic pronunciation error detection and guidance for foreign language learningChul-Ho Jo, Tatsuya Kawahara, Shuji Doshita, Masatake Dantsuji. [doi]

Natural number recognition using discriminatively trained inter-word context dependent hidden Markov modelsMalan B. Gandhi. [doi]

A new synthetic speech/sound control languageOsamu Mizuno, Shin ya Nakajima. [doi]

Acoustic qualities of IDS and ADS in ThaiChayada Thanavisuth, Sudaporn Luksaneeyanawin. [doi]

The interactive systems labs view4you video indexing systemThomas Kemp, Petra Geutner, Michael Schmidt, Borislav Tomaz, Manfred Weber, Martin Westphal, Alex Waibel. [doi]

Dynamical spectrogram, an aid for the deafAli-Asghar Soltani-Farani, Edward H. S. Chilton, Robin Shirley. [doi]

Is speech the right thing for your application?Niels Ole Bernsen, Laila Dybkjær. [doi]

Prosodic vs. segmental contributions to naturalness in a diphone synthesizerH. Timothy Bunnell, Steve R. Hoskins, Debra Yarrington. [doi]

A thesaurus-based statistical language model for broadcast news transcriptionAkio Ando, Akio Kobayashi, Toru Imai. [doi]

IVie - a comparative transcription system for intonational variation in EnglishEsther Grabe, Francis Nolan, Kimberley J. Farrar. [doi]

New features for confidence annotationDhananjay Bansal, Mosur K. Ravishankar. [doi]

Robust speech recognition using HMM s with toeplitz state covariance matricesWilliam J. J. Roberts, Yariv Ephraim. [doi]

Bilingual and dialectal adaptation and retrainingUlla Uebler, Michael Schüßler, Heinrich Niemann. [doi]

Duration modeling for HMM-based speech synthesisTakayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura. [doi]

Two automatic approaches for analyzing connected speech processes in dutchMirjam Wester, Judith M. Kessens, Helmer Strik. [doi]

Emotional speech synthesis: from speech database to TTSJuan Manuel Montero, Juana M. Gutiérrez-Arriola, Sira E. Palazuelos, Emilia Enríquez, Santiago Aguilera, José Manuel Pardo. [doi]

Modeling dynamic prosodic variation for speaker verificationM. Kemal Sönmez, Elizabeth Shriberg, Larry P. Heck, Mitchel Weintraub. [doi]

Enhancing a WIMP based interface with speech, gaze tracking and agentsLau Bakman, Mads Blidegn, Martin Wittrup, Lars Bo Larsen, Thomas B. Moeslund. [doi]

Phonological similarity effects in Cantonese spoken-word processingMichael C. W. Yip, Po-Yee Leung, Hsuan-Chih Chen. [doi]

Modelling tongue configuration in German vowel productionPhilip Hoole. [doi]

Waveform interpolation coding with pitch-spaced subbandsW. Bastiaan Kleijn, Huimin Yang, Ed F. Deprettere. [doi]

Automatic recognition of spontaneous speech dialoguesMauro Cettolo, Daniele Falavigna. [doi]

Some developmental patterns in the speech of 6-, 8- and 10-year old children: an acoustic phonetic studySandra P. Whiteside, Carolyn Hodgson. [doi]

A novel method of formant analysis and glottal inverse filteringSteve Pearson. [doi]

Real time voice alteration based on linear predictionPing-Fai Yang, Yannis Stylianou. [doi]

From novice to expert: the effect of tutorials on user expertise with spoken dialogue systemsCandace A. Kamm, Diane J. Litman, Marilyn A. Walker. [doi]

Training of context-dependent subspace distribution clustering hidden Markov modelBrian Mak, Enrico Bocchieri. [doi]

Building a statistical model of the vowel space for phoneticiansMatthew P. Aylett. [doi]

Estimation of the probability distributions of stochastic context-free grammars from the k-best derivationsJoan-Andreu Sánchez, José-Miguel Benedí. [doi]

Example-based error recovery method for speech translation: repairing sub-trees according to the semantic distanceKai Ishikawa, Eiichiro Sumita, Hitoshi Iida. [doi]

Analysis of effects of lexical accent, syntax, and global speech rate upon the local speech rateSumio Ohno, Hiroya Fujisaki, Hideyuki Taguchi. [doi]

Same talker, different languageVerna Stockmal, Danny R. Moates, Zinny S. Bond. [doi]

Phonetic-level mispronunciation detection in non-native Swedish speechPhilippe Langlais, Anne-Marie Öster, Björn Granström. [doi]

Linear discriminant - a new criterion for speaker normalizationMartin Westphal, Tanja Schultz, Alex Waibel. [doi]

How effective is unsupervised data collection for children s speech recognition?Gregory Aist, Peggy Chan, Xuedong Huang, Li Jiang, Rebecca Kennedy, DeWitt Latimer IV, Jack Mostow, Calvin Yeung. [doi]

Exploiting transitions and focussing on linguistic properties for ASRJacques C. Koreman, William J. Barry, Bistra Andreeva. [doi]

Periodicity emphasis of voice wave using nonlinear IIR digital filters and its applicationsHiroyuki Kamata, Akira Kaneko, Yoshihisa Ishida. [doi]

Additional use of phoneme duration hypotheses in automatic speech segmentationKarlheinz Stöber, Wolfgang Hess. [doi]

Comparison of cross-language coarticulation: English, Japanese and Japanese-accented EnglishKimiko Tsukada. [doi]

Using untranscribed training data to improve performanceGeorge Zavaliagkos, Man-Hung Siu, Thomas Colthurst, Jayadev Billa. [doi]

Product-code vector quantization of cepstral parameters for speech recognition over the WWWVassilios Digalakis, Leonardo Neumeyer, Manolis Perakakis. [doi]

Crosslinguistic disfluency modelling: a comparative analysis of Swedish and american English human-human and human-machine dialoguesRobert Eklund, Elizabeth Shriberg. [doi]

Powerful syllabic fillers for general-task keyword-spotting and unlimited-vocabulary continuous-speech recognitionRachida El Méliani, Douglas D. O Shaughnessy. [doi]

Automatic detection of semantic boundaries based on acoustic and lexical knowledgeMauro Cettolo, Daniele Falavigna. [doi]

Tones of a tridialectal: acoustic and perceptual data on ten linguistic tonetic contrasts between lao, nyo and standard ThaiPhil Rose. [doi]

Articulatory, acoustic and perceptual aspects of fricative-stop coarticulationNoël Nguyen, Alan Wrench, Fiona Gibbon, William J. Hardcastle. [doi]

An annotation system for melodic aspects of German spontaneous speechChristel Brindöpke, Brigitte Schaffranietz. [doi]

Towards a formal framework for linguistic annotationsSteven Bird, Mark Liberman. [doi]

Support vector machines for speech recognitionAravind Ganapathiraju, Jonathan Hamaker, Joseph Picone. [doi]

On variable sampling frequencies in speech recognitionFu-Hua Liu, Michael Picheny. [doi]

Intonative structure as a determinant of word order variation in dutch verbal endgroupsMarc Swerts. [doi]

Situated dialogue coordination for spoken dialogue systemsMichio Okada, Noriko Suzuki, Jacques M. B. Terken. [doi]

Perception of concurrent approximant-vowel syllablesWilliam A. Ainsworth. [doi]

On the relationship of speech rates with prosodic units in dialogue speechKeikichi Hirose, Hiromichi Kawanami. [doi]

Acoustic nature and perceptual testing of corpora of emotional speechAkemi Iida, Nick Campbell, Soichiro Iga, Fumito Higuchi, Michiaki Yasumura. [doi]

Speech recognition in noisy environment using weighted projection-based likelihood measureWon-Ho Shin, Weon-Goo Kim, Chungyong Lee, Il Whan Cha. [doi]

A linguistic analysis of repair signals in co-operative spoken dialoguesShu-Chuan Tseng. [doi]

Indexing and classification of TV news articles based on speech dictation using word bigramJun Ogata, Yasuo Ariki. [doi]

ko tok ples ensin bilong tok pisin or the TP-CLE: a first report from a pilot speech-to-speech translation project from Swedish to tok pisinRobert Eklund. [doi]

A HMM-based recognition system for perceptive relevant pitch movements of spontaneous German speechChristel Brindöpke, Gernot A. Fink, Franz Kummert, Gerhard Sagerer. [doi]

Rejection in speech recognition systems with limited trainingAruna Bayya. [doi]

Text analysis for the bell labs French text-to-speech systemEvelyne Tzoukermann. [doi]

The maximum-based description of F0 contours and its application to EnglishThomas Portele, Barbara Heuft. [doi]

A novel technique for the combination of utterance and speaker verification systems in a text-dependent speaker verification taskLeandro Rodríguez Liñares, Carmen García-Mateo. [doi]

A method for modeling liaison in a speech recognition system for FrenchLalit R. Bahl, S. V. De Gennaro, Pieter de Souza, E. Epstein, J. M. Le Roux, B. Lewis, Claire Waast. [doi]

Spectral smoothing for concatenative speech synthesisDavid T. Chappell, John H. L. Hansen. [doi]

Modeling vowel duration for Japanese text-to-speech synthesisJennifer J. Venditti, Jan P. H. van Santen. [doi]

On the use of automatically generated discourse-level information in a concept-to-speech synthesis systemJanet Hitzeman, Alan W. Black, Paul Taylor, Chris Mellish, Jon Oberlander. [doi]

High quality text-to-speech system in Spanish for handicapped peopleFernando Lacunza, Yolanda Blanco. [doi]

Categorical perception of vowelsEllen Gerrits, Bert Schouten. [doi]

Analyzing the effect of secondary excitations of the vocal tract on vocal intensity in different loudness conditionsPaavo Alku, Juha Vintturi, Erkki Vilkman. [doi]

An instantaneous-frequency-based pitch extraction method for high-quality speech transformation: revised TEMPO in the STRAIGHT-suiteHideki Kawahara, Alain de Cheveigné, Roy D. Patterson. [doi]

Regional variation in the vowels of female adolescents from sydneyFelicity Cox, Sallyanne Palethorpe. [doi]

Telephone speech multi-keyword spotting using fuzzy search algorithm and prosodic verificationChung-Hsien Wu, Yeou-Jiunn Chen, Yu-Chun Hung. [doi]

Cluster adaptive training for speech recognitionMark J. F. Gales. [doi]

Favourable and unfavourable short duration segments of speech in noiseDaniel Woo. [doi]

Spectral sequence compensation based on continuity of spectral sequenceMasato Akagi, Mamoru Iwaki, Noriyoshi Sakaguchi. [doi]

The effect of orthographic knowledge on the segmentation of speechBruce L. Derwing, Terrance M. Nearey, Yeo Bom Yoon. [doi]

An implementation and evaluation of an on-line speaker verification system for field trialsYong Gu, Trevor Thomas. [doi]

Performance and optimization of the SEEVOC algorithmWeihua Zhang, W. Harvey Holmes. [doi]

Speaker verification on the polycost database using frequency filtered spectral energiesJavier Hernando, Climent Nadeu. [doi]

On the limitations of stochastic conceptual finite-state language models for speech understandingJosé Colás, Javier Ferreiros, Juan Manuel Montero, Julio Pastor, Ascensión Gallardo-Antolín, José Manuel Pardo. [doi]

Automatic rule generation for linguistic features analysis using inductive learning technique: linguistic features analysis in TOS drive TTS systemShigenobu Seto, Masahiro Morita, Takehiko Kagoshima, Masami Akamine. [doi]

Variance and invariance in speech rate as a reflection of conceptual planningJanice Fon. [doi]

Modular neural networks for low-complex phoneme recognitionAxel Glaeser. [doi]

Non-native productions of Japanese single stops that are too long for one mora unitYasuyo Minagawa-Kawai, Shigeru Kiritani. [doi]

Automatic classification of dialogue contexts for dialogue predictionsCosmin Popovici, Paolo Baggia, Pietro Laface, Loreta Moisa. [doi]

Multi-lingual concatenative speech synthesisNick Campbell. [doi]

The use of linguistic hierarchies in speech understandingStephanie Seneff. [doi]

Analysis and interpretation of fundamental frequency contours of british English in terms of a command-response modelHiroya Fujisaki, Sumio Ohno, Takashi Yagi, Takeshi Ono. [doi]

A fast method of producing talking head mouth shapes from real speechAndrew P. Breen, O. Gloaguen, P. Stern. [doi]

Synthetic faces as a lipreading supportEva Agelfors, Jonas Beskow, Martin Dahlquist, Björn Granström, Magnus Lundeberg, Karl-Erik Spens, Tobias Öhman. [doi]

Predictive speaker adaptation and its prior trainingDieu Tran, Ken-ichi Iso. [doi]

Perception of words with vowel reductionJohan Frid. [doi]

Unsupervised training of a speech recognizer using TV broadcastsThomas Kemp, Alex Waibel. [doi]

Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processingNobuaki Minematsu, Seiichi Nakagawa. [doi]

A comparison of fusion techniques in mel-cepstral based speaker identificationStefan Slomka, Sridha Sridharan, Vinod Chandran. [doi]

Dialect maps and dialect research; useful tools for automatic speech recognition?Arne Kjell Foldvik, Knut Kvale. [doi]

Cooperation and competition of burst and formant transitions for the perception and identification of French stopsAdrian Neagu, Gérard Bailly. [doi]

Fast and slow speech rate: a characterisation for FrenchBrigitte Zellner. [doi]

A comparative evaluation of variance flooring techniques in HMM-based speaker verificationHåkan Melin, Johan Koolwaaij, Johan Lindberg, Frédéric Bimbot. [doi]

Towards better integration of semantic predictors in statistical language modelingNoah Coccaro, Daniel Jurafsky. [doi]

Don t blame it (all) on the pause: further ERP evidence for a prosody-induced garden-path in running speechKarsten Steinhauer, Kai Alter, Angela D. Friederici. [doi]

How disagreement expressions are used in cooperative tasksHiroyuki Yano, Akira Ito. [doi]

Speech recognition performance on a new voicemail transcription taskMukund Padmanabhan, Bhuvana Ramabhadran, Sankar Basu. [doi]

A hierarchical language model for CSRFrancisco J. Valverde-Albacete, José Manuel Pardo. [doi]

Comparison of spectral estimation techniques for low bit-rate speech codingDerek J. Molyneux, C. I. Parris, X. Q. Sun, Barry M. G. Cheetham. [doi]

Heads and tails in word perception: evidence for early-to-late processing in listening and readingSieb G. Nooteboom, Meinou van Dijk. [doi]

Automated captioning of television programs: development and analysis of a soundtrack corpusIngrid Ahmer, Robin W. King. [doi]

Speech communication profiles across the adult lifespan: persons without self-identified hearing impairmentM. F. Cheesman, K. L. Smilsky, T. M. Major, F. Lewis, L. M. Boorman. [doi]

A minimax search algorithm for CDHMM based robust continuous speech recognitionHui Jiang, Keikichi Hirose, Qiang Huo. [doi]

The importance of the first syllable in English spoken word recognition by adult Japanese speakersKazuo Nakayama, Kaoru Tomita-Nakayama. [doi]

Coherence-based subband decomposition for robust speech and speaker recognition in noisy and reverberant roomsJoaquin Gonzalez-Rodriguez, Santiago Cruz-Llanas, Javier Ortega-Garcia. [doi]

Influence of the speaking style and the noise spectral tilt on the lombard reflex and automatic speech recognitionJean-Claude Junqua, Steven Fincke, Ken Field. [doi]

On different functions of repetitive utterancesMarc Swerts, Hanae Koiso, Atsushi Shimojima, Yasuhiro Katagiri. [doi]

Double tree beam search using hierarchical subword unitsJuan Carlos Torrecilla, Ismael Cortázar, Luis A. Hernández Gómez. [doi]

Integrated recognition of words and phrase boundariesFlorian Gallwitz, Anton Batliner, Jan Buckow, Richard Huber, Heinrich Niemann, Elmar Nöth. [doi]

Robust features for speech recognition systemsAruna Bayya, B. Yegnanarayana. [doi]

Interfacing acoustic models with natural language processing systemsMichael T. Johnson, Mary P. Harper, Leah H. Jamieson. [doi]

Spectral noise subtraction with recursive gain curvesKlaus Linhard, Tim Haulick. [doi]

Linear and nonlinear speech feature analysis for stress classificationGuojun Zhou, John H. L. Hansen, James F. Kaiser. [doi]

Gaussian density tree structure in a multi-Gaussian HMM-based speech recognition systemJacques Simonin, Lionel Delphin-Poulat, Géraldine Damnati. [doi]

Acoustic analysis of /l/ in glossectomeesJulie Lunn, Alan Wrench, Janet MacKenzie Beck. [doi]

Voicing affects perceived manner of articulationSantiago Fernández, Sergio Feijóo, Plinio Almeida. [doi]

Vowel separation using the reassigned amplitude-modulation spectrumDekun Yang, Georg F. Meyer, William A. Ainsworth. [doi]

Pausing in Swedish spontaneous speechPetra Hansson. [doi]

Fly with the EAGLES: evaluation of the ACCeSS spoken language dialogue systemGerhard Hanrieder, Paul Heisterkamp, Thomas Brey. [doi]

End-user driven dialogue system design: the reward experienceKlaus Failenschmid, J. H. Simon Thornton. [doi]

Word-based acoustic confidence measures for large-vocabulary speech recognitionAsela Gunawardana, Hsiao-Wuen Hon, Li Jiang. [doi]

Phoneme-based recognition for the norwegian speechdat(II) databaseFinn Tore Johansen. [doi]

Representing prosodic words using statistical models of moraic transition of fundamental frequency contours of JapaneseKoji Iwano, Keikichi Hirose. [doi]

Micropower electro-magnetic sensors for speech characterization, recognition, verification, and other applicationsJohn F. Holzrichter, Gregory C. Burnett, Todd J. Gable, Lawrence C. Ng. [doi]

Design of cochlear implant device for transmitting voice pitch information in speech sound of asian languagesShizuo Hiki, Kazuya Imaizumi, Yumiko Fukuda. [doi]

Calibration of machine scores for pronunciation gradingHoracio Franco, Leonardo Neumeyer. [doi]

Phonetic alignment: speech synthesis based vs. hybrid HMM/ANNFabrice Malfrère, Olivier Deroo, Thierry Dutoit. [doi]

Steps toward the integration of speaker recognition in real-world telecom applicationsAxel Glaeser, Frédéric Bimbot. [doi]

Prosody and voice quality in the expression of emotionsElisabeth Zetterholm. [doi]

Performance improvements through combining phone- and syllable-scale information in automatic speech recognitionSu-Lin Wu, Brian Kingsbury, Nelson Morgan, Steven Greenberg. [doi]

A schema based approach to dialog controlPaul C. Constantinides, Scott Hansma, Chris Tchou, Alexander I. Rudnicky. [doi]

On-line hierarchical transformation of hidden Markov models for speaker adaptationJen-Tzung Chien. [doi]

Robust speech activity detection in the presence of noiseRuhi Sarikaya, John H. L. Hansen. [doi]

Reducing the OOV rate in broadcast news speech recognitionThomas Kemp, Alex Waibel. [doi]

HMM-based smoothing for concatenative speech synthesisMike Plumpe, Alex Acero, Hsiao-Wuen Hon, Xuedong Huang. [doi]

Speaker detection in broadcast speech databasesAaron E. Rosenberg, Ivan Magrin-Chagnolleau, S. Parthasarathy, Qian Huang. [doi]

Speech-to-lip movement synthesis based on the EM algorithm using audio-visual HMMsEli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. [doi]

A multimodal-input multimedia-output guidance system: MMGSToshiyuki Takezawa, Tsuyoshi Morimoto. [doi]

A proposed decision rule for speaker recognition based on fuzzy c-means clusteringDat Tran, Michael Wagner, Tu Van Le. [doi]

Usability evaluation of IVR systems with DTMF and ASRCristina Delogu, Andrea Di Carlo, Paolo Rotundi, Danilo Sartori. [doi]

Speech pre-processing against intentional imposture in speaker recognitionDominique Genoud, Gérard Chollet. [doi]

Natural-sounding speech synthesis using variable-length unitsJon R. W. Yi, James R. Glass. [doi]

Audio-visual segmentation for content-based retrievalDavid Pye, Nicholas J. Hollinghurst, Timothy J. Mills, Kenneth R. Wood. [doi]

Grammatical and statistical word prediction system for Spanish integrated in an aid for people with disabilitiesSira E. Palazuelos, Santiago Aguilera, José Rodrigo, Juan Ignacio Godino-Llorente. [doi]

An improved decomposition method for WI using IIR wavelet filter banksNicola R. Chong, Ian S. Burnett, Joe F. Chicharo. [doi]

Improving speaker recognisability in phonetic vocodersCarlos M. Ribeiro, Isabel Trancoso. [doi]

Evidence of dual-route phonetic encoding from apraxia of speech: implications for phonetic encoding modelsRosemary A. Varley, Sandra P. Whiteside. [doi]

AN RNN-based compensation method for Mandarin telephone speech recognitionSen-Chia Chang, Shih-Chieh Chien, Chih-Chung Kuo. [doi]

Context-dependent duration modelling for continuous speech recognitionTan Lee, Rolf Carlson, Björn Granström. [doi]

Fully automatic prosody generator for text-to-speechFabrice Malfrère, Thierry Dutoit, Piet Mertens. [doi]

Text-independent speaker recognition using multiple information sourcesKonstantin P. Markov, Seiichi Nakagawa. [doi]

A linguistic and prosodic database for data-driven Japanese TTS synthesisAtsuhiro Sakurai, Takashi Natsume, Keikichi Hirose. [doi]

Prosodic structure in Japanese spontaneous speechYasuo Horiuchi, Akira Ichikawa. [doi]

SALSA version 1.0: a speech-based web browser for hong kong EnglishPascale Fung, Chi Shun Cheung, Kwok Leung Lam, Wai Kat Liu, Yuen Yee Lo. [doi]

German regional variants - a problem for automatic speech recognition?Nicole Beringer, Florian Schiel, Peter Regel-Brietzmann. [doi]

Prosody-based detection of the context of backchannel responsesHiroaki Noguchi, Yasuharu Den. [doi]

Adults with a severe-to-profound hearing impairment. investigating the effects of linguistic context on speech perceptionMark C. Flynn, Richard C. Dowell, Graeme M. Clark. [doi]

An evaluation of keyword spotting performance utilizing false alarm rejection based on prosodic informationMasaki Ida, Ryuji Yamasaki. [doi]

Postvocalic /r/-deletion in standard dutch: how experimental phonology can profit from ASR technologyCatia Cucchiarini, Henk van den Heuvel. [doi]

A sinusoidal harmonic vocoder at 1.2 kbps using auditory perceptual characteristicsMinoru Kohata. [doi]

A perceptual evaluation of distance measures for concatenative speech synthesisJohan Wouters, Michael W. Macon. [doi]

The applicability of adaptive language modelling for the broadcast news taskPhilip Clarkson, Tony Robinson. [doi]

A unified framework for sublexical and linguistic modelling supporting flexible vocabulary speech understandingRaymond Lau, Stephanie Seneff. [doi]

The intellimedia workbench - a generic environment for multimodal systemsTom Brøndsted, Lars Bo Larsen, Michael Manthey, Paul McKevitt, Thomas B. Moeslund, Kristian G. Olesen. [doi]

Towards a unified model for low bit-rate speech coding using a recognition-synthesis approachWendy J. Holmes. [doi]

On optimum normalization method used for speaker verificationWeijie Liu, Toshihiro Isobe, Naoki Mukawa. [doi]

Assessment of dutch pronunciation by means of automatic speech recognition technologyCatia Cucchiarini, Febe de Wet, Helmer Strik, Lou Boves. [doi]

Speech recognition from GSM codec parametersJuan M. Huerta, Richard M. Stern. [doi]

Using linguistic knowledge to improve the design of low-bit rate LSF quantisationJohn J. Parry, Ian S. Burnett, Joe F. Chicharo. [doi]

Semi-automated incremental prototyping of spoken dialog systemsStefan Kaspar, Achim G. Hoffmann. [doi]

SHEEP, GOATS, LAMBS and WOLVES: a statistical analysis of speaker performance in the NIST 1998 speaker recognition evaluationGeorge R. Doddington, Walter Liggett, Alvin F. Martin, Mark A. Przybocki, Douglas A. Reynolds. [doi]

A study on the recognition of low bit-rate encoded speechAn-Tzyh Yu, Hsiao-Chuan Wang. [doi]

Speaker independent speech recognition method using constrained time alignment near phoneme discriminative frameTomohiro Konuma, Tetsu Suzuki, Maki Yamada, Yoshio Ono, Masakatsu Hoshimi, Katsuyuki Niyada. [doi]

Partitioning and transcription of broadcast news dataJean-Luc Gauvain, Lori Lamel, Gilles Adda. [doi]

How to handle foreign sounds in Swedish text-to-speech conversion: approaching the xenophone problemRobert Eklund, Anders Lindström. [doi]

Multi-level rhythm control for speech synthesis using hybrid data driven and rule-based approachesOliver Jokisch, Diane Hirschfeld, Matthias Eichner, Rüdiger Hoffmann. [doi]

Online adaptation of language models in spoken dialogue systemsBernd Souvignier, Andreas Kellner. [doi]

Universal speech tools: the CSLU toolkitStephen Sutton, Ronald A. Cole, Jacques de Villiers, Johan Schalkwyk, Pieter J. E. Vermeulen, Michael W. Macon, YongHong Yan, Edward C. Kaiser, Brian Rundle, Khaldoun Shobaki, John-Paul Hosom, Alexander Kain, Johan Wouters, Dominic W. Massaro, Michael M. Cohen. [doi]

Robust feature extraction for alphabet recognitionMontri Karnjanadecha, Stephen A. Zahorian. [doi]

Using an animated talking character in a web-based city guide demonstratorGeorg Fries, Stefan Feldes, Alfred Corbet. [doi]

Automatic identification of command boundaries in a conversational natural language user interfaceGanesh N. Ramaswamy, Jan Kleindienst. [doi]

Temporal organization of speech for normal and fast ratesGeetha Krishnan, Wayne Ward. [doi]

On the structure of vowel space: a genealogy of general phonetic conceptsHendrik F. V. Boshoff, Elizabeth C. Botha. [doi]

Fast decoding for statistical machine translationYe-Yi Wang, Alex Waibel. [doi]

Unsupervised training of phone duration and energy models for text-to-speech synthesisPaul C. Bagshaw. [doi]

The perception of the morae with devocalized vowels in Japanese languageKimiko Yamakawa, Ryoji Baba. [doi]

Making the most of multiplicity: a multi-parser multi-strategy architecture for the robust processing of spoken languageTobias Ruland, C. J. Rupp, Jörg Spilker, Hans Weber, Karsten L. Worm. [doi]

Local speech rate as a combination of syllable and phone rateHartmut R. Pfitzinger. [doi]

On the learnability of the voicing contrast for initial stopsRobert I. Damper, Steve R. Gunn. [doi]

A practical perceptual frequency autoregressive HMM enhancement systemBeth Logan, Tony Robinson. [doi]

The acquisition of putonghua phonologyLydia K. H. So, Zhou Jing. [doi]

Efficiency as an organizing principle of natural speechR. J. J. H. van Son, Florien J. Koopmans-van Beinum, Louis C. W. Pols. [doi]

The role of stress for lexical selection in dutchJean Vroomen, Béatrice de Gelder. [doi]

Linguistically engineered tools for speech recognition error analysisCarol Van Ess-Dykema, Klaus Ries. [doi]

Development of CAI system employing synthesized speech responsesTsubasa Shinozaki, Masanobu Abe. [doi]

Phonological rules for enhancing acoustic enrollment of unknown wordsBhuvana Ramabhadran, Abraham Ittycheriah. [doi]

Improved surname pronunciations using decision treesJulie Ngan, Aravind Ganapathiraju, Joseph Picone. [doi]

Time dependent language model for broadcast news transcription and its post-correctionAkio Kobayashi, Kazuo Onoe, Toru Imai, Akio Ando. [doi]

Frequency domain binaural model as the front end of speech recognition systemTsuyoshi Usagawa, Kenji Sakai, Masanao Ebata. [doi]

Effects of contrastive focal accent on linguopalatal articulation and coarticulation in the French [kskl] clusterYohann Meynadier, Michel Pitermann, Alain Marchal. [doi]

Prosodic parameters in emotional speechKazuhito Koike, Hirotaka Suzuki, Hiroaki Saito. [doi]

Automatic prosodic labeling of 6 languagesHalewijn Vereecken, Jean-Pierre Martens, Cynthia Grover, Justin Fackrell, Bert Van Coile. [doi]

Candidate selection based on significance testing and its use in normalisation and scoringJi-Hwan Kim, Gil-Jin Jang, Seong-Jin Yun, Yung-Hwan Oh. [doi]

Speech analysis by subspace methods of spectral line estimationNajam Malik, W. Harvey Holmes. [doi]

Growth transform of a sum of rational functions and its application in estimating HMM parametersXiaoqiang Luo. [doi]

Multilateral techniques for speaker recognitionEluned S. Parris, Michael J. Carey 0002. [doi]

SQEL: a multilingual and multifunctional dialogue systemMaria Aretoulaki, Stefan Harbeck, Florian Gallwitz, Elmar Nöth, Heinrich Niemann, Jozef Ivanecký, Ivo Ipsic, Nikola Pavesic, Václav Matousek. [doi]

Improved utterance rejection using length dependent thresholdsSunil K. Gupta, Frank K. Soong. [doi]

A nonstationary autoregressive HMM with gain adaptation for speech recognitionKi Yong Lee, Joohun Lee. [doi]

Perception of tonal rises and falls for accentuation and phrasing in SwedishDavid House, Dik J. Hermes, Frédéric Beaugendre. [doi]

Speaker identification using relaxation labelingTuan D. Pham, Michael Wagner. [doi]

HMM topology selection for accurate acoustic and duration modelingCristina Chesta, Pietro Laface, Franco Ravera. [doi]

Assimilation of place in Japanese and dutchAnne Cutler, Takashi Otake. [doi]

Stochastic calculus, non-linear filtering, and the internal model principle: implications for articulatory speech recognitionGordon Ramsay. [doi]

Speaker normalization with all-pass transformsJohn W. McDonough, William Byrne, Xiaoqiang Luo. [doi]

On the effects of speech rate upon parameters of the command-response model for the fundamental frequency contours of speechSumio Ohno, Hiroya Fujisaki, Yoshikazu Hara. [doi]

Plug and play software for designing high-level speech processing systemsThierry Dutoit, Juergen Schroeter. [doi]

Speaker recruitment methods and speaker coverage - experiences from a large multilingual speech database collectionBørge Lindberg, Robrecht Comeyne, Christoph Draxler, Francesco Senia. [doi]

Disambiguation of Korean utterances using automatic intonation recognitionTae-Yeoub Jang, Minsuck Song, Kiyeong Lee. [doi]

Multi-dimensional scaling of listener responses to complex auditory stimuliZinny S. Bond, Donald Fucci, Verna Stockmal, Douglas McColl. [doi]

High-speed speaker adaptation using phoneme dependent tree-structured speaker clusteringMotoyuki Suzuki, Toshiaki Abe, Hiroki Mori, Shozo Makino, Hirotomo Aso. [doi]

Maximum-likelihood updates of HMM duration parameters for discriminative continuous speech recognitionRathinavelu Chengalvarayan. [doi]

A signal processing system for having the sound pop-out in noise thanks to the image of the speaker s lips: new advances using multi-layer perceptronsLaurent Girin, Laurent Varin, Gang Feng, Jean-Luc Schwartz. [doi]

A new strategy of fuzzy-neural network for Thai numeral speech recognitionChai Wutiwiwatchai, Somchai Jitapunkul, Visarut Ahkuputra, Ekkarit Maneenoi, Sudaporn Luksaneeyanawin. [doi]

On the reduction of concatenation artefacts in diphone synthesisEsther Klabbers, Raymond N. J. Veldhuis. [doi]

Dynamic vs. static spectral detail in the perception of gated stopsMichael Kiefte, Terrance M. Nearey. [doi]

The voicing feature for stop consonants: acoustic phonetic analyses and automatic speech recognition experimentsPadma Ramesh, Partha Niyogi. [doi]

EGG model of ditoneme in MandarinJiangping Kong. [doi]

A time-synchronous, tree-based search strategy in the acoustic fast match of an asynchronous speech recognition systemEllen Eide, Lalit R. Bahl. [doi]

The impact of regional variety upon specific word categories in spontaneous GermanSusanne Burger, Daniela Oppermann. [doi]

A duration-based confidence measure for automatic segmentation of noise corrupted speechBryan L. Pellom, John H. L. Hansen. [doi]

The use of confidence measures in unsupervised adaptation of speech recognizersTasos Anastasakos, Sreeram V. Balakrishnan. [doi]

Optimized POS-based language models for large vocabulary speech recognitionPetra Witschel. [doi]

Interfacing of CASA and partial recognition based on a multistream techniqueFrédéric Berthommier, Hervé Glotin, Emmanuel Tessier, Hervé Bourlard. [doi]

Probabilistic dialogue act extraction for concept based multilingual translation systemsToshiaki Fukada, Detlef Koll, Alex Waibel, Kouichi Tanigaki. [doi]

SCAN - speech content based audio navigator: a system overviewJohn Choi, Donald Hindle, Julia Hirschberg, Ivan Magrin-Chagnolleau, Christine H. Nakatani, Fernando C. N. Pereira, Amit Singhal, Steve Whittaker. [doi]

Cultural similarities and differences in the recognition of audio-visual speech stimuliSumi Shigeno. [doi]

The selection of pronunciation variants: comparing the performance of man and machineJudith M. Kessens, Mirjam Wester, Catia Cucchiarini, Helmer Strik. [doi]

A syllable-based Chinese spoken dialogue system for telephone directory services primarily trained with a corpusYen-Ju Yang, Lin-Shan Lee. [doi]

Spoken dialogue system using corpus-based hidden Markov modelChung-Hsien Wu, Gwo-Lang Yan, Chien-Liang Lin. [doi]

Data-driven PMC and Bayesian learning integration for fast model adaptation in noisy conditionsStefano Crafa, Luciano Fissore, Claudio Vair. [doi]

Feature-based approach to speech recognitionDorota J. Iskra, William H. Edmondson. [doi]

Improved parameter tying for efficient acoustic model evaluation in large vocabulary continuous speech recognitionJacques Duchateau, Kris Demuynck, Dirk Van Compernolle, Patrick Wambacq. [doi]

Improving posterior based confidence measures in hybrid HMM/ANN speech recognition systemsGiulia Bernardis, Hervé Bourlard. [doi]

Non-adjacent segmental effects in tonal realization of accentual phrase in seoul KoreanHyuck-Joon Lee. [doi]

Telephone band LVCSR for hearing-impaired usersEa-Ee Jan, Raimo Bakis, Fu-Hua Liu, Michael Picheny. [doi]

Speaker-independent upfront dialect adaptation in a large vocabulary continuous speech recognizerVolker Fischer, Yuqing Gao, Eric Janke. [doi]

Recognition performance of a large-scale dependency grammar language modelAdam L. Berger, Harry Printz. [doi]

The relation between vocal tract shape and formant frequencies can be described by means of a system of coupled differential equationsJean Schoentgen, Alain Soquet, Véronique Lecuit, Sorin Ciocea. [doi]

Acoustic and perceptual characteristic of Italian stop consonantsLoredana Cerrato, Mauro Falcone. [doi]

Telephone-based conversational speech recognition in the JUPITER domainJames R. Glass, Timothy J. Hazen. [doi]

TRAPS - classifiers of temporal patternsHynek Hermansky, Sangita Sharma. [doi]

Speech separation based on the GMM PDF estimationXiao Yu, Guangrui Hu. [doi]

A robust dialogue model for spoken dialogue processingMasahiro Araki, Shuji Doshita. [doi]

Special speech registers: talking to australian and Thai infants, and to petsDenis Burnham. [doi]

Independence of consonantal voicing and vocoid F0 perturbation in English and JapaneseShunichi Ishihara. [doi]

Acoustic cues for the auditory identification of the Spanish fricative /f/Santiago Fernández, Sergio Feijóo, Ramón Balsa, Nieves Barros. [doi]

Blind clustering of speech utterances based on speaker and language characteristicsDouglas A. Reynolds, Elliot Singer, Beth A. Carlson, Gerald C. O Leary, Jack McLaughlin, Marc A. Zissman. [doi]

Robust entropy-based endpoint detection for speech recognition in noisy environmentsJia-lin Shen, Jeih-Weih Hung, Lin-Shan Lee. [doi]

Soft state-tying for HMM-based speech recognitionChristoph Neukirchen, Daniel Willett, Gerhard Rigoll. [doi]

Customisation and quality assessment of spoken language descriptionJ. Bruce Millar. [doi]

Text-independent speaker identification and verification using the TIMIT databaseNuala C. Ward, Dominik R. Dersch. [doi]

Selection of the optimal structure of the continuous HMM using the genetic algorithmTomio Takara, Yasushi Iha, Itaru Nagayama. [doi]

An efficient mel-LPC analysis method for speech recognitionHiroshi Matsumoto, Yoshihisa Nakatoh, Yoshinori Furuhata. [doi]

A method for measuring the intelligibility and nonnativeness of phone quality in foreign language pronunciation trainingGoh Kawai, Keikichi Hirose. [doi]

Log-linear interpolation of language modelsDietrich Klakow. [doi]

Non-linear probability estimation method used in HMM for modeling frame correlationQing Guo, Fang Zheng, Jian Wu, Wenhu Wu. [doi]

Abnormal volume-duration relationship in parkinsonian speechAileen K. Ho, John L. Bradshaw, Robert Iansek, Robin J. Alfredson. [doi]

Recurrent substrings and data fusion for language recognitionHarvey Lloyd-Thomas, Eluned S. Parris, Jeremy H. Wright. [doi]

Compression algorithm of trigram language models based on maximum likelihood estimationNorimichi Yodo, Kiyohiro Shikano, Satoshi Nakamura. [doi]

Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection techniqueYukiko Fujisawa, Nobuaki Minematsu, Seiichi Nakagawa. [doi]

More evidence for the perceptual basis of sound change? suprasegmental effects in the development of distinctive nasalizationJohn Hajek, Ian Watson. [doi]

Reduction of English function words in switchboardDaniel Jurafsky, Alan Bell, Eric Fosler-Lussier, Cynthia Girand, William Raymond. [doi]

Utterance generation for transaction dialoguesJoris Hulstijn, Arjan van Hessen. [doi]

Data-driven extensions to HMM statistical dependenciesJeff A. Bilmes. [doi]

Language modeling for content extraction in human-computer dialoguesWolfgang Reichl, Bob Carpenter, Jennifer Chu-Carroll, Wu Chou. [doi]

A large vocabulary continuous speech recognition hybrid system for the portuguese languageJoão Paulo Neto, Ciro Martins, Luís B. Almeida. [doi]

An MRI study on the relationship between oral cavity shape and larynx positionKiyoshi Honda, Mark Tiede. [doi]

Correlation between consonantal VC transitions and degree of perceptual confusion of place contrast in hindiManjari Ohala, John J. Ohala. [doi]

Acoustic observation context modeling in segment based speech recognitionMate Szarvas, Shoichi Matsunaga. [doi]

Hidden Markov models for trajectory modelingRukmini Iyer, Herbert Gish, Man-Hung Siu, George Zavaliagkos, Spyros Matsoukas. [doi]

On the significance of temporal masking in speech codingJan Skoglund, W. Bastiaan Kleijn. [doi]

Simulated emotions: an acoustic study of voice and perturbation measuresSandra P. Whiteside. [doi]

Improvement on connected numbers recognition using prosodic informationEduardo López, Javier Caminero, Ismael Cortázar, Luis A. Hernández Gómez. [doi]

Phonetic modification of the syllable /tu/ in two spontaneous american English dialoguesNanette Veilleux, Stefanie Shattuck-Hufnagel. [doi]

Detecting topic shifts using a cache memoryBrigitte Bigi, Renato de Mori, Marc El-Bèze, Thierry Spriet. [doi]

Influence of facial views on the mcgurk effect in auditory noiseRika Kanzaki, Takashi Kato. [doi]

Speech technology in clinical environmentsJan van Doorn, Sharynne McLeod, Elise Baker, Alison Purcell, William Thorpe. [doi]

Speech recognition via phonetically featured syllablesSimon King, Todd A. Stephenson, Stephen Isard, Paul Taylor, Alex Strachan. [doi]

Improved robust speech recognition considering signal correlation approximated by taylor seriesJia-lin Shen, Jeih-Weih Hung, Lin-Shan Lee. [doi]

Speech perception and spoken language in children with impaired hearingPeter J. Blamey, Julia Sarant, Tanya Serry, Roger Wales, Christopher James, Johanna Barry, Graeme M. Clark, M. Wright, R. Tooher, C. Psarros, G. Godwin, M. Rennie, T. Meskin. [doi]

FEM analysis of aspirated air flow in three-dimensional vocal tract during fricative consonant phonationTakuya Niikawa, Masafumi Matsumura, Takashi Tachimura, Takeshi Wada. [doi]

Maximum a posteriori pitch trackingJames Droppo, Alex Acero. [doi]

Training speech through visual feedback patternsJan Nouza. [doi]

Towards a Chinese text-to-speech system with higher naturalnessRen-Hua Wang, Qingfeng Liu, Yongsheng Teng, Deyu Xia. [doi]

An efficient labeling tool for the Quicksig speech databaseMatti Karjalainen, Toomas Altosaar, Miikka Huttunen. [doi]

Recognition of vowels in fricative contextSantiago Fernández, Sergio Feijóo, Ramón Balsa, Nieves Barros. [doi]

Magnetic resonance measurements of the velum port openingDidier Demolin, Véronique Lecuit, Thierry Metens, Bruno Nazarian, Alain Soquet. [doi]

Learning words from natural audio-visual inputDeb Roy, Alex Pentland. [doi]

A study of tones and tempo in continuous Mandarin digit strings and their application in telephone quality speech recognitionChao Wang, Stephanie Seneff. [doi]

Frequency analysis of phonetic units for concatenative synthesis in catalanIgnasi Esquerra, Albert Febrer, Climent Nadeu. [doi]

An algorithm for choosing Japanese acknowledgments using prosodic cues and contextWataru Tsukahara. [doi]

Automatic generation of visual scenarios for spoken corpora acquisitionDemetrio Aiello, Cristina Delogu, Renato de Mori, Andrea Di Carlo, Marina Nisi, Silvia Tummeacciu. [doi]

Articulatory analysis using a codebook for articulatory based low bit-rate speech codingCarlos Silva, Samir Chennoukh. [doi]

Efficient high-order hidden Markov modellingJohan A. du Preez, D. M. Weber. [doi]

Improving accent identification through knowledge of English syllable structureKay Berkling, Marc A. Zissman, Julie Vonwiller, Christopher Cleirigh. [doi]

Extended linear discriminant analysis (ELDA) for speech recognitionGünther Ruske, Robert Faltlhauser, Thilo Pfau. [doi]

A PC-based tool for helping in diagnosis of pathologic voiceJuan Ignacio Godino-Llorente, Santiago Aguilera-Navarro, Sira E. Palazuelos-Cagigas, Alberto Nieto Altuzarra, Pedro Gómez Vilda. [doi]

Automatic ambiguity detectionRichard Sproat, Jan P. H. van Santen. [doi]

Improving accuracy of telephony-based, speaker-independent speech recognitionDaniel Azzopardi, Shahram Semnani, Ben Milner, Richard Wiseman. [doi]

Recovering gestures from speech signals: a preliminary study for nasal vowelsSolange Rossato, Gang Feng, Rafael Laboissière. [doi]

Neural network based pronunciation modeling with applications to speech recognitionToshiaki Fukada, Takayoshi Yoshimura, Yoshinori Sagisaka. [doi]

Efficient lattice representation and generationFuliang Weng, Andreas Stolcke, Ananth Sankar. [doi]

Interactive listening to structured speech content on the internetMakoto J. Hirayama, Taro Sugahara, Zhiyong Peng, Junichi Yamazaki. [doi]

A flexible method of creating HMM using block-diagonalization of covariance matricesRyosuke Koshiba, Mitsuyoshi Tachimori, Hiroshi Kanazawa. [doi]

A name announcement algorithm with memory size and computational power constraintsZe ev Roth, Judith Rosenhouse. [doi]

Recognition from GSM digital speechAscensión Gallardo-Antolín, Fernando Díaz-de-María, Francisco J. Valverde-Albacete. [doi]

Evaluation and integration of neural-network training techniques for continuous digit recognitionJohn-Paul Hosom, Ronald A. Cole, Piero Cosi. [doi]

Investigating the syntactic characteristics of English tone unitsAlex Chengyu Fang, Jill House, Mark Huckvale. [doi]

Creating a mexican Spanish version of the CSLU toolkitBen Serridge, Alejandro Barbosa, Ronald A. Cole, Nora Munive, Alcira Vargas. [doi]

Automatic detection of landmark for nasal consonants from speech waveformLimin Du, Kenneth N. Stevens. [doi]

Dual-route phonetic encoding: some acoustic evidenceSandra P. Whiteside, Rosemary A. Varley. [doi]

Combining articulatory and acoustic information for speech recognition in noisy and reverberant environmentsKatrin Kirchhoff. [doi]

A statistical phonemic segment model for speech recognition based on automatic phonemic segmentationKatsura Aizawa, Chieko Furuichi. [doi]

Language identification incorporating lexical informationDriss Matrouf, Martine Adda-Decker, Lori Lamel, Jean-Luc Gauvain. [doi]

A comparative study of hybrid modelling techniques for improved telephone speech recognitionRathinavelu Chengalvarayan. [doi]

Rescoring multiple pronunciations generated from spelled wordsRoland Kuhn, Jean-Claude Junqua, Philip D. Martzen. [doi]

A language modeling based on a hierarchical approach: m_n^vImed Zitouni. [doi]

On the convergence of Gaussian mixture models: improvements through vector quantizationJames Moody, Stefan Slomka, Jason W. Pelecanos, Sridha Sridharan. [doi]

De-accentuation: linguistic environments and prosodic realizationsKai Alter, Karsten Steinhauer, Angela D. Friederici. [doi]

Overview of the maya spoken language systemSimon Downey, Andrew P. Breen, Maria Fernández, Edward Kaneen. [doi]

Jitter and shimmer differences between pathological voices of school childrenNatalija Bolfan-Stosic, Tatjana Prizl. [doi]

The predictive power of game structure in dialogue act recognition: experimental results using maximum entropy estimationMassimo Poesio, Andrei Mikheev. [doi]

On loops and articulatory biomechanicsPascal Perrier, Yohan Payan, Joseph S. Perkell, Frédéric Jolly, Majid Zandipour, Melanie Matthies. [doi]

A Japanese-to-English speech translation system: ATR-MATRIXToshiyuki Takezawa, Tsuyoshi Morimoto, Yoshinori Sagisaka, Nick Campbell, Hitoshi Iida, Fumiaki Sugaya, Akio Yokoo, Seiichi Yamamoto. [doi]

Interfaces for speech recognition systems: the impact of vocabulary constraints and syntax on performanceKate S. Hone, David Golightly. [doi]

A 16 kbit/s wideband CELP coder using MEL-generalized cepstral analysis and its subjective evaluationKazuhito Koishida, Gou Hirabayashi, Keiichi Tokuda, Takao Kobayashi. [doi]

The new version of the ROMVOX text-to-speech synthesis system based on a hybrid time domain-LPC synthesis techniqueAttila Ferencz, István Nagy, Tunde-Csilla Kovács, Maria Ferencz, Teodora Ratiu. [doi]

Segmental duration control based on an articulatory modelYoshinori Shiga, Hiroshi Matsuura, Tsuneo Nitta. [doi]

Speaker clustering using direct maximisation of the MLLR-adapted likelihoodSue E. Johnson, Philip C. Woodland. [doi]

Wavelet transform-based speech enhancementEliathamby Ambikairajah, Graham Tattersall, Andrew Davis. [doi]

A large-vocabulary taiwanese (MIN-NAN) multi-syllabic word recognition system based upon right-context-dependent phones with state clustering by acoustic decision treeRen-Yuan Lyu, Yuang-jin Chiang, Wen-Ping Hsieh. [doi]

Efficient quantization of LSF parameters based on temporal decompositionSung-Joo Kim, Sangho Lee, Woo-Jin Han, Yung-Hwan Oh. [doi]

Emergent computational dialogue management architecture for task-oriented spoken dialogue systemsTakeshi Kawabata. [doi]

An undergraduate course on speech recognition based on the CSLU toolkitBen Serridge. [doi]

A context-dependent approach for speaker verification using sequential decisionHideki Noda, Katsuya Harada, Eiji Kawaguchi, Hidefumi Sawai. [doi]

Toward Markov random field modeling of speechGuillaume Gravier, Marc Sigelle, Gérard Chollet. [doi]

Multi-resolution for speech analysisMarie-José Caraty, Claude Montacié. [doi]

Organizing self-motivated dialogue with autonomous creaturesNoriko Suzuki, Kazuo Ishii, Michio Okada. [doi]

A German dialogue system for scheduling dates and meetings by naturally spoken continuous speechDaniel Willett, Arno Romer, Jörg Rottland, Gerhard Rigoll. [doi]

Hierarchical tag-graph search for spontaneous speech understanding in spoken dialog systemsBor-shen Lin, Berlin Chen, Hsin-Min Wang, Lin-Shan Lee. [doi]

The demiphone versus the triphone in a decision-tree state-tying frameworkJosé B. Mariño, Pau Pachès-Leal, Albino Nogueiras. [doi]

Concept-driven speech understanding incorporated with a statistic language modelAkito Nagai, Yasushi Ishikawa. [doi]

An analysis of the timing of turn-taking in a corpus of goal-oriented dialogueMatthew Bull, Matthew P. Aylett. [doi]

A study of noise robustness for speaker independent speech recognition method using phoneme similarity vectorMasakatsu Hoshimi, Maki Yamada, Katsuyuki Niyada, Shozo Makino. [doi]

Orthografik inkoncistensy ephekts in foneme detektion?Anne Cutler, Rebecca Treiman, Brit van Ooijen. [doi]

Phonological units in speech segmentation and phonological awarenessTakashi Otake, Kiyoko Yoneyama. [doi]

Quantification of pharyngeal articulations using measurements from laryngoscopic imagesJohn H. Esling, Jocelyn Clayards, Jerold A. Edmondson, Qiu Fuyuan, Jimmy G. Harris. [doi]

Word verification using confidence measures in speech recognitionM. Carmen Benítez, Antonio J. Rubio, Pedro García, Jesús E. Díaz-Verdejo. [doi]

Volume regulation in parkinsonian speechAileen K. Ho, John L. Bradshaw, Robert Iansek, Robin J. Alfredson. [doi]

A novel iterative signal enhancement algorithm for noise reduction in speechSimon Doclo, Ioannis Dologlou, Marc Moonen. [doi]

Statistical modeling of pronunciation and production variations for speech recognitionFilipp Korkmazskiy, Biing-Hwang Juang. [doi]

A synthesis method based on concatenation of demisyllables and a residual excited vocal tract modelSteve Pearson, Nick Kibre, Nancy Niedzielski. [doi]

Reconciling two competing views on contrastivenessEmiel Krahmer, Marc Swerts. [doi]

Lexical activation by assimilated and reduced tokensM. Louise Kelly, Ellen Gurman Bard, Catherine Sotillo. [doi]

Articulability of two consecutive morae in Japanese speech production: evidence from sound exchange errors in spontaneous speechYasushi Terao, Tadao Murata. [doi]

Word clustering for a word bi-gram modelShinsuke Mori, Masafumi Nishimura, Nobuyasu Itoh. [doi]

The CSLU speaker recognition corpusRonald A. Cole, Mike Noel, Victoria Noel. [doi]

Expanding a time-sensitive conversational architecture for turn-taking to handle content-driven interruptionGregory Aist. [doi]

Prosynth: an integrated prosodic approach to device-independent, natural-sounding speech synthesisSarah Hawkins, Jill House, Mark Huckvale, John Local, Richard Ogden. [doi]

A contrastive study of lexical stress placement in singapore English and british EnglishEe Ling Low, Esther Grabe. [doi]

Evaluation of dialog strategies for a tourist information retrieval systemLaurence Devillers, Hélène Bonneau-Maynard. [doi]

Syntax coordination: interaction of discourse and extrapositionsSusanne Kronenberg, Franz Kummert. [doi]

Comparative experiments to evaluate a voiced-unvoiced-based pre-processing approach to robust automatic speech recognition in low-SNR environmentsHesham Tolba, Douglas D. O Shaughnessy. [doi]

Vowel quality in spontaneous speech: what makes a good vowel?Matthew P. Aylett, Alice Turk. [doi]

Implementation of coordinative nodding behavior on spoken dialogue systemsJun-ichi Hirasawa, Noboru Miyazaki, Mikio Nakano, Takeshi Kawabata. [doi]

High resolution decision tree based acoustic modeling beyond CARTWu Chou, Wolfgang Reichl. [doi]

Automatic labelling of German prosodyStefan Rapp. [doi]

On frequency averaging for spectral analysis in speech recognitionCliment Nadeu, Felix Galindo, Jaume Padrell. [doi]

Towards speech understanding across multiple languagesTodd Ward, Salim Roukos, Chalapathy Neti, Jerome Gros, Mark Epstein, Satya Dharanipragada. [doi]

Towards an automatic classification of emotions in speechN. Amir, S. Ron. [doi]

A robust tone recognition method of Chinese based on sub-syllabic F0 contoursJin-Song Zhang, Keikichi Hirose. [doi]

Acoustic speech recognition model by neural net equation with competition and cooperationTetsuro Kitazoe, Tomoyuki Ichiki, Sung-Ill Kim. [doi]

Confidence scoring for speech understanding systemsChristine Pao, Philipp Schmid, James R. Glass. [doi]

Do phonetic features help to improve consonant identification in ASR?Jacques C. Koreman, Bistra Andreeva, William J. Barry. [doi]

Generating pitch accents in a concept-to-speech system using a knowledge baseSandra Williams. [doi]

On the influence of hyperarticulated speech on recognition performanceHagen Soltau, Alex Waibel. [doi]

A bootstrap training approach for language model classifiersVolker Warnke, Elmar Nöth, Jan Buckow, Stefan Harbeck, Heinrich Niemann. [doi]

Thai polysyllabic word recognition using fuzzy-neural networkChai Wutiwiwatchai, Somchai Jitapunkul, Visarut Ahkuputra, Ekkarit Maneenoi, Sudaporn Luksaneeyanawin. [doi]

A new confidence measure based on rank-ordering subphone scoresQiguang Lin, Subrata Das, David Lubensky, Michael Picheny. [doi]

Speech feature modeling for robust stressed speech recognitionSahar E. Bou-Ghazale, John H. L. Hansen. [doi]

The research project of man-computer dialogue system in ChineseDinghua Guan, Min Chu, Quan Zhang, Jian Liu, Xiangdong Zhang. [doi]

A very low bit rate speech coder using HMM with speaker adaptationTakashi Masuko, Keiichi Tokuda, Takao Kobayashi. [doi]

SEMOLE: a robust framework for gathering information from the world wide webHyung Jin Kim, I. Lee Hetherington. [doi]

A voice verifier for face/voice based person verification systemRongyu Qiao, Youngkyu Choi, Johnson I. Agbinya. [doi]

On robust sequential estimator based on t-distribution with forgetting factor for speech analysisJoohun Lee, Ki Yong Lee. [doi]

The influence of accents in australian English vowels and their relation to articulatory tract parametersDominik R. Dersch, Christopher Cleirigh, Julie Vonwiller. [doi]

Spoken language understanding within dialogs using a graphical model of task structureJeremy H. Wright, Allen L. Gorin, Alicia Abella. [doi]

Pacing spoken directions to suit the listenerTatsuya Iwase, Nigel Ward. [doi]

Speech production of vowel sequences using a physiological articulatory modelJianwu Dang, Kiyoshi Honda. [doi]

The distance measure for line spectrum pairs applied to speech recognitionFang Zheng, Zhanjiang Song, Ling Li, Wenjian Yu, Fengzhou Zheng, Wenhu Wu. [doi]

Plasticity of non-native phonetic perception and production: a training studySatoshi Imaizumi, Hidemi Itoh, Yuji Tamekawa, Toshisada Deguchi, Koichi Mori. [doi]

A spoken dialogue system utilizing spatial informationAnnika Flycht-Eriksson, Arne Jönsson. [doi]

Empowering knowledge based speech understanding through statisticsJulia Fischer, Jürgen Haas, Elmar Nöth, Heinrich Niemann, Frank Deinzer. [doi]

Analysis of occurrence of pauses and their durations in Japanese text readingHiroya Fujisaki, Sumio Ohno, Seiji Yamada. [doi]

Real-time probabilistic segmentation for segment-based speech recognitionSteven C. Lee, James R. Glass. [doi]

A bimodal Korean address entry/retrieval systemHyun-Yeol Chung, Cheol-Jun Hwang, Shi-wook Lee. [doi]

Improved parallel model combination based on better domain transformation for speech recognition under noisy environmentsJeih-Weih Hung, Jia-lin Shen, Lin-Shan Lee. [doi]

Continuous speech recognition using segmental unit input HMMs with a mixture of probability density functions and context dependencyKengo Hanai, Kazumasa Yamamoto, Nobuaki Minematsu, Seiichi Nakagawa. [doi]

A language model combining trigrams and stochastic context-free grammarsJohn Gillett, Wayne Ward. [doi]

Improving the noise and spectral robustness of an isolated-word recognizer using an auditory-model front endMartin Hunke, Meeran Hyun, Steve Love, Thomas Holton. [doi]

Recognition of connected digit speech in Japanese collected over the telephone networkHisashi Kawai, Norio Higuchi. [doi]

Improved feature decorrelation for HMM-based speech recognitionKris Demuynck, Jacques Duchateau, Dirk Van Compernolle, Patrick Wambacq. [doi]

Progress in speaker recognition at dragon systemsAndrés Corrada-Emmanuel, Michael Newman, Barbara Peskin, Larry Gillick, Robert Roth. [doi]

Morphological modeling of word classes for language modelsUlla Uebler, Heinrich Niemann. [doi]

A schema for illocutionary act identification with prosodic featureMasafumi Tamoto, Takeshi Kawabata. [doi]

Letter to sound rules for accented lexicon compressionVincent Pagel, Kevin A. Lenzo, Alan W. Black. [doi]

Phrase accents revisited: comparative evidence from standard and cypriot greekAmalia Arvaniti. [doi]

A silence/noise/music/speech splitting algorithmClaude Montacié, Marie-José Caraty. [doi]

Real time speaker indexing based on subspace method - application to TV news articles and debateMasafumi Nishida, Yasuo Ariki. [doi]

A new fast algorithm for automatic segmentation of continuous speechIman Gholampour, Kambiz Nayebi. [doi]

What you see is (almost) what you hear: design principles for user interfaces for accessing speech archivesSteve Whittaker, John Choi, Julia Hirschberg, Christine H. Nakatani. [doi]

Reconstructing the tongue surface from six cross-sectional contours: ultrasound dataAndrew J. Lundberg, Maureen Stone. [doi]

The use of automatic speech recognition to reduce the interference between concurrent tasks of driving and phoningRobert Graham, Chris Carter, Brian Mellor. [doi]

Nozomi - a fast, memory-efficient stack decoder for LVCSRMike Schuster. [doi]

Multi-phone strings as subword units for speech recognitionPhilip O Neill, Saeed Vaseghi, Bernard Doherty, Wooi-Haw Tan, Paul M. McCourt. [doi]

Performance evaluation of word phrase and noun category language models for broadcast news speech recognitionKazuyuki Takagi, Rei Oguro, Kenji Hashimoto, Kazuhiko Ozeki. [doi]

Speaker recognition using residual signal of linear and nonlinear prediction modelsMarcos Faúndez-Zanuy, Daniel Rodriguez-Porcheron. [doi]

Predictive adaptation and compensation for robust speech recognitionArun C. Surendran, Chin-Hui Lee. [doi]

Computer-based second language production training by using spectrographic representation and HMM-based speech recognition scoresReiko Akahane-Yamada, Erik McDermott, Takahiro Adachi, Hideki Kawahara, John S. Pruitt. [doi]

A fast decoding algorithm based on sequential detection of the changes in distributionQi Li. [doi]

Combination of confidence measures in isolated word recognitionJ. G. A. Dolfing, Andreas Wendemuth. [doi]

Hierarchical cluster language modeling with statistical rule extraction for rescoring n-best hypotheses during speech decodingPhotina Jaeyun Jang, Alexander G. Hauptmann. [doi]

The effect of fundamental frequency on Mandarin speech recognitionSharlene Liu, Sean Doyle, Allen Morris, Farzad Ehsani. [doi]

Factor analysis invariant to linear transformations of dataRamesh A. Gopinath, Bhuvana Ramabhadran, Satya Dharanipragada. [doi]

Improvements in slovene text-to-speech synthesisTomaz Sef, Ales Dobnikar, Matjaz Gams. [doi]

A speechreading aid based on phonetic ASRPaul Duchnowski, Louis Braida, Maroula Bratakos, David Lum, Matthew Sexton, Jean Krause. [doi]

An iterative, DP-based search algorithm for statistical machine translationIsmael García-Varea, Francisco Casacuberta, Hermann Ney. [doi]

Multimodal language processingMichael Johnston. [doi]

Fundamental frequency fluctuation in continuous vowel utterance and its perceptionMasato Akagi, Mamoru Iwaki, Tomoya Minakawa. [doi]

Phonetic-distance-based hypothesis driven lexical adaptation for transcribing multlingual broadcast newsPetra Geutner, Michael Finke, Alex Waibel. [doi]

Categorical perception: important phenomenon or lasting myth?Dominic W. Massaro. [doi]

The REWARD service creation environment. an overviewTom Brøndsted, Bo Nygaard Bai, Jesper Østergaard Olsen. [doi]

Audio and audio-visual perception of consonants disturbed by white noise and cocktail party László Czap. [doi]

SIVHA, visual speech synthesis systemYolanda Blanco, Maria Cuellar, Arantxa Villanueva, Fernando Lacunza, Rafael Cabeza, Beatriz Marcotegui. [doi]

Time as a factor in the acoustic variation of schwaWilliam J. Barry. [doi]

Discriminative training of GMM using a modified EM algorithm for speaker recognitionKonstantin P. Markov, Seiichi Nakagawa. [doi]

Using x-gram for efficient speech recognitionAntonio Bonafonte, José B. Mariño. [doi]

Exploration of acoustic correlates in speaker selection for concatenative synthesisAnn K. Syrdal, Alistair Conkie, Yannis Stylianou. [doi]

Generalized phone modeling based on piecewise linear segment latticeHiroaki Kojima, Kazuyo Tanaka. [doi]

Recognition-based word counting for reliable barge-in and early endpoint detection in continuous speech recognitionAnand R. Setlur, Rafid A. Sukkar. [doi]

Automatic language recognition using high-order HMMsJohan A. du Preez, D. M. Weber. [doi]

Same news is good news: automatically collecting reoccurring radio news storiesStefan Rapp, Grzegorz Dogil. [doi]

A comparative study of speaker verification systems using the polycost databaseTomas Nordström, Håkan Melin, Johan Lindberg. [doi]

Assimilation and anticipation in word perceptionHugo Quené, Maya van Rossum, Mieke van Wijck. [doi]

Evaluation of model adaptation by HMM decomposition on telephone speech recognitionTetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano, Masatoshi Morishima, Toshihiro Isobe. [doi]

Perceptual properties of Russians with Japanese fricativesSeiya Funatsu, Shigeru Kiritani. [doi]

Phonological elements as a basis for language-independent ASRGeoff Williams, Mark Terry, Jonathan Kaye. [doi]

Recent work on a preselection module for a flexible large vocabulary speech recognition system in telephone environmentJavier Ferreiros, Javier Macías Guarasa, Ascensión Gallardo-Antolín, José Colás, Ricardo de Córdoba, José Manuel Pardo, Luis Villarrubia Grande. [doi]

Enhancement techniques to improve the intelligibility of consonants in noise : speaker and listener effectsValérie Hazan, Andrew Simpson, Mark Huckvale. [doi]

A three-dimensional linear articulatory model based on MRI dataPierre Badin, Gérard Bailly, Monica Raybaudi, Christoph Segebarth. [doi]

MSF format for the representation of speech synchronized moving imageCheol-Woo Jo. [doi]

Speech intelligibility derived from exceedingly sparse spectral informationSteven Greenberg, Takayuki Arai, Rosaria Silipo. [doi]

The design of the newspaper-based Japanese large vocabulary continuous speech recognition corpusKatunobu Itou, Mikio Yamamoto, Kazuya Takeda, Toshiyuki Takezawa, Tatsuo Matsuoka, Tetsunori Kobayashi, Kiyohiro Shikano, Shuichi Itahashi. [doi]

Automatic language identification with perceptually guided training and recurrent neural networksJerome Braun, Haim Levkowitz. [doi]

Keyword extraction of radio news using domain identification based on categories of an encyclopediaYoshimi Suzuki, Fumiyo Fukumoto, Yoshihiro Sekiguchi. [doi]

The development of perceptual cue-weighting in children aged 6 to 12Valérie Hazan, Sarah Barrett. [doi]

Segmentation of the airway from the surrounding tissues on magnetic resonance images: a comparative studyAlain Soquet, Véronique Lecuit, Thierry Metens, Bruno Nazarian, Didier Demolin. [doi]

The CHAM model of hyperarticulate adaptation during human-computer error resolutionSharon L. Oviatt. [doi]

A pressure sensitive palatography: application of new pressure sensitive sheet for measuring tongue-palatal contact pressureMasahiko Wakumoto, Shinobu Masaki, Kiyoshi Honda, Toshikazu Ohue. [doi]

Periphear : a nonlinear active model of the auditory peripheryArnaud Robert, Jan Eriksson. [doi]

System-user interaction and response strategy in spoken dialogue systemYohei Okato, Keiji Kato, Mikio Yamamoto, Shuichi Itahashi. [doi]

A synthesis-oriented model of phrasal pitch movements in standard ChineseJinfu Ni, Goh Kawai, Keikichi Hirose. [doi]

ToBI accent type recognitionArman Maghbouleh. [doi]

Creating speaker independent HMM models for restricted database using STRAIGHT-TEMPO morphingAlexandre Girardi, Kiyohiro Shikano, Satoshi Nakamura. [doi]

Estimating entropy of a language from optimal word insertion penaltyKazuya Takeda, Atsunori Ogawa, Fumitada Itakura. [doi]

Resegmentation of SWITCHBOARDNeeraj Deshmukh, Aravind Ganapathiraju, Andi Gleeson, Jonathan Hamaker, Joseph Picone. [doi]

Use of high-level linguistic constraints for constructing feature-based phonological model in speech recognitionJiping Sun, Li Deng. [doi]

Automatic generation of Korean pronunciation variants by multistage applications of phonological rulesJe Hun Jeon, Sunhwa Cha, Minhwa Chung, Jun Park, Kyuwoong Hwang. [doi]

A syllable-based generalization of Japanese accentuationHaruo Kubozono. [doi]

Dovetailing of acoustics and prosody in spontaneous speech recognitionJan Buckow, Anton Batliner, Richard Huber, Elmar Nöth, Volker Warnke, Heinrich Niemann. [doi]

Effects of phonetic quality and duration on perceptual acceptability of temporal changes in speechHiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka. [doi]

A novel robust speech recognition algorithm based on multi-models and integrated decision methodShengxi Pan, Jia Liu, Jintao Jiang, Zuoying Wang, Dajin Lu. [doi]

Weighted parallel model combination for noisy speech recognitionTai-Hwei Hwang, Hsiao-Chuan Wang. [doi]

Laryngoscopic analysis of pharyngeal articulations and larynx-height voice quality settingsJohn H. Esling. [doi]

A hierarchy probability-based visual features extraction method for speechreadingYanjun Xu, Limin Du, Guoqiang Li, Ziqiang Hou. [doi]

Speaking-style dependent lexicalized filler model for key-phrase detection and verificationTatsuya Kawahara, Kentaro Ishizuka, Shuji Doshita, Chin-Hui Lee. [doi]

Segmental and tonal processing in CantoneseHsuan-Chih Chen, Michael C. W. Yip, Sum-Yin Wong. [doi]

A forensic phonetic investigation into non-contemporaneous variation in the f-pattern of similar-sounding speakersPhil Rose. [doi]

A multilingual prosodic databaseEstelle Campione, Jean Véronis. [doi]

On the influence of the delta coefficients in a HMM-based speech recognition systemFabrice Lefèvre, Claude Montacié, Marie-José Caraty. [doi]

Towards robust methods for spoken document retrievalKenney Ng. [doi]

Phonetic investigation of boundary pitch movements in JapaneseKazuaki Maeda, Jennifer J. Venditti. [doi]

Speaker recognition based on discriminative projection modelsJesper Østergaard Olsen. [doi]

The effect of background knowledge on first and second language comprehension difficultyMichael D. Tyler. [doi]

An F0 contour control model for totally speaker driven text to speech systemTakehiko Kagoshima, Masahiro Morita, Shigenobu Seto, Masami Akamine. [doi]

Collection and detailed transcription of a speech database for development of language learning technologiesHarry Bratt, Leonardo Neumeyer, Elizabeth Shriberg, Horacio Franco. [doi]

Cantilever-type force-sensor-mounted palatal plate for measuring palatolingual contact stress and pattern during speech phonationMasafumi Matsumura, Takuya Niikawa, Takao Tanabe, Takashi Tachimura, Takeshi Wada. [doi]

Information theoretic approaches to model selectionJonathan Hamaker, Aravind Ganapathiraju, Joseph Picone. [doi]

The UPC text-to-speech system for Spanish and catalanAntonio Bonafonte, Ignasi Esquerra, Albert Febrer, José A. R. Fonollosa, Francesc Vallverdú. [doi]

The role of phonological, morphological, and orthographic knowledge in the intuitive syllabification of dutch words: a longitudinal approachDominiek Sandra, Steven Gillis. [doi]

Bayesian constrained frequency warping HMMS for speaker normalisationChing-Hsiang Ho, Saeed Vaseghi, Aimin Chen. [doi]

Speech enhancement using STC-based bandwidth extensionJulien Epps, W. Harvey Holmes. [doi]

Quantitative influence of speech variability factors for automatic speaker verification in forensic tasksJavier Ortega-Garcia, Santiago Cruz-Llanas, Joaquin Gonzalez-Rodriguez. [doi]

A generic algorithm for generating spoken monologuesEsther Klabbers, Emiel Krahmer, Mariët Theune. [doi]

An effect of adaptive beamforming on hands-free speech recognition based on 3-d viterbi searchTakeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano. [doi]

Robust interpretation for spoken dialogue systemsLena Strömbäck, Arne Jönsson. [doi]

Speech, silence, music and noise classification of TV broadcast materialAra Samouelian, Jordi Robert-Ribes, Mike Plumpe. [doi]

Optimized stopping criteria for tree-based unit selection in concatenative synthesisAndrew Cronk, Michael W. Macon. [doi]

Fabricating conversational speech data with acoustic models: a program to examine model-data mismatchDon McAllaster, Lawrence Gillick, Francesco Scattone, Michael Newman. [doi]

The use of broad phonetic class models in speaker recognitionJohan Koolwaaij, Johan de Veth. [doi]

The provision of corrective feedback in a spoken dialogue CALL systemSarah Davies, Massimo Poesio. [doi]

Voice dictation in the secondary school classroomMichael F. McTear, Eamonn A. O Hare. [doi]

Using automatically-derived acoustic sub-word units in large vocabulary speech recognitionMichiel Bacchiani, Mari Ostendorf. [doi]

Segmentation and classification of broadcast news audioThomas Hain, Philip C. Woodland. [doi]

Modular connectionist systems for identifying complex arabic phonetic featuresSid-Ahmed Selouani, Jean Caelen. [doi]

Speaker verification using fundamental frequencyYoik Cheng, Hong C. Leung. [doi]

The tilt intonation modelPaul Taylor. [doi]

Cochlear implants in the second and third millenniaGraeme M. Clark. [doi]

The relationship between intensity and subglottal pressure with controlled pitchVéronique Lecuit, Didier Demolin. [doi]

Restoration of hyperbaric speech by correction of the formants and the pitchLaure Charonnat, Michel Guitton, Joel Crestel, Gerome Allée. [doi]

A perceptive measure of pure prosody linguistic functions with reiterant sentencesAlbert Rilliard, Véronique Aubergé. [doi]

A comparison of two unsupervised approaches to accent identificationMike Lincoln, Stephen Cox, Simon Ringland. [doi]

Spoken word identification by native and nonnative speakers of English: effects of training, modality, context and phonetic environmentDebra M. Hardison. [doi]

The influence of syllable structure on the timing of intonational events in GermanHansjörg Mixdorff, Hiroya Fujisaki. [doi]

Fast computation of maximum entropy / minimum divergence feature gainHarry Printz. [doi]

Representation of voice quality features associated with talker individualityHiroshi Kido, Hideki Kasuya. [doi]

Some acoustic characteristics of emotionCecile Pereira, Catherine I. Watson. [doi]

Separation of singing and piano soundsYoram Meron, Keikichi Hirose. [doi]

Multi-channel pulsation strategy for electric stimulation of cochleaShigeyoshi Kitazawa, Hiroyuki Kirihata, Tatsuya Kitamura. [doi]

A new look at HMM parameter tying for large vocabulary speech recognitionAnanth Sankar. [doi]

The automatic marking of prominence in spontaneous speech using duration and part of speech informationMatthew P. Aylett, Matthew Bull. [doi]

Rapid-deployment text-to-speech in the DIPLOMAT systemKevin A. Lenzo, Christopher Hogan, Jeffrey Allen. [doi]

Modelling spoken dialogues with state transition diagrams: experiences with the CSLU toolkitMichael F. McTear. [doi]

Techniques for accurate automatic annotation of speech waveformsStephen Cox, Richard Brady, Peter Jackson. [doi]

New prosodic control rules for expressive synthetic speechOsamu Mizuno, Shin ya Nakajima. [doi]

Describing intonation with a parametric modelGregor Möhler. [doi]

The effect of modifying formant amplitudes on the perception of French vowels generated by copy synthesisAnne Bonneau, Yves Laprie. [doi]

Contextual effects on voicing profiles of German and Mandarin consonantsChilin Shih, Bernd Möbius. [doi]

Automatic utterance type detection using suprasegmental featuresHelen Wright. [doi]

Joint recognition and segmentation using phonetically derived features and a hybrid phoneme modelNaomi Harte, Saeed Vaseghi, Ben P. Milner. [doi]

Inference of missing spectrographic features for robust speech recognitionBhiksha Raj, Rita Singh, Richard M. Stern. [doi]

Fuzzy Gaussian mixture models for speaker recognitionDat Tran, Tu Van Le, Michael Wagner. [doi]

The modeling and realization of natural speech generation systemFang Chen, Baozong Yuan. [doi]

Automatic detection of sentence boundaries and disfluencies based on recognized wordsAndreas Stolcke, Elizabeth Shriberg, Rebecca A. Bates, Mari Ostendorf, Dilek Hakkani, Madelaine Plauché, Gökhan Tür, Yu Lu. [doi]

Speech driven 3-d face point trajectory synthesis algorithmLevent M. Arslan, David Talkin. [doi]

Pronunciation modeling for large vocabulary conversational speech recognitionKristine W. Ma, George Zavaliagkos, Rukmini Iyer. [doi]

High accuracy Chinese speech recognition approach with Chinese input technology for telecommunication useYork Chung-Ho Yang, June-Jei Kuo. [doi]

Efficient adaptation of TTS duration model to new speakersChilin Shih, Wentao Gu, Jan P. H. van Santen. [doi]

On the importance of components of the modulation spectrum for speaker verificationSarel Van Vuuren, Hynek Hermansky. [doi]

Spotting (different types of) words in (different types of) contextJames M. McQueen, Anne Cutler. [doi]

Quantitative assessment of second language learners fluency: an automatic approachCatia Cucchiarini, Helmer Strik, Lou Boves. [doi]

Parametric trajectory mixtures for LVCSRMan-Hung Siu, Rukmini Iyer, Herbert Gish, Carl Quillen. [doi]

An acoustic-phonetic description of word tone in kagoshima JapaneseShunichi Ishihara. [doi]

Acoustic and affective qualities of IDS in EnglishChristine Kitamura, Denis Burnham. [doi]

Grammar fragment acquisition using syntactic and semantic clusteringKazuhiro Arai, Jeremy H. Wright, Giuseppe Riccardi, Allen L. Gorin. [doi]

Automatic grammar induction from semantic parsingDebajit Ghosh, David Goddeau. [doi]

Cross-language merged speech units and their descriptive phonetic correlatesPaul Dalsgaard, Ove Andersen, William J. Barry. [doi]

Perceived Swedish vowel quantity: effects of postvocalic consonant durationDawn M. Behne, Peter E. Czigler, Kirk P. H. Sullivan. [doi]

A language for creating speech applicationsAndrew N. Pargellis, Qiru Zhou, Antoine Saad, Chin-Hui Lee. [doi]

A*-admissible key-phrase spotting with sub-syllable level utterance verificationBerlin Chen, Hsin-Min Wang, Lee-Feng Chien, Lin-Shan Lee. [doi]

Segmentation using a maximum entropy approachKishore Papineni, Satya Dharanipragada. [doi]

A voice user interface demonstration system for mexican SpanishCarmen García-Mateo, Qiru Zhou, Chin-Hui Lee, Andrew N. Pargellis. [doi]

On a pitch alteration technique in excited cepstral spectrum for high quality TTSJongDeuk Kim, SeongJoon Baek, MyungJin Bae. [doi]

A bootstrap technique for building domain-dependent language modelsGanesh N. Ramaswamy, Harry Printz, Ponani S. Gopalakrishnan. [doi]

Use of non-verbal information in communication between human and robotMasao Yokoyama, Kazumi Aoyama, Hideaki Kikuchi, Katsuhiko Shirai. [doi]

An analysis of dialogues with our dialogue system through a WWW pageTadahiko Kumamoto, Akira Ito. [doi]

Toward on-line learning of Chinese continuous speech recognition systemRong Zheng, Zuoying Wang. [doi]

A realistic wizard of oz simulation of a multimodal spoken language systemPeter J. Wyard, Gavin E. Churcher. [doi]

Robust speech recognition using discriminative stream weighting and parameter interpolationStephen M. Chu, Yunxin Zhao. [doi]

Robust and compact multilingual word recognizers using features extracted from a phoneme similarity front-endPhilippe Morin, Ted H. Applebaum, Robert Boman, Yi Zhao, Jean-Claude Junqua. [doi]

An adaptive gradient-search based algorithm for discriminative training of HMM sAlbino Nogueiras Rodriguez, José B. Mariño, Enric Monte. [doi]

Adaptive transformation for segmented parametric speech codingDamith J. Mudugamuwa, Alan B. Bradley. [doi]

Task adaptation of sub-lexical unit models using the minimum confusibility criterion on task independent databasesAlbino Nogueiras Rodriguez, José B. Mariño. [doi]

The use of F0 reliability function for prosodic command analysis on F0 contour generation modelMitsuru Nakai, Hiroshi Shimodaira. [doi]

Confidence measures derived from an acceptor HMMGethin Williams, Steve Renals. [doi]

Synergy between jaw and lips/tongue movements : consequences in articulatory modellingGérard Bailly, Pierre Badin, Anne Vilain. [doi]

Text segmentation and topic tracking on broadcast news via a hidden Markov model approachPaul van Mulbregt, Ira Carp, Lawrence Gillick, Steve Lowe, Jon Yamron. [doi]

Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS)Masami Akamine, Takehiko Kagoshima. [doi]

An acoustic analysis of vowel production across tasks in a case of non-fluent progressive aphasiaKaren Croot. [doi]

An interlingua based on domain actions for machine translation of task-oriented dialoguesLori S. Levin, Donna Gates, Alon Lavie, Alex Waibel. [doi]

Source controlled variable bit-rate speech coder based on waveform interpolationF. Plante, Barry M. G. Cheetham, D. Marston, P. A. Barrett. [doi]

A kinematic analysis of new zealand and australian English vowel spacesCatherine I. Watson, Jonathan Harrington, Sallyanne Palethorpe. [doi]

Prosody prediction for speech synthesis using transformational rule-based learningCameron S. Fordyce, Mari Ostendorf. [doi]

Now you hear it, now you don t: empirical studies of audio browsing behavior behaviorChristine H. Nakatani, Steve Whittaker, Julia Hirschberg. [doi]

Determination of articulatory positions from speech acoustics by applying dynamic articulatory constraintsShin Suzuki, Takeshi Okadome, Masaaki Honda. [doi]

Evidence for early effects of sentence context on word segmentationSaskia te Riele, Hugo Quené. [doi]

Estimation of voice source and vocal tract parameters using combined subspace-based and amplitude spectrum-based algorithmChang-Sheng Yang, Hideki Kasuya. [doi]

User evaluation of the mask kioskLori Lamel, Samir Bennacef, Jean-Luc Gauvain, Hervé Dartigues, Jean-Noel Temem. [doi]

Real-time recognition of broadcast newsGary Cook, Tony Robinson, James Christie. [doi]

On the application of the AM-FM model for the recovery of missing frequency bands of telephone speechHesham Tolba, Douglas D. O Shaughnessy. [doi]

Perceived prominence and acoustic parameters in american EnglishThomas Portele. [doi]

Web-based educational tools for speech technologyKåre Sjölander, Jonas Beskow, Joakim Gustafson, Erland Lewin, Rolf Carlson, Björn Granström. [doi]

Forming generic models of speech for uniform database accessToomas Altosaar, Martti Vainio. [doi]

Wavelet-based energy binning cepstral features for automatic speech recognitionSankar Basu, Stéphane H. Maes. [doi]

Optopalatograph: real-time feedback of tongue movement in 3DAlan Wrench, Alan D. McIntosh, Colin Watson, William J. Hardcastle. [doi]

Stochastic language models for speech recognition and understandingGiuseppe Riccardi, Allen L. Gorin. [doi]

Improved duration modeling of English phonemes using a root sinusoidal transformationJerome R. Bellegarda, Kim E. A. Silverman. [doi]

Context sensitive generation of descriptionsEmiel Krahmer, Mariët Theune. [doi]

On the use of automatic speech recognition for TV captioningJordi Robert-Ribes. [doi]

An analysis of modal coupling effects during the glottal cycle: formant synthesizers from time-domain finite-difference simulationsGordon Ramsay. [doi]

Formant diphone parameter extraction utilising a labelled single-speaker databaseRobert H. Mannell. [doi]

A computational algorithm for F0 contour generation in Korean developed with prosodically labeled databases using k-toBI systemYong-Ju Lee, Sook-Hyang Lee, Jong Jin Kim, Hyun-Ju Ko, Young-Il Kim, Sanghun Kim, Jung-Cheol Lee. [doi]

Speaker-independent speech recognition using micro segment spectrum integrationKiyoaki Aikawa. [doi]

Speech recognition in car noise environments using multiple models according to noise masking levelsMyung Gyu Song, Hoi In Jung, Kab-Jong Shim, Hyung Soon Kim. [doi]

GALAXY-II: a reference architecture for conversational system developmentStephanie Seneff, Edward Hurley, Raymond Lau, Christine Pao, Philipp Schmid, Victor Zue. [doi]

STAMP: a suite of tools for analyzing multimodal system processingJosh Clow, Sharon L. Oviatt. [doi]

Determination of the vocal tract spectrum from the articulatory movements based on the search of an articulatory-acoustic databaseTokihiko Kaburagi, Masaaki Honda. [doi]

Text-independent speaker verification using automatically labelled acoustic segmentsDijana Petrovska-Delacrétaz, Jan Cernocký, Jean Hennebert, Gérard Chollet. [doi]

A study on the natural-sounding Japanese phonetic word synthesis by using the VCV-balanced word database that consists of the words uttered forcibly in two types of pitch accentRyo Mochizuki, Yasuhiko Arai, Takashi Honda. [doi]

The BBN single-phonetic-tree fast-match algorithmLong Nguyen, Richard M. Schwartz. [doi]

Automatic segmental and prosodic labeling of Mandarin speech databaseFu-Chiang Chou, Chiu-yu Tseng, Lin-Shan Lee. [doi]

Consistencies and inconsistencies between EPG and locus equation data on coarticulationMarija Tabain. [doi]

How a French TTS system can describe loanwordsFrédérique Sannier, Rabia Belrhali, Véronique Aubergé. [doi]

Effect of task complexity on search strategies for the motorola lexicus continuous speech recognition systemSreeram V. Balakrishnan. [doi]

Non-expert access to unification based speech understandingTom Brøndsted. [doi]

Estimation of models for non-native speech in computer-assisted language learning based on linear model combinationSilke M. Witt, Steve J. Young. [doi]

Smoothing and tying for Korean flexible vocabulary isolated word recognitionJae-Seung Choi, Jong-Seok Lee, Hee-Youn Lee. [doi]

Assessing high-level language in individuals with multiple sclerosis: a pilot studyKarin Brunnegaard, Katja Laakso, Lena Hartelius, Elisabeth Ahlsen. [doi]

Speech enhancement using critical band spectral subtractionLatchman Singh, Sridha Sridharan. [doi]

A detection framework for locating phonetic eventsPartha Niyogi, Partha Mitra, Man Mohan Sondhi. [doi]

Language model adaptation for spoken language systemsGiuseppe Riccardi, Alexandros Potamianos, Shrikanth Narayanan. [doi]

Initial speech recognition results using the multinet architectureEdnaldo Brigante Pizzolato, T. Jeff Reynolds. [doi]

The perception of stressed syllables in finnishJyrki Tuomainen, Jean Vroomen, Béatrice de Gelder. [doi]

Integration of talking heads and text-to-speech synthesizers for visual TTSJörn Ostermann, Mark C. Beutnagel, Ariel Fischer, Yao Wang. [doi]

Probabilistic modeling with Bayesian networks for automatic speech recognitionGeoffrey Zweig, Stuart J. Russell. [doi]

Improving the generalization performance of the MCE/GPD learningHiroshi Shimodaira, Jun Rokui, Mitsuru Nakai. [doi]

Grammatical word graph re-generation for spontaneous speech recognitionHajime Tsukada, Hirofumi Yamamoto, Toshiyuki Takezawa, Yoshinori Sagisaka. [doi]

MOOSE: management of otago speech environmentMark R. Laws, Richard Kilgour. [doi]

Acoustic confidence measures for segmenting broadcast newsJon Barker, Gethin Williams, Steve Renals. [doi]

Improving the speaker-dependency of subword-unit-based isolated word recognitionTakuya Koizumi, Shuji Taniguchi, Kazuhiro Kohtoh. [doi]

Modeling the microprosody of pitch and loudness for speech synthesis with neural networksMartti Vainio, Toomas Altosaar. [doi]

Modeling of output probability distribution to improve small vocabulary speech recognition in adverse environmentsDavid Thambiratnam, Sridha Sridharan. [doi]

A novel text-independent speaker verification method using the global speaker modelYiying Zhang, Xiaoyan Zhu. [doi]

An efficient two-pass search algorithm using word trellis indexAkinobu Lee, Tatsuya Kawahara, Shuji Doshita. [doi]

Phonetic and phonological characteristics of paralinguistic information in spoken JapaneseKikuo Maekawa. [doi]

Towards a reversible symbolic coding of intonationJean Véronis, Estelle Campione. [doi]

Error analysis and confidence measure of Chinese word segmentationChih-Chung Kuo, Kun-Yuan Ma. [doi]

Automatic recognition of Korean broadcast news speechHa-Jin Yu, Hoon Kim, Jae-Seung Choi, Joon-Mo Hong, Kew-Suh Park, Jong-Seok Lee, Hee-Youn Lee. [doi]

Enhanced ASR by acoustic feature filteringChristian Wellekens. [doi]

Lexical access for large-vocabulary speech recognitionRoger Ho-Yin Leung, Hong C. Leung. [doi]

Evaluation and implementation of a voice-activated dialing system with utterance verificationBeng Tiong Tan, Yong Gu, Trevor Thomas. [doi]

Context dependent anti subword modeling for utterance verificationPadma Ramesh, Chin-Hui Lee, Biing-Hwang Juang. [doi]

VPQ: a spoken language interface to large scale directory informationBruce Buntschuh, Candace A. Kamm, Giuseppe Di Fabbrizio, Alicia Abella, Mehryar Mohri, Shrikanth Narayanan, Ilija Zeljkovic, R. D. Sharp, Jeremy H. Wright, S. Marcus, J. Shaffer, R. Duncan, Jay G. Wilpon. [doi]

Spanish dialects: phonetic transcriptionAsunción Moreno, José B. Mariño. [doi]

A phonologically motivated method of selecting non-uniform unitsAndrew P. Breen, Peter Jackson. [doi]

Comparative evaluation of synthetic prosody with the PURR methodGerit P. Sonntag, Thomas Portele. [doi]

Noise model selection for robust speech recognitionLaura Docío Fernández, Carmen García-Mateo. [doi]

Automatic detection of prominence (as defined by listeners judgements) in read aloud dutch sentencesBarbertje M. Streefkerk, Louis C. W. Pols, Louis ten Bosch. [doi]

Voice onset time patterns in 7-, 9- and 11-year old childrenSandra P. Whiteside, Jeni Marshall. [doi]

Acoustic backing-off in the local distance computation for robust automatic speech recognitionJohan de Veth, Bert Cranen, Lou Boves. [doi]

Beyond structured dialogues: factoring out groundingPeter A. Heeman, Michael Johnston, Justin Denney, Edward C. Kaiser. [doi]

Transform coding of LSF parameters using waveletsDavor Petrinovic. [doi]

An educational dialogue system with a user controllable dialogue managerJoakim Gustafson, Patrik Elmberg, Rolf Carlson, Arne Jönsson. [doi]

An electropalatographic, kinematic, and acoustic analysis of supralaryngeal correlates of word-level prominence contrasts in EnglishJonathan Harrington, Mary E. Beckman, Janet Fletcher, Sallyanne Palethorpe. [doi]

Robust spoken dialogue systems for consumer products: a concrete applicationXavier Pouteau, Luis Arévalo. [doi]

Nonreciprocal data sharing in estimating HMM parametersXiaoqiang Luo, Frederick Jelinek. [doi]

Language independent and language adaptive large vocabulary speech recognitionTanja Schultz, Alex Waibel. [doi]

Improving pitch estimation with short duration speech samplesWilliam A. Ainsworth, Charles R. Day, Georg F. Meyer. [doi]

Phonetic invariance and phonological stability: lithuanian pitch accentsGrzegorz Dogil, Gregor Möhler. [doi]

Speech recognition using the probabilistic neural networkRaymond Low, Roberto Togneri. [doi]

Robust measurement of fundamental frequency and degree of voicingJohn N. Holmes. [doi]

Efficient computation of MMI neural networks for large vocabulary speech recognition systemsJörg Rottland, Andre Ludecke, Gerhard Rigoll. [doi]

Neural network motivation for segmental distributionEric Keller. [doi]

Japanese large-vocabulary continuous speech recognition system based on microsoft whisperHsiao-Wuen Hon, Yun-Cheng Ju, Keiko Otani. [doi]

A recursive algorithm for the forced alignment of very long audio segmentsPedro J. Moreno, Christopher F. Joerg, Jean-Manuel Van Thong, Oren Glickman. [doi]

Differential lengthening of syllabic constituents in French: the effect of accent type and speaking styleDaniel Hirst, Corine Astésano, Albert Di Cristo. [doi]

Generating emotional speech with a concatenative synthesizerErhard Rank, Hannes Pirker. [doi]

Source-extended language model for large vocabulary continuous speech recognitionTetsunori Kobayashi, Yosuke Wada, Norihiko Kobayashi. [doi]

Referential features and linguistic indirection in multimodal languageSharon L. Oviatt, Karen Kuhn. [doi]

Speech recognition based on the distance calculation between intermediate phonetic code sequences in symbolic domainKazuyo Tanaka, Hiroaki Kojima. [doi]

Hierarchical temporal decomposition: a novel approach to efficient compression of spectral characteristics of speechShahrokh Ghaemmaghami, Mohamed Deriche, Sridha Sridharan. [doi]

Analysis of disordered speech signal using wavelet transformCheol-Woo Jo, Dae-Hyun Kim. [doi]

Enhancing speech processing of Japanese learners of English utilizing time-scale expansion with constant pitchKaoru Tomita-Nakayama, Kazuo Nakayama, Masayuki Misaki. [doi]

Unsupervised training of HMMs with variable number of mixture components per stateCesar Martín del Alamo, Luis Villarrubia, Francisco Javier Gonzalez, Luis A. Hernández Gómez. [doi]

Are you my little pussy-cat? acoustic, phonetic and affective qualities of infant- and pet-directed speechDenis Burnham, Elizabeth Francis, Ute Vollmer-Conna, Christine Kitamura, Vicky Averkiou, Amanda Olley, Mary Nguyen, Cal Paterson. [doi]

Perceptual and acoustic properties of phonemes in continuous speech for different speaking rateHisao Kuwabara. [doi]

Effective structural adaptation of LVCSR systems to unseen domains using hierarchical connectionist acoustic modelsJürgen Fritsch, Michael Finke, Alex Waibel. [doi]

Spectral basis functions from discriminant analysisHynek Hermansky, Narendranath Malayath. [doi]

On robust speech analysis based on time-varying complex AR modelKeiichi Funaki, Yoshikazu Miyanaga, Koji Tochinai. [doi]

On the interaction between time and frequency filtering of speech parameters for robust speech recognitionDusan Macho, Climent Nadeu. [doi]

Temporal variables in lectures in the Japanese languageMichiko Watanabe. [doi]

Time shift invariant speech recognitionSankar Basu, Abraham Ittycheriah, Stéphane H. Maes. [doi]

Multi-Span statistical language modeling for large vocabulary speech recognitionJerome R. Bellegarda. [doi]

Effectiveness of phase-corrected rasta for continuous speech recognitionJohan de Veth, Lou Boves. [doi]

Estimation of mental lexicon size with word familiarity databaseShigeaki Amano, Tadahisa Kondo. [doi]

Comparison study on VQ codevector index assignmentJeng-Shyang Pan, Chin-Shiuh Shieh, Shu-Chuan Chu. [doi]

Comparison of language modelling techniques for Russian and EnglishEdward W. D. Whittaker, Philip C. Woodland. [doi]

A comparative study between polyclass and multiclass language modelsImed Zitouni, Kamel Smaïli, Jean-Paul Haton, Sabine Deligne, Frédéric Bimbot. [doi]

A high-performance text-independent speaker identification system based on BCDMQin Jin, Luo Si, Qixiu Hu. [doi]

Creating hidden Markov models for fast speechThilo Pfau, Günther Ruske. [doi]

Japanese forensic phonetics: non-contemporaneous within-speaker variation in natural and read-out speechYuko Kinoshita. [doi]

The use of meta-HMM in multistream HMM training for automatic speech recognitionChristian Wellekens, Jussi Kangasharju, Cedric Milesi. [doi]

Speaker verification with ensemble classifiers based on linear speech transformsJesper Østergaard Olsen. [doi]

Total quality evaluation of speech synthesis systemsJialu Zhang, Shiwei Dong, Ge Yu. [doi]

Conversational speech systems for on-board car navigation and assistancePetra Geutner, Matthias Denecke, Uwe Meier, Martin Westphal, Alex Waibel. [doi]

A model to represent propagation and radiation of higher-order modes for 3-d vocal-tract configurationKunitoshi Motoki, Hiroki Matsuzaki. [doi]

The differential status of semivowels in the acoustic phonetic realisation of tonePhil Rose. [doi]

Learning phrase-based head transduction models for translation of spoken utterancesHiyan Alshawi, Srinivas Bangalore, Shona Douglas. [doi]

Spoken language identification using the speechdat corpusDiamantino Caseiro, Isabel Trancoso. [doi]

Context dependent tree based transforms for phonetic speech recognitionBernard Doherty, Saeed Vaseghi, Paul M. McCourt. [doi]

A model for speech reverberation and intelligibility restoring filtersOwen P. Kenny, Douglas J. Nelson. [doi]

Eigenvoices for speaker adaptationRoland Kuhn, Patrick Nguyen, Jean-Claude Junqua, Lloyd Goldwasser, Nancy Niedzielski, Steven Fincke, Ken Field, Matteo Contolini. [doi]

Robust speech/non-speech detection in adverse conditions based on noise and speech statisticsLamia Karray, Jean Monné. [doi]

Speech perception in dyslexia: measurements from birth onwardsFlorien J. Koopmans-van Beinum, Caroline E. Schwippert, Cecile T. L. Kuijpers. [doi]

Relationship between lip shapes and acoustical characteristics during speechKeisuke Mori, Yorinobu Sonoda. [doi]

How far do speakers back up in repairs? a quantitatve modelElizabeth Shriberg, Andreas Stolcke. [doi]

Auditory modeling techniques for robust pitch extraction and noise reductionPiero Cosi, Stefano Pasquin, Enrico Zovato. [doi]

Within-speaker variability due to speaking mannersInger Karlsson, Tanja Bänziger, Jana Dankovicová, Tom Johnstone, Johan Lindberg, Håkan Melin, Francis Nolan, Klaus R. Scherer. [doi]

Towards a Mandarin voice memo systemHsin-Min Wang, Bor-shen Lin, Berlin Chen, Bo-Ren Bai. [doi]

External Links

Cite Key

Statistics

PDF

Tags

Researchr

The 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November - 4th December 1998

Abstract

Table of Contents