Abstract is missing.
- A statistical phonemic segment model for speech recognition based on automatic phonemic segmentationKatsura Aizawa, Chieko Furuichi. [doi]
- The use of linguistic hierarchies in speech understandingStephanie Seneff. [doi]
- Speaker detection in broadcast speech databasesAaron E. Rosenberg, Ivan Magrin-Chagnolleau, S. Parthasarathy, Qian Huang. [doi]
- A study of noise robustness for speaker independent speech recognition method using phoneme similarity vectorMasakatsu Hoshimi, Maki Yamada, Katsuyuki Niyada, Shozo Makino. [doi]
- Recognition performance of a large-scale dependency grammar language modelAdam L. Berger, Harry Printz. [doi]
- A novel robust speech recognition algorithm based on multi-models and integrated decision methodShengxi Pan, Jia Liu, Jintao Jiang, Zuoying Wang, Dajin Lu. [doi]
- Optimized POS-based language models for large vocabulary speech recognitionPetra Witschel. [doi]
- A contrastive study of lexical stress placement in singapore English and british EnglishEe Ling Low, Esther Grabe. [doi]
- Towards robust methods for spoken document retrievalKenney Ng. [doi]
- An analysis of modal coupling effects during the glottal cycle: formant synthesizers from time-domain finite-difference simulationsGordon Ramsay. [doi]
- A syllable-based generalization of Japanese accentuationHaruo Kubozono. [doi]
- Modelling tongue configuration in German vowel productionPhilip Hoole. [doi]
- A forensic phonetic investigation into non-contemporaneous variation in the f-pattern of similar-sounding speakersPhil Rose. [doi]
- Improved duration modeling of English phonemes using a root sinusoidal transformationJerome R. Bellegarda, Kim E. A. Silverman. [doi]
- An iterative, DP-based search algorithm for statistical machine translationIsmael García-Varea, Francisco Casacuberta, Hermann Ney. [doi]
- Heterogeneous measurements and multiple classifiers for speech recognitionAndrew K. Halberstadt, James R. Glass. [doi]
- The modeling and realization of natural speech generation systemFang Chen, Baozong Yuan. [doi]
- Prosody-based detection of the context of backchannel responsesHiroaki Noguchi, Yasuharu Den. [doi]
- SCAN - speech content based audio navigator: a system overviewJohn Choi, Donald Hindle, Julia Hirschberg, Ivan Magrin-Chagnolleau, Christine H. Nakatani, Fernando C. N. Pereira, Amit Singhal, Steve Whittaker. [doi]
- Analysis of disordered speech signal using wavelet transformCheol-Woo Jo, Dae-Hyun Kim. [doi]
- Towards a unified model for low bit-rate speech coding using a recognition-synthesis approachWendy J. Holmes. [doi]
- Speech separation based on the GMM PDF estimationXiao Yu, Guangrui Hu. [doi]
- An effect of adaptive beamforming on hands-free speech recognition based on 3-d viterbi searchTakeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano. [doi]
- A signal processing system for having the sound pop-out in noise thanks to the image of the speaker s lips: new advances using multi-layer perceptronsLaurent Girin, Laurent Varin, Gang Feng, Jean-Luc Schwartz. [doi]
- Common patterns in word level prosodyFrode Holm, Kazue Hata. [doi]
- Maximum a posteriori pitch trackingJames Droppo, Alex Acero. [doi]
- A proposed decision rule for speaker recognition based on fuzzy c-means clusteringDat Tran, Michael Wagner, Tu Van Le. [doi]
- Adaptive transformation for segmented parametric speech codingDamith J. Mudugamuwa, Alan B. Bradley. [doi]
- MOOSE: management of otago speech environmentMark R. Laws, Richard Kilgour. [doi]
- Using combined decisions and confidence measures for name recognition in automatic directory assistance systemsAndreas Kellner, Bernhard Rueber, Hauke Schramm. [doi]
- Multimodal language processingMichael Johnston. [doi]
- On frequency averaging for spectral analysis in speech recognitionCliment Nadeu, Felix Galindo, Jaume Padrell. [doi]
- System-user interaction and response strategy in spoken dialogue systemYohei Okato, Keiji Kato, Mikio Yamamoto, Shuichi Itahashi. [doi]
- Steps toward the integration of speaker recognition in real-world telecom applicationsAxel Glaeser, Frédéric Bimbot. [doi]
- Estimation of models for non-native speech in computer-assisted language learning based on linear model combinationSilke M. Witt, Steve J. Young. [doi]
- Hierarchical cluster language modeling with statistical rule extraction for rescoring n-best hypotheses during speech decodingPhotina Jaeyun Jang, Alexander G. Hauptmann. [doi]
- A comparative study between polyclass and multiclass language modelsImed Zitouni, Kamel Smaïli, Jean-Paul Haton, Sabine Deligne, Frédéric Bimbot. [doi]
- Effectiveness of phase-corrected rasta for continuous speech recognitionJohan de Veth, Lou Boves. [doi]
- Vocabulary-independent word confidence measure using subword featuresLi Jiang, Xuedong Huang. [doi]
- Speaker recognition based on discriminative projection modelsJesper Østergaard Olsen. [doi]
- Word clustering for a word bi-gram modelShinsuke Mori, Masafumi Nishimura, Nobuyasu Itoh. [doi]
- Non-native productions of Japanese single stops that are too long for one mora unitYasuyo Minagawa-Kawai, Shigeru Kiritani. [doi]
- Noise model selection for robust speech recognitionLaura Docío Fernández, Carmen García-Mateo. [doi]
- A novel text-independent speaker verification method using the global speaker modelYiying Zhang, Xiaoyan Zhu. [doi]
- Language modeling for content extraction in human-computer dialoguesWolfgang Reichl, Bob Carpenter, Jennifer Chu-Carroll, Wu Chou. [doi]
- The effect of modifying formant amplitudes on the perception of French vowels generated by copy synthesisAnne Bonneau, Yves Laprie. [doi]
- A linguistic analysis of repair signals in co-operative spoken dialoguesShu-Chuan Tseng. [doi]
- Nonlinear interpolation of topic models for language model adaptationKristie Seymore, Stanley F. Chen, Ronald Rosenfeld. [doi]
- A perceptual evaluation of distance measures for concatenative speech synthesisJohan Wouters, Michael W. Macon. [doi]
- Progress in speaker recognition at dragon systemsAndrés Corrada-Emmanuel, Michael Newman, Barbara Peskin, Larry Gillick, Robert Roth. [doi]
- Modeling vowel duration for Japanese text-to-speech synthesisJennifer J. Venditti, Jan P. H. van Santen. [doi]
- Using automatically-derived acoustic sub-word units in large vocabulary speech recognitionMichiel Bacchiani, Mari Ostendorf. [doi]
- Soft state-tying for HMM-based speech recognitionChristoph Neukirchen, Daniel Willett, Gerhard Rigoll. [doi]
- Missing data reconstruction for robust automatic speech recognition in the framework of hybrid HMM/ANN systemsStéphane Dupont. [doi]
- Speech recognition performance on a new voicemail transcription taskMukund Padmanabhan, Bhuvana Ramabhadran, Sankar Basu. [doi]
- A comparison of two unsupervised approaches to accent identificationMike Lincoln, Stephen Cox, Simon Ringland. [doi]
- Duration modeling for HMM-based speech synthesisTakayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura. [doi]
- Improving accent identification through knowledge of English syllable structureKay Berkling, Marc A. Zissman, Julie Vonwiller, Christopher Cleirigh. [doi]
- Towards a Mandarin voice memo systemHsin-Min Wang, Bor-shen Lin, Berlin Chen, Bo-Ren Bai. [doi]
- Unsupervised training of phone duration and energy models for text-to-speech synthesisPaul C. Bagshaw. [doi]
- On the amount and domain of focal lengthening in SwedishEva Strangert, Mattias Heldner. [doi]
- A large vocabulary continuous speech recognition hybrid system for the portuguese languageJoão Paulo Neto, Ciro Martins, Luís B. Almeida. [doi]
- Integrated recognition of words and phrase boundariesFlorian Gallwitz, Anton Batliner, Jan Buckow, Richard Huber, Heinrich Niemann, Elmar Nöth. [doi]
- Acoustic-articulatory evaluation of the upper vowel-formant region and its presumed speaker-specific potencyFrantz Clermont, Parham Mokhtari. [doi]
- Perception of concurrent approximant-vowel syllablesWilliam A. Ainsworth. [doi]
- An F0 contour control model for totally speaker driven text to speech systemTakehiko Kagoshima, Masahiro Morita, Shigenobu Seto, Masami Akamine. [doi]
- Separation of singing and piano soundsYoram Meron, Keikichi Hirose. [doi]
- Modular neural networks for low-complex phoneme recognitionAxel Glaeser. [doi]
- Improvements in speech understanding accuracy through the integration of hierarchical linguistic, prosodic, and phonological constraints in the jupiter domainGrace Chung, Stephanie Seneff. [doi]
- Phonetic-distance-based hypothesis driven lexical adaptation for transcribing multlingual broadcast newsPetra Geutner, Michael Finke, Alex Waibel. [doi]
- Speaker verification with ensemble classifiers based on linear speech transformsJesper Østergaard Olsen. [doi]
- Predictive adaptation and compensation for robust speech recognitionArun C. Surendran, Chin-Hui Lee. [doi]
- Phonetic modification of the syllable /tu/ in two spontaneous american English dialoguesNanette Veilleux, Stefanie Shattuck-Hufnagel. [doi]
- Maximum-likelihood updates of HMM duration parameters for discriminative continuous speech recognitionRathinavelu Chengalvarayan. [doi]
- Multi-Span statistical language modeling for large vocabulary speech recognitionJerome R. Bellegarda. [doi]
- A new strategy of fuzzy-neural network for Thai numeral speech recognitionChai Wutiwiwatchai, Somchai Jitapunkul, Visarut Ahkuputra, Ekkarit Maneenoi, Sudaporn Luksaneeyanawin. [doi]
- Syllable-onset acoustic properties associated with syllable-coda voicingNoël Nguyen, Sarah Hawkins. [doi]
- Spectral basis functions from discriminant analysisHynek Hermansky, Narendranath Malayath. [doi]
- Log-linear interpolation of language modelsDietrich Klakow. [doi]
- HMM topology selection for accurate acoustic and duration modelingCristina Chesta, Pietro Laface, Franco Ravera. [doi]
- Sub-band based speaker verification using dynamic recombination weightsPerasiriyan Sivakumaran, Aladdin M. Ariyaeeinia, Jill A. Hewitt. [doi]
- A very low bit rate speech coder using HMM with speaker adaptationTakashi Masuko, Keiichi Tokuda, Takao Kobayashi. [doi]
- The use of meta-HMM in multistream HMM training for automatic speech recognitionChristian Wellekens, Jussi Kangasharju, Cedric Milesi. [doi]
- Intonative structure as a determinant of word order variation in dutch verbal endgroupsMarc Swerts. [doi]
- VPQ: a spoken language interface to large scale directory informationBruce Buntschuh, Candace A. Kamm, Giuseppe Di Fabbrizio, Alicia Abella, Mehryar Mohri, Shrikanth Narayanan, Ilija Zeljkovic, R. D. Sharp, Jeremy H. Wright, S. Marcus, J. Shaffer, R. Duncan, Jay G. Wilpon. [doi]
- Emotional speech synthesis: from speech database to TTSJuan Manuel Montero, Juana M. Gutiérrez-Arriola, Sira E. Palazuelos, Emilia Enríquez, Santiago Aguilera, José Manuel Pardo. [doi]
- Reducing the OOV rate in broadcast news speech recognitionThomas Kemp, Alex Waibel. [doi]
- Duration modeling using cumulative duration probability and speaking rate compensationTae Young Yang, Ji-Sung Kim, Chungyong Lee, Dae Hee Youn, Il Whan Cha. [doi]
- Combining articulatory and acoustic information for speech recognition in noisy and reverberant environmentsKatrin Kirchhoff. [doi]
- Confidence scoring for speech understanding systemsChristine Pao, Philipp Schmid, James R. Glass. [doi]
- Language model adaptation for spoken language systemsGiuseppe Riccardi, Alexandros Potamianos, Shrikanth Narayanan. [doi]
- Using an animated talking character in a web-based city guide demonstratorGeorg Fries, Stefan Feldes, Alfred Corbet. [doi]
- Tones of a tridialectal: acoustic and perceptual data on ten linguistic tonetic contrasts between lao, nyo and standard ThaiPhil Rose. [doi]
- A detection framework for locating phonetic eventsPartha Niyogi, Partha Mitra, Man Mohan Sondhi. [doi]
- A recursive algorithm for the forced alignment of very long audio segmentsPedro J. Moreno, Christopher F. Joerg, Jean-Manuel Van Thong, Oren Glickman. [doi]
- Improving speech recognizer by broader acoustic-phonetic group classificationYoungjoo Suh, Kyuwoong Hwang, Oh-Wook Kwon, Jun Park. [doi]
- Robust spoken dialogue systems for consumer products: a concrete applicationXavier Pouteau, Luis Arévalo. [doi]
- Interfaces for speech recognition systems: the impact of vocabulary constraints and syntax on performanceKate S. Hone, David Golightly. [doi]
- Optimized stopping criteria for tree-based unit selection in concatenative synthesisAndrew Cronk, Michael W. Macon. [doi]
- Modeling dynamic prosodic variation for speaker verificationM. Kemal Sönmez, Elizabeth Shriberg, Larry P. Heck, Mitchel Weintraub. [doi]
- Don t blame it (all) on the pause: further ERP evidence for a prosody-induced garden-path in running speechKarsten Steinhauer, Kai Alter, Angela D. Friederici. [doi]
- Semi-automated incremental prototyping of spoken dialog systemsStefan Kaspar, Achim G. Hoffmann. [doi]
- Automatic classification of dialogue contexts for dialogue predictionsCosmin Popovici, Paolo Baggia, Pietro Laface, Loreta Moisa. [doi]
- Indexing and classification of TV news articles based on speech dictation using word bigramJun Ogata, Yasuo Ariki. [doi]
- High resolution decision tree based acoustic modeling beyond CARTWu Chou, Wolfgang Reichl. [doi]
- The relation between perceptual and production categories in acquisitionIan Watson. [doi]
- Automatic detection of prominence (as defined by listeners judgements) in read aloud dutch sentencesBarbertje M. Streefkerk, Louis C. W. Pols, Louis ten Bosch. [doi]
- Prosodic parameters in emotional speechKazuhito Koike, Hirotaka Suzuki, Hiroaki Saito. [doi]
- High accuracy Chinese speech recognition approach with Chinese input technology for telecommunication useYork Chung-Ho Yang, June-Jei Kuo. [doi]
- Korean prosodic break index labelling by a new mixed method of LDA and VQPyungsu Kang, Jiyoung Kang, Jinyoung Kim. [doi]
- A hierarchy probability-based visual features extraction method for speechreadingYanjun Xu, Limin Du, Guoqiang Li, Ziqiang Hou. [doi]
- Voice conversion based on parameter transformationJuana M. Gutiérrez-Arriola, Yung-Sheng Hsiao, Juan Manuel Montero, José Manuel Pardo, Donald G. Childers. [doi]
- Robust automatic continuous-speech recognition based on a voiced-unvoiced decisionHesham Tolba, Douglas D. O Shaughnessy. [doi]
- An efficient labeling tool for the Quicksig speech databaseMatti Karjalainen, Toomas Altosaar, Miikka Huttunen. [doi]
- Performance evaluation of word phrase and noun category language models for broadcast news speech recognitionKazuyuki Takagi, Rei Oguro, Kenji Hashimoto, Kazuhiko Ozeki. [doi]
- A robust tone recognition method of Chinese based on sub-syllabic F0 contoursJin-Song Zhang, Keikichi Hirose. [doi]
- Automatic prosodic labeling of 6 languagesHalewijn Vereecken, Jean-Pierre Martens, Cynthia Grover, Justin Fackrell, Bert Van Coile. [doi]
- Recent work on a preselection module for a flexible large vocabulary speech recognition system in telephone environmentJavier Ferreiros, Javier Macías Guarasa, Ascensión Gallardo-Antolín, José Colás, Ricardo de Córdoba, José Manuel Pardo, Luis Villarrubia Grande. [doi]
- Development of CAI system employing synthesized speech responsesTsubasa Shinozaki, Masanobu Abe. [doi]
- Optopalatograph: real-time feedback of tongue movement in 3DAlan Wrench, Alan D. McIntosh, Colin Watson, William J. Hardcastle. [doi]
- Integration of talking heads and text-to-speech synthesizers for visual TTSJörn Ostermann, Mark C. Beutnagel, Ariel Fischer, Yao Wang. [doi]
- Information theoretic approaches to model selectionJonathan Hamaker, Aravind Ganapathiraju, Joseph Picone. [doi]
- Vector quantizer acceleration for an automatic speech recognition applicationAntonio J. Araujo, Vitor C. Pera, Márcio N. de Souza. [doi]
- Blind clustering of speech utterances based on speaker and language characteristicsDouglas A. Reynolds, Elliot Singer, Beth A. Carlson, Gerald C. O Leary, Jack McLaughlin, Marc A. Zissman. [doi]
- Evaluation of dialog strategies for a tourist information retrieval systemLaurence Devillers, Hélène Bonneau-Maynard. [doi]
- Automatic generation of Korean pronunciation variants by multistage applications of phonological rulesJe Hun Jeon, Sunhwa Cha, Minhwa Chung, Jun Park, Kyuwoong Hwang. [doi]
- Recurrent substrings and data fusion for language recognitionHarvey Lloyd-Thomas, Eluned S. Parris, Jeremy H. Wright. [doi]
- Categorical perception of vowelsEllen Gerrits, Bert Schouten. [doi]
- IVie - a comparative transcription system for intonational variation in EnglishEsther Grabe, Francis Nolan, Kimberley J. Farrar. [doi]
- Efficient quantization of LSF parameters based on temporal decompositionSung-Joo Kim, Sangho Lee, Woo-Jin Han, Yung-Hwan Oh. [doi]
- Recognizing emotions in speech using short-term and long-term featuresYang Li, Yunxin Zhao. [doi]
- Improving the speaker-dependency of subword-unit-based isolated word recognitionTakuya Koizumi, Shuji Taniguchi, Kazuhiro Kohtoh. [doi]
- Simulated emotions: an acoustic study of voice and perturbation measuresSandra P. Whiteside. [doi]
- Favourable and unfavourable short duration segments of speech in noiseDaniel Woo. [doi]
- Syntax coordination: interaction of discourse and extrapositionsSusanne Kronenberg, Franz Kummert. [doi]
- Candidate selection based on significance testing and its use in normalisation and scoringJi-Hwan Kim, Gil-Jin Jang, Seong-Jin Yun, Yung-Hwan Oh. [doi]
- ko tok ples ensin bilong tok pisin or the TP-CLE: a first report from a pilot speech-to-speech translation project from Swedish to tok pisinRobert Eklund. [doi]
- Speaker independent speech recognition method using constrained time alignment near phoneme discriminative frameTomohiro Konuma, Tetsu Suzuki, Maki Yamada, Yoshio Ono, Masakatsu Hoshimi, Katsuyuki Niyada. [doi]
- A comparative evaluation of variance flooring techniques in HMM-based speaker verificationHåkan Melin, Johan Koolwaaij, Johan Lindberg, Frédéric Bimbot. [doi]
- Waveform interpolation coding with pitch-spaced subbandsW. Bastiaan Kleijn, Huimin Yang, Ed F. Deprettere. [doi]
- Factor analysis invariant to linear transformations of dataRamesh A. Gopinath, Bhuvana Ramabhadran, Satya Dharanipragada. [doi]
- An asymmetric stochastic language model based on multi-tagged wordsJulio Pastor, José Colás, Rubén San Segundo, José Manuel Pardo. [doi]
- Segmental and tonal processing in CantoneseHsuan-Chih Chen, Michael C. W. Yip, Sum-Yin Wong. [doi]
- Confidence measures derived from an acceptor HMMGethin Williams, Steve Renals. [doi]
- Analysis of occurrence of pauses and their durations in Japanese text readingHiroya Fujisaki, Sumio Ohno, Seiji Yamada. [doi]
- An event driven model for dialogue systemsKuansan Wang. [doi]
- Fast computation of maximum entropy / minimum divergence feature gainHarry Printz. [doi]
- Plasticity of non-native phonetic perception and production: a training studySatoshi Imaizumi, Hidemi Itoh, Yuji Tamekawa, Toshisada Deguchi, Koichi Mori. [doi]
- Use of non-verbal information in communication between human and robotMasao Yokoyama, Kazumi Aoyama, Hideaki Kikuchi, Katsuhiko Shirai. [doi]
- The design of the newspaper-based Japanese large vocabulary continuous speech recognition corpusKatunobu Itou, Mikio Yamamoto, Kazuya Takeda, Toshiyuki Takezawa, Tatsuo Matsuoka, Tetsunori Kobayashi, Kiyohiro Shikano, Shuichi Itahashi. [doi]
- Bayesian constrained frequency warping HMMS for speaker normalisationChing-Hsiang Ho, Saeed Vaseghi, Aimin Chen. [doi]
- Collection and detailed transcription of a speech database for development of language learning technologiesHarry Bratt, Leonardo Neumeyer, Elizabeth Shriberg, Horacio Franco. [doi]
- Efficiency as an organizing principle of natural speechR. J. J. H. van Son, Florien J. Koopmans-van Beinum, Louis C. W. Pols. [doi]
- Nozomi - a fast, memory-efficient stack decoder for LVCSRMike Schuster. [doi]
- A novel technique for the combination of utterance and speaker verification systems in a text-dependent speaker verification taskLeandro Rodríguez Liñares, Carmen García-Mateo. [doi]
- Representing prosodic words using statistical models of moraic transition of fundamental frequency contours of JapaneseKoji Iwano, Keikichi Hirose. [doi]
- Capturing discriminative information using multiple modeling techniquesJi Ming, Philip Hanna, Darryl Stewart, Saeed Vaseghi, F. Jack Smith. [doi]
- Tonal complexity as a dialectal feature: 25 different citation tones from four zhejiang wu dialectsSean Zhu, Phil Rose. [doi]
- A method for modeling liaison in a speech recognition system for FrenchLalit R. Bahl, S. V. De Gennaro, Pieter de Souza, E. Epstein, J. M. Le Roux, B. Lewis, Claire Waast. [doi]
- Dialect maps and dialect research; useful tools for automatic speech recognition?Arne Kjell Foldvik, Knut Kvale. [doi]
- Discriminative weighting of multi-resolution sub-band cepstral features for speech recognitionPhilip McMahon, Paul M. McCourt, Saeed Vaseghi. [doi]
- Powerful syllabic fillers for general-task keyword-spotting and unlimited-vocabulary continuous-speech recognitionRachida El Méliani, Douglas D. O Shaughnessy. [doi]
- Audio-visual segmentation for content-based retrievalDavid Pye, Nicholas J. Hollinghurst, Timothy J. Mills, Kenneth R. Wood. [doi]
- Phonetic-level mispronunciation detection in non-native Swedish speechPhilippe Langlais, Anne-Marie Öster, Björn Granström. [doi]
- Telephone speech multi-keyword spotting using fuzzy search algorithm and prosodic verificationChung-Hsien Wu, Yeou-Jiunn Chen, Yu-Chun Hung. [doi]
- Voice onset time patterns in 7-, 9- and 11-year old childrenSandra P. Whiteside, Jeni Marshall. [doi]
- Modular connectionist systems for identifying complex arabic phonetic featuresSid-Ahmed Selouani, Jean Caelen. [doi]
- Task adaptation of sub-lexical unit models using the minimum confusibility criterion on task independent databasesAlbino Nogueiras Rodriguez, José B. Mariño. [doi]
- Context dependent tree based transforms for phonetic speech recognitionBernard Doherty, Saeed Vaseghi, Paul M. McCourt. [doi]
- Spoken language identification using the speechdat corpusDiamantino Caseiro, Isabel Trancoso. [doi]
- Neural network based pronunciation modeling with applications to speech recognitionToshiaki Fukada, Takayoshi Yoshimura, Yoshinori Sagisaka. [doi]
- Perception of words with vowel reductionJohan Frid. [doi]
- Is speech the right thing for your application?Niels Ole Bernsen, Laila Dybkjær. [doi]
- SABLE: a standard for TTS markupRichard Sproat, Andrew Hunt, Mari Ostendorf, Paul Taylor, Alan W. Black, Kevin A. Lenzo, Mike Eddington. [doi]
- A language model combining trigrams and stochastic context-free grammarsJohn Gillett, Wayne Ward. [doi]
- Efficient high-order hidden Markov modellingJohan A. du Preez, D. M. Weber. [doi]
- Using linguistic knowledge to improve the design of low-bit rate LSF quantisationJohn J. Parry, Ian S. Burnett, Joe F. Chicharo. [doi]
- A name announcement algorithm with memory size and computational power constraintsZe ev Roth, Judith Rosenhouse. [doi]
- Speaker recruitment methods and speaker coverage - experiences from a large multilingual speech database collectionBørge Lindberg, Robrecht Comeyne, Christoph Draxler, Francesco Senia. [doi]
- Support vector machines for speech recognitionAravind Ganapathiraju, Jonathan Hamaker, Joseph Picone. [doi]
- Expanding a time-sensitive conversational architecture for turn-taking to handle content-driven interruptionGregory Aist. [doi]
- Improved utterance rejection using length dependent thresholdsSunil K. Gupta, Frank K. Soong. [doi]
- Linear discriminant - a new criterion for speaker normalizationMartin Westphal, Tanja Schultz, Alex Waibel. [doi]
- Efficient lexical retrieval for English text-to-speech synthesisDaniel Faulkner, Charles Bryant. [doi]
- Improvements in slovene text-to-speech synthesisTomaz Sef, Ales Dobnikar, Matjaz Gams. [doi]
- Comparative experiments to evaluate a voiced-unvoiced-based pre-processing approach to robust automatic speech recognition in low-SNR environmentsHesham Tolba, Douglas D. O Shaughnessy. [doi]
- Cooperation and competition of burst and formant transitions for the perception and identification of French stopsAdrian Neagu, Gérard Bailly. [doi]
- Low bit rate coding for speech and audio using mel linear predictive coding (MLPC) analysisYoshihisa Nakatoh, Takeshi Norimatsu, Ah Heng Low, Hiroshi Matsumoto. [doi]
- Total quality evaluation of speech synthesis systemsJialu Zhang, Shiwei Dong, Ge Yu. [doi]
- Coarticulation and degrees of freedom in the elaboration of a new articulatory plant: GENTIANEAnne Vilain, Christian Abry, Pierre Badin. [doi]
- A new fast algorithm for automatic segmentation of continuous speechIman Gholampour, Kambiz Nayebi. [doi]
- Cochlear implants in the second and third millenniaGraeme M. Clark. [doi]
- Grammatical word graph re-generation for spontaneous speech recognitionHajime Tsukada, Hirofumi Yamamoto, Toshiyuki Takezawa, Yoshinori Sagisaka. [doi]
- Interactive listening to structured speech content on the internetMakoto J. Hirayama, Taro Sugahara, Zhiyong Peng, Junichi Yamazaki. [doi]
- Spoken language understanding within dialogs using a graphical model of task structureJeremy H. Wright, Allen L. Gorin, Alicia Abella. [doi]
- Synthetic faces as a lipreading supportEva Agelfors, Jonas Beskow, Martin Dahlquist, Björn Granström, Magnus Lundeberg, Karl-Erik Spens, Tobias Öhman. [doi]
- An evaluation of keyword spotting performance utilizing false alarm rejection based on prosodic informationMasaki Ida, Ryuji Yamasaki. [doi]
- Spectral smoothing for concatenative speech synthesisDavid T. Chappell, John H. L. Hansen. [doi]
- Rapid-deployment text-to-speech in the DIPLOMAT systemKevin A. Lenzo, Christopher Hogan, Jeffrey Allen. [doi]
- Information extraction and text generation of news reports for a Swedish-English bilingual spoken dialogue systemBarbara Gawronska, David House. [doi]
- Describing intonation with a parametric modelGregor Möhler. [doi]
- Dynamic features in children s vowelsSteve Cassidy, Catherine Watson. [doi]
- Energy contour generation for a sentence using a neural network learning methodJungchul Lee, Donggyu Kang, Sanghoon Kim, Koengmo Sung. [doi]
- Improved parallel model combination based on better domain transformation for speech recognition under noisy environmentsJeih-Weih Hung, Jia-lin Shen, Lin-Shan Lee. [doi]
- Real time speaker indexing based on subspace method - application to TV news articles and debateMasafumi Nishida, Yasuo Ariki. [doi]
- The role of phonological, morphological, and orthographic knowledge in the intuitive syllabification of dutch words: a longitudinal approachDominiek Sandra, Steven Gillis. [doi]
- Suprasegmental cues for the segmentation of identical vowel sequences in JapaneseKazuhiko Kakehi, Yuki Hirose. [doi]
- Correspondence between the glottal gesture overlap pattern and vowel devoicing in JapaneseMasako Fujimoto, Emi Murano, Seiji Niimi, Shigeru Kiritani. [doi]
- A linguistic and prosodic database for data-driven Japanese TTS synthesisAtsuhiro Sakurai, Takashi Natsume, Keikichi Hirose. [doi]
- De-accentuation: linguistic environments and prosodic realizationsKai Alter, Karsten Steinhauer, Angela D. Friederici. [doi]
- Stochastic language models for speech recognition and understandingGiuseppe Riccardi, Allen L. Gorin. [doi]
- Prosodic analysis of fillers and self-repair in Japanese speechFelix C. M. Quimbo, Tatsuya Kawahara, Shuji Doshita. [doi]
- Estimation of mental lexicon size with word familiarity databaseShigeaki Amano, Tadahisa Kondo. [doi]
- Wavelet transform domain blind equalization and its application to speech analysisMunehiro Namba, Yoshihisa Ishida. [doi]
- Influence of facial views on the mcgurk effect in auditory noiseRika Kanzaki, Takashi Kato. [doi]
- The selection of pronunciation variants: comparing the performance of man and machineJudith M. Kessens, Mirjam Wester, Catia Cucchiarini, Helmer Strik. [doi]
- Segmentation and classification of broadcast news audioThomas Hain, Philip C. Woodland. [doi]
- Extended linear discriminant analysis (ELDA) for speech recognitionGünther Ruske, Robert Faltlhauser, Thilo Pfau. [doi]
- Beyond structured dialogues: factoring out groundingPeter A. Heeman, Michael Johnston, Justin Denney, Edward C. Kaiser. [doi]
- Telephone band LVCSR for hearing-impaired usersEa-Ee Jan, Raimo Bakis, Fu-Hua Liu, Michael Picheny. [doi]
- Multilateral techniques for speaker recognitionEluned S. Parris, Michael J. Carey 0002. [doi]
- A study of tones and tempo in continuous Mandarin digit strings and their application in telephone quality speech recognitionChao Wang, Stephanie Seneff. [doi]
- Comparison of spectral estimation techniques for low bit-rate speech codingDerek J. Molyneux, C. I. Parris, X. Q. Sun, Barry M. G. Cheetham. [doi]
- Word sequence pair spotting for synchronization of speech and text in production of closed-caption TV programs for the hearing impairedIchiro Maruyama, Yoshiharu Abe, Takahiro Wakao, Eiji Sawamura, Terumasa Ehara, Katsuhiko Shirai. [doi]
- Speaking-style dependent lexicalized filler model for key-phrase detection and verificationTatsuya Kawahara, Kentaro Ishizuka, Shuji Doshita, Chin-Hui Lee. [doi]
- Analysis and treatment of esophageal speech for the enhancement of its comprehensionJorge Miquélez, Rocio Sesma, Yolanda Blanco. [doi]
- Using x-gram for efficient speech recognitionAntonio Bonafonte, José B. Mariño. [doi]
- Speaker normalization with all-pass transformsJohn W. McDonough, William Byrne, Xiaoqiang Luo. [doi]
- A voice verifier for face/voice based person verification systemRongyu Qiao, Youngkyu Choi, Johnson I. Agbinya. [doi]
- Acoustic backing-off in the local distance computation for robust automatic speech recognitionJohan de Veth, Bert Cranen, Lou Boves. [doi]
- Robust entropy-based endpoint detection for speech recognition in noisy environmentsJia-lin Shen, Jeih-Weih Hung, Lin-Shan Lee. [doi]
- A Japanese-to-English speech translation system: ATR-MATRIXToshiyuki Takezawa, Tsuyoshi Morimoto, Yoshinori Sagisaka, Nick Campbell, Hitoshi Iida, Fumiaki Sugaya, Akio Yokoo, Seiichi Yamamoto. [doi]
- Modeling of output probability distribution to improve small vocabulary speech recognition in adverse environmentsDavid Thambiratnam, Sridha Sridharan. [doi]
- Automatic detection of semantic boundaries based on acoustic and lexical knowledgeMauro Cettolo, Daniele Falavigna. [doi]
- On the relationship of speech rates with prosodic units in dialogue speechKeikichi Hirose, Hiromichi Kawanami. [doi]
- Sharable software repository for Japanese large vocabulary continuous speech recognitionTatsuya Kawahara, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Katsunobu Itou, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano. [doi]
- Determination of articulatory positions from speech acoustics by applying dynamic articulatory constraintsShin Suzuki, Takeshi Okadome, Masaaki Honda. [doi]
- Estimation of voice source and vocal tract parameters using combined subspace-based and amplitude spectrum-based algorithmChang-Sheng Yang, Hideki Kasuya. [doi]
- A time-synchronous, tree-based search strategy in the acoustic fast match of an asynchronous speech recognition systemEllen Eide, Lalit R. Bahl. [doi]
- Partitioning and transcription of broadcast news dataJean-Luc Gauvain, Lori Lamel, Gilles Adda. [doi]
- Implementation of coordinative nodding behavior on spoken dialogue systemsJun-ichi Hirasawa, Noboru Miyazaki, Mikio Nakano, Takeshi Kawabata. [doi]
- Context dependent anti subword modeling for utterance verificationPadma Ramesh, Chin-Hui Lee, Biing-Hwang Juang. [doi]
- Continuous speech recognition using segmental unit input HMMs with a mixture of probability density functions and context dependencyKengo Hanai, Kazumasa Yamamoto, Nobuaki Minematsu, Seiichi Nakagawa. [doi]
- An improved decomposition method for WI using IIR wavelet filter banksNicola R. Chong, Ian S. Burnett, Joe F. Chicharo. [doi]
- Detecting topic shifts using a cache memoryBrigitte Bigi, Renato de Mori, Marc El-Bèze, Thierry Spriet. [doi]
- Joint recognition and segmentation using phonetically derived features and a hybrid phoneme modelNaomi Harte, Saeed Vaseghi, Ben P. Milner. [doi]
- An efficient two-pass search algorithm using word trellis indexAkinobu Lee, Tatsuya Kawahara, Shuji Doshita. [doi]
- Online adaptation of language models in spoken dialogue systemsBernd Souvignier, Andreas Kellner. [doi]
- A new look at HMM parameter tying for large vocabulary speech recognitionAnanth Sankar. [doi]
- An analysis of dialogues with our dialogue system through a WWW pageTadahiko Kumamoto, Akira Ito. [doi]
- The use of broad phonetic class models in speaker recognitionJohan Koolwaaij, Johan de Veth. [doi]
- Towards a Chinese text-to-speech system with higher naturalnessRen-Hua Wang, Qingfeng Liu, Yongsheng Teng, Deyu Xia. [doi]
- Improving pitch estimation with short duration speech samplesWilliam A. Ainsworth, Charles R. Day, Georg F. Meyer. [doi]
- Reducing peak search effort using two-tier pruningMark Wright, Simon Hovell, Simon Ringland. [doi]
- An educational dialogue system with a user controllable dialogue managerJoakim Gustafson, Patrik Elmberg, Rolf Carlson, Arne Jönsson. [doi]
- How disagreement expressions are used in cooperative tasksHiroyuki Yano, Akira Ito. [doi]
- A HMM-based recognition system for perceptive relevant pitch movements of spontaneous German speechChristel Brindöpke, Gernot A. Fink, Franz Kummert, Gerhard Sagerer. [doi]
- Robust and compact multilingual word recognizers using features extracted from a phoneme similarity front-endPhilippe Morin, Ted H. Applebaum, Robert Boman, Yi Zhao, Jean-Claude Junqua. [doi]
- Analyzing the effect of secondary excitations of the vocal tract on vocal intensity in different loudness conditionsPaavo Alku, Juha Vintturi, Erkki Vilkman. [doi]
- Organizing self-motivated dialogue with autonomous creaturesNoriko Suzuki, Kazuo Ishii, Michio Okada. [doi]
- Representation of voice quality features associated with talker individualityHiroshi Kido, Hideki Kasuya. [doi]
- Assimilation of place in Japanese and dutchAnne Cutler, Takashi Otake. [doi]
- Segmentation using a maximum entropy approachKishore Papineni, Satya Dharanipragada. [doi]
- Robust features for speech recognition systemsAruna Bayya, B. Yegnanarayana. [doi]
- Automatic detection of sentence boundaries and disfluencies based on recognized wordsAndreas Stolcke, Elizabeth Shriberg, Rebecca A. Bates, Mari Ostendorf, Dilek Hakkani, Madelaine Plauché, Gökhan Tür, Yu Lu. [doi]
- Resegmentation of SWITCHBOARDNeeraj Deshmukh, Aravind Ganapathiraju, Andi Gleeson, Jonathan Hamaker, Joseph Picone. [doi]
- On the effects of speech rate upon parameters of the command-response model for the fundamental frequency contours of speechSumio Ohno, Hiroya Fujisaki, Yoshikazu Hara. [doi]
- Experiments on the meaning of two pitch accent types: the pointed hat versus the accent-lending fall in dutchJohanneke Caspers. [doi]
- Text analysis for the bell labs French text-to-speech systemEvelyne Tzoukermann. [doi]
- Acoustic cues for the auditory identification of the Spanish fricative /f/Santiago Fernández, Sergio Feijóo, Ramón Balsa, Nieves Barros. [doi]
- A study on the natural-sounding Japanese phonetic word synthesis by using the VCV-balanced word database that consists of the words uttered forcibly in two types of pitch accentRyo Mochizuki, Yasuhiko Arai, Takashi Honda. [doi]
- Comparative evaluation of synthetic prosody with the PURR methodGerit P. Sonntag, Thomas Portele. [doi]
- Same talker, different languageVerna Stockmal, Danny R. Moates, Zinny S. Bond. [doi]
- Influence of the speaking style and the noise spectral tilt on the lombard reflex and automatic speech recognitionJean-Claude Junqua, Steven Fincke, Ken Field. [doi]
- The efficiency of multimodal interaction: a case studyPhilip R. Cohen, Michael Johnston, David McGee, Sharon L. Oviatt, Josh Clow, Ira A. Smith. [doi]
- Unsupervised training of a speech recognizer using TV broadcastsThomas Kemp, Alex Waibel. [doi]
- A new confidence measure based on rank-ordering subphone scoresQiguang Lin, Subrata Das, David Lubensky, Michael Picheny. [doi]
- A four layer sharing HMM system for very large vocabulary isolated word recognitionRuxin Chen, Miyuki Tanaka, Duanpei Wu, Lex Olorenshaw, Mariscela Amador. [doi]
- A novel iterative signal enhancement algorithm for noise reduction in speechSimon Doclo, Ioannis Dologlou, Marc Moonen. [doi]
- Measuring the dynamic encoding of speaker identity and dialect in prosodic parametersMichael Barlow, Michael Wagner. [doi]
- Performance improvements through combining phone- and syllable-scale information in automatic speech recognitionSu-Lin Wu, Brian Kingsbury, Nelson Morgan, Steven Greenberg. [doi]
- On variable sampling frequencies in speech recognitionFu-Hua Liu, Michael Picheny. [doi]
- An adaptive beamforming microphone array system using a blind deconvolutionJin-Nam Park, Tsuyoshi Usagawa, Masanao Ebata. [doi]
- Comparison study on VQ codevector index assignmentJeng-Shyang Pan, Chin-Shiuh Shieh, Shu-Chuan Chu. [doi]
- Effective structural adaptation of LVCSR systems to unseen domains using hierarchical connectionist acoustic modelsJürgen Fritsch, Michael Finke, Alex Waibel. [doi]
- A robust dialogue model for spoken dialogue processingMasahiro Araki, Shuji Doshita. [doi]
- Speech recognition based on the distance calculation between intermediate phonetic code sequences in symbolic domainKazuyo Tanaka, Hiroaki Kojima. [doi]
- Controlling a HIFI with a continuous speech understanding systemJavier Ferreiros, José Colás, Javier Macías Guarasa, Alejandro Ruiz, José Manuel Pardo. [doi]
- The interactive systems labs view4you video indexing systemThomas Kemp, Petra Geutner, Michael Schmidt, Borislav Tomaz, Manfred Weber, Martin Westphal, Alex Waibel. [doi]
- Two-pass utterance verification algorithm for long natural numbers recognitionJavier Caminero, Eduardo López, Luis A. Hernández Gómez. [doi]
- SIVHA, visual speech synthesis systemYolanda Blanco, Maria Cuellar, Arantxa Villanueva, Fernando Lacunza, Rafael Cabeza, Beatriz Marcotegui. [doi]
- Articulatory, acoustic and perceptual aspects of fricative-stop coarticulationNoël Nguyen, Alan Wrench, Fiona Gibbon, William J. Hardcastle. [doi]
- Audio and audio-visual perception of consonants disturbed by white noise and cocktail party László Czap. [doi]
- Inference of missing spectrographic features for robust speech recognitionBhiksha Raj, Rita Singh, Richard M. Stern. [doi]
- On optimum normalization method used for speaker verificationWeijie Liu, Toshihiro Isobe, Naoki Mukawa. [doi]
- Plug and play software for designing high-level speech processing systemsThierry Dutoit, Juergen Schroeter. [doi]
- Interfacing of CASA and partial recognition based on a multistream techniqueFrédéric Berthommier, Hervé Glotin, Emmanuel Tessier, Hervé Bourlard. [doi]
- A comparative study of speaker verification systems using the polycost databaseTomas Nordström, Håkan Melin, Johan Lindberg. [doi]
- A context-dependent approach for speaker verification using sequential decisionHideki Noda, Katsuya Harada, Eiji Kawaguchi, Hidefumi Sawai. [doi]
- On different functions of repetitive utterancesMarc Swerts, Hanae Koiso, Atsushi Shimojima, Yasuhiro Katagiri. [doi]
- Pragmatic characteristics of infant directed speechSudaporn Luksaneeyanawin, Chayada Thanavisuth, Suthasinee Sittigasorn, Onwadee Rukkarangsarit. [doi]
- Spotting (different types of) words in (different types of) contextJames M. McQueen, Anne Cutler. [doi]
- Pausing in Swedish spontaneous speechPetra Hansson. [doi]
- Predictive speaker adaptation and its prior trainingDieu Tran, Ken-ichi Iso. [doi]
- Effects of phonetic quality and duration on perceptual acceptability of temporal changes in speechHiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka. [doi]
- Fast decoding for statistical machine translationYe-Yi Wang, Alex Waibel. [doi]
- Perceptual and acoustic properties of phonemes in continuous speech for different speaking rateHisao Kuwabara. [doi]
- Feature decorrelation methods in speech recognition. a comparative studyEloi Batlle, Climent Nadeu, José A. R. Fonollosa. [doi]
- Phrase accents revisited: comparative evidence from standard and cypriot greekAmalia Arvaniti. [doi]
- Evidence of dual-route phonetic encoding from apraxia of speech: implications for phonetic encoding modelsRosemary A. Varley, Sandra P. Whiteside. [doi]
- The UPC text-to-speech system for Spanish and catalanAntonio Bonafonte, Ignasi Esquerra, Albert Febrer, José A. R. Fonollosa, Francesc Vallverdú. [doi]
- A comparative study of hybrid modelling techniques for improved telephone speech recognitionRathinavelu Chengalvarayan. [doi]
- Categorical perception: important phenomenon or lasting myth?Dominic W. Massaro. [doi]
- Speaker-independent speech recognition using micro segment spectrum integrationKiyoaki Aikawa. [doi]
- Referential features and linguistic indirection in multimodal languageSharon L. Oviatt, Karen Kuhn. [doi]
- Acoustic indicators of topic segmentationJulia Hirschberg, Christine H. Nakatani. [doi]
- New prosodic control rules for expressive synthetic speechOsamu Mizuno, Shin ya Nakajima. [doi]
- Calibration of machine scores for pronunciation gradingHoracio Franco, Leonardo Neumeyer. [doi]
- Emergent computational dialogue management architecture for task-oriented spoken dialogue systemsTakeshi Kawabata. [doi]
- Frequency analysis of phonetic units for concatenative synthesis in catalanIgnasi Esquerra, Albert Febrer, Climent Nadeu. [doi]
- On robust sequential estimator based on t-distribution with forgetting factor for speech analysisJoohun Lee, Ki Yong Lee. [doi]
- On the use of F0 features in automatic segmentation for speech synthesisTakashi Saito. [doi]
- Hierarchical neural networks (HNN) for Chinese continuous speech recognitionYing Jia, Limin Du, Ziqiang Hou. [doi]
- Speaker recognition using residual signal of linear and nonlinear prediction modelsMarcos Faúndez-Zanuy, Daniel Rodriguez-Porcheron. [doi]
- Language identification incorporating lexical informationDriss Matrouf, Martine Adda-Decker, Lori Lamel, Jean-Luc Gauvain. [doi]
- Recognition from GSM digital speechAscensión Gallardo-Antolín, Fernando Díaz-de-María, Francisco J. Valverde-Albacete. [doi]
- Automatic labelling of German prosodyStefan Rapp. [doi]
- TRAPS - classifiers of temporal patternsHynek Hermansky, Sangita Sharma. [doi]
- Reconciling two competing views on contrastivenessEmiel Krahmer, Marc Swerts. [doi]
- Assimilation and anticipation in word perceptionHugo Quené, Maya van Rossum, Mieke van Wijck. [doi]
- An algorithm for automatic generation of Mandarin phonetic balanced corpusJyh-Shing Shyuu, Jhing-Fa Wang. [doi]
- Automatic segmental and prosodic labeling of Mandarin speech databaseFu-Chiang Chou, Chiu-yu Tseng, Lin-Shan Lee. [doi]
- A novel method of formant analysis and glottal inverse filteringSteve Pearson. [doi]
- GALAXY-II: a reference architecture for conversational system developmentStephanie Seneff, Edward Hurley, Raymond Lau, Christine Pao, Philipp Schmid, Victor Zue. [doi]
- Generalized phone modeling based on piecewise linear segment latticeHiroaki Kojima, Kazuyo Tanaka. [doi]
- SALSA version 1.0: a speech-based web browser for hong kong EnglishPascale Fung, Chi Shun Cheung, Kwok Leung Lam, Wai Kat Liu, Yuen Yee Lo. [doi]
- Duration compensation in non-adjacent consonant and temporal regularityHee-Sun Kim. [doi]
- From novice to expert: the effect of tutorials on user expertise with spoken dialogue systemsCandace A. Kamm, Diane J. Litman, Marilyn A. Walker. [doi]
- Speech technology in clinical environmentsJan van Doorn, Sharynne McLeod, Elise Baker, Alison Purcell, William Thorpe. [doi]
- The provision of corrective feedback in a spoken dialogue CALL systemSarah Davies, Massimo Poesio. [doi]
- Vowel quality in spontaneous speech: what makes a good vowel?Matthew P. Aylett, Alice Turk. [doi]
- A silence/noise/music/speech splitting algorithmClaude Montacié, Marie-José Caraty. [doi]
- Selection of the optimal structure of the continuous HMM using the genetic algorithmTomio Takara, Yasushi Iha, Itaru Nagayama. [doi]
- Frequency domain binaural model as the front end of speech recognition systemTsuyoshi Usagawa, Kenji Sakai, Masanao Ebata. [doi]
- A model for speech reverberation and intelligibility restoring filtersOwen P. Kenny, Douglas J. Nelson. [doi]
- Utterance generation for transaction dialoguesJoris Hulstijn, Arjan van Hessen. [doi]
- The intellimedia workbench - a generic environment for multimodal systemsTom Brøndsted, Lars Bo Larsen, Michael Manthey, Paul McKevitt, Thomas B. Moeslund, Kristian G. Olesen. [doi]
- On the reduction of concatenation artefacts in diphone synthesisEsther Klabbers, Raymond N. J. Veldhuis. [doi]
- Linguistically engineered tools for speech recognition error analysisCarol Van Ess-Dykema, Klaus Ries. [doi]
- Creating speaker independent HMM models for restricted database using STRAIGHT-TEMPO morphingAlexandre Girardi, Kiyohiro Shikano, Satoshi Nakamura. [doi]
- Text-independent speaker identification and verification using the TIMIT databaseNuala C. Ward, Dominik R. Dersch. [doi]
- Improved parameter tying for efficient acoustic model evaluation in large vocabulary continuous speech recognitionJacques Duchateau, Kris Demuynck, Dirk Van Compernolle, Patrick Wambacq. [doi]
- A study on the recognition of low bit-rate encoded speechAn-Tzyh Yu, Hsiao-Chuan Wang. [doi]
- Creating hidden Markov models for fast speechThilo Pfau, Günther Ruske. [doi]
- An MRI study on the relationship between oral cavity shape and larynx positionKiyoshi Honda, Mark Tiede. [doi]
- The use of confidence measures in unsupervised adaptation of speech recognizersTasos Anastasakos, Sreeram V. Balakrishnan. [doi]
- Weighted parallel model combination for noisy speech recognitionTai-Hwei Hwang, Hsiao-Chuan Wang. [doi]
- High-speed speaker adaptation using phoneme dependent tree-structured speaker clusteringMotoyuki Suzuki, Toshiaki Abe, Hiroki Mori, Shozo Makino, Hirotomo Aso. [doi]
- A statistical study of pitch target points in five languagesEstelle Campione, Jean Véronis. [doi]
- Speaker verification on the polycost database using frequency filtered spectral energiesJavier Hernando, Climent Nadeu. [doi]
- A language for creating speech applicationsAndrew N. Pargellis, Qiru Zhou, Antoine Saad, Chin-Hui Lee. [doi]
- Product-code vector quantization of cepstral parameters for speech recognition over the WWWVassilios Digalakis, Leonardo Neumeyer, Manolis Perakakis. [doi]
- SQEL: a multilingual and multifunctional dialogue systemMaria Aretoulaki, Stefan Harbeck, Florian Gallwitz, Elmar Nöth, Heinrich Niemann, Jozef Ivanecký, Ivo Ipsic, Nikola Pavesic, Václav Matousek. [doi]
- MSF format for the representation of speech synchronized moving imageCheol-Woo Jo. [doi]
- Can we hear smile?Marc Schröder, Véronique Aubergé, Marie-Agnès Cathiard. [doi]
- Hidden Markov models for trajectory modelingRukmini Iyer, Herbert Gish, Man-Hung Siu, George Zavaliagkos, Spyros Matsoukas. [doi]
- Cantilever-type force-sensor-mounted palatal plate for measuring palatolingual contact stress and pattern during speech phonationMasafumi Matsumura, Takuya Niikawa, Takao Tanabe, Takashi Tachimura, Takeshi Wada. [doi]
- Modeling the microprosody of pitch and loudness for speech synthesis with neural networksMartti Vainio, Toomas Altosaar. [doi]
- Cross-language merged speech units and their descriptive phonetic correlatesPaul Dalsgaard, Ove Andersen, William J. Barry. [doi]
- Topic recognition for news speech based on keyword spottingYoichi Yamashita, Toshikatsu Tsunekawa, Riichiro Mizoguchi. [doi]
- A multimodal-input multimedia-output guidance system: MMGSToshiyuki Takezawa, Tsuyoshi Morimoto. [doi]
- Using untranscribed training data to improve performanceGeorge Zavaliagkos, Man-Hung Siu, Thomas Colthurst, Jayadev Billa. [doi]
- Improvement on connected numbers recognition using prosodic informationEduardo López, Javier Caminero, Ismael Cortázar, Luis A. Hernández Gómez. [doi]
- Rescoring multiple pronunciations generated from spelled wordsRoland Kuhn, Jean-Claude Junqua, Philip D. Martzen. [doi]
- A practical perceptual frequency autoregressive HMM enhancement systemBeth Logan, Tony Robinson. [doi]
- Performance and optimization of the SEEVOC algorithmWeihua Zhang, W. Harvey Holmes. [doi]
- A comparison of Thai speech recognition systems using hidden Markov model, neural network, and fuzzy-neural networkVisarut Ahkuputra, Somchai Jitapunkul, Nutthacha Jittiwarangkul, Ekkarit Maneenoi, Sawit Kasuriya. [doi]
- Speech pre-processing against intentional imposture in speaker recognitionDominique Genoud, Gérard Chollet. [doi]
- Additional use of phoneme duration hypotheses in automatic speech segmentationKarlheinz Stöber, Wolfgang Hess. [doi]
- The effect of background knowledge on first and second language comprehension difficultyMichael D. Tyler. [doi]
- Cluster adaptive training for speech recognitionMark J. F. Gales. [doi]
- Coherence-based subband decomposition for robust speech and speaker recognition in noisy and reverberant roomsJoaquin Gonzalez-Rodriguez, Santiago Cruz-Llanas, Javier Ortega-Garcia. [doi]
- A new linear predictive method for compression of speech signalsPaavo Alku, Susanna Varho. [doi]
- How far do speakers back up in repairs? a quantitatve modelElizabeth Shriberg, Andreas Stolcke. [doi]
- Magnetic resonance measurements of the velum port openingDidier Demolin, Véronique Lecuit, Thierry Metens, Bruno Nazarian, Alain Soquet. [doi]
- A synthesis method based on concatenation of demisyllables and a residual excited vocal tract modelSteve Pearson, Nick Kibre, Nancy Niedzielski. [doi]
- Learning phrase-based head transduction models for translation of spoken utterancesHiyan Alshawi, Srinivas Bangalore, Shona Douglas. [doi]
- Statistical integration of temporal filter banks for robust speech recognition using linear discriminant analysis (LDA)Jia-lin Shen, Wen-Liang Hwang. [doi]
- Thai polysyllabic word recognition using fuzzy-neural networkChai Wutiwiwatchai, Somchai Jitapunkul, Visarut Ahkuputra, Ekkarit Maneenoi, Sudaporn Luksaneeyanawin. [doi]
- Restoration of hyperbaric speech by correction of the formants and the pitchLaure Charonnat, Michel Guitton, Joel Crestel, Gerome Allée. [doi]
- Text-independent speaker recognition using multiple information sourcesKonstantin P. Markov, Seiichi Nakagawa. [doi]
- Determination of the vocal tract spectrum from the articulatory movements based on the search of an articulatory-acoustic databaseTokihiko Kaburagi, Masaaki Honda. [doi]
- Effects of shapes of radiational aperture on radiation characteristicsHiroki Matsuzaki, Kunitoshi Motoki, Nobuhiro Miki. [doi]
- Phonetic invariance and phonological stability: lithuanian pitch accentsGrzegorz Dogil, Gregor Möhler. [doi]
- Combining connectionist multi-band and full-band probability streams for speech recognition of natural numbersNikki Mirghafori, Nelson Morgan. [doi]
- Improved feature decorrelation for HMM-based speech recognitionKris Demuynck, Jacques Duchateau, Dirk Van Compernolle, Patrick Wambacq. [doi]
- A spoken dialogue system utilizing spatial informationAnnika Flycht-Eriksson, Arne Jönsson. [doi]
- Global optimisation of neural network models via sequential sampling-importance resamplingJoão F. G. de Freitas, Sue E. Johnson, Mahesan Niranjan, Andrew H. Gee. [doi]
- User evaluation of the mask kioskLori Lamel, Samir Bennacef, Jean-Luc Gauvain, Hervé Dartigues, Jean-Noel Temem. [doi]
- A computational algorithm for F0 contour generation in Korean developed with prosodically labeled databases using k-toBI systemYong-Ju Lee, Sook-Hyang Lee, Jong Jin Kim, Hyun-Ju Ko, Young-Il Kim, Sanghun Kim, Jung-Cheol Lee. [doi]
- Representing the environments for phonological processes in an accent-independent lexicon for synthesis of EnglishSusan Fitt, Stephen Isard. [doi]
- Japanese large-vocabulary continuous speech recognition system based on microsoft whisperHsiao-Wuen Hon, Yun-Cheng Ju, Keiko Otani. [doi]
- Vowel separation using the reassigned amplitude-modulation spectrumDekun Yang, Georg F. Meyer, William A. Ainsworth. [doi]
- Natural language call routing: a robust, self-organizing approachBob Carpenter, Jennifer Chu-Carroll. [doi]
- Prosody prediction for speech synthesis using transformational rule-based learningCameron S. Fordyce, Mari Ostendorf. [doi]
- Dynamic vs. static spectral detail in the perception of gated stopsMichael Kiefte, Terrance M. Nearey. [doi]
- Consistencies and inconsistencies between EPG and locus equation data on coarticulationMarija Tabain. [doi]
- Using automatic speech recognition and its possible effects on the voiceChristel G. de Bruijn, Sandra P. Whiteside, P. A. Cudd, D. Syder, K. M. Rosen, L. Nord. [doi]
- HMM-based visual speech recognition using intensity and location normalizationOscar Vanegas, Akiji Tanaka, Keiichi Tokuda, Tadashi Kitamura. [doi]
- Text segmentation and topic tracking on broadcast news via a hidden Markov model approachPaul van Mulbregt, Ira Carp, Lawrence Gillick, Steve Lowe, Jon Yamron. [doi]
- Techniques for capturing temporal variations in speech signals with fixed-rate processingSatya Dharanipragada, Ramesh A. Gopinath, Bhaskar D. Rao. [doi]
- Robust speech/non-speech detection in adverse conditions based on noise and speech statisticsLamia Karray, Jean Monné. [doi]
- Local speech rate as a combination of syllable and phone rateHartmut R. Pfitzinger. [doi]
- Extraction of the dialog act and the topic from utterances in a spoken dialog systemYasuhisa Niimi, Noboru Takinaga, Takuya Nishimoto. [doi]
- A phonetic and acoustic study of babbling in an Italian childClaudio Zmarich, Roberta Lanni. [doi]
- Recovering gestures from speech signals: a preliminary study for nasal vowelsSolange Rossato, Gang Feng, Rafael Laboissière. [doi]
- Quantitative influence of speech variability factors for automatic speaker verification in forensic tasksJavier Ortega-Garcia, Santiago Cruz-Llanas, Joaquin Gonzalez-Rodriguez. [doi]
- Incorporating linguistic knowledge into automatic dialect identification of SpanishLisa Yanguas, Gerald C. O Leary, Marc A. Zissman. [doi]
- Automatic language identification with perceptually guided training and recurrent neural networksJerome Braun, Haim Levkowitz. [doi]
- Fuzzy-integration based normalization for speaker verificationTuan Pham, Michael Wagner. [doi]
- Laryngoscopic analysis of pharyngeal articulations and larynx-height voice quality settingsJohn H. Esling. [doi]
- The applicability of adaptive language modelling for the broadcast news taskPhilip Clarkson, Tony Robinson. [doi]
- Exploiting transitions and focussing on linguistic properties for ASRJacques C. Koreman, William J. Barry, Bistra Andreeva. [doi]
- Auditory modeling techniques for robust pitch extraction and noise reductionPiero Cosi, Stefano Pasquin, Enrico Zovato. [doi]
- Recognition-based word counting for reliable barge-in and early endpoint detection in continuous speech recognitionAnand R. Setlur, Rafid A. Sukkar. [doi]
- Spanish dialects: phonetic transcriptionAsunción Moreno, José B. Mariño. [doi]
- Segmental duration control based on an articulatory modelYoshinori Shiga, Hiroshi Matsuura, Tsuneo Nitta. [doi]
- The perception of stressed syllables in finnishJyrki Tuomainen, Jean Vroomen, Béatrice de Gelder. [doi]
- Human vs. machine speaker identification with telephone speechAstrid Schmidt-Nielsen, Thomas H. Crystal. [doi]
- Real-time probabilistic segmentation for segment-based speech recognitionSteven C. Lee, James R. Glass. [doi]
- Acoustic and affective qualities of IDS in EnglishChristine Kitamura, Denis Burnham. [doi]
- On the application of the AM-FM model for the recovery of missing frequency bands of telephone speechHesham Tolba, Douglas D. O Shaughnessy. [doi]
- The impact of regional variety upon specific word categories in spontaneous GermanSusanne Burger, Daniela Oppermann. [doi]
- Dealing with out-of-vocabulary words and speech disfluencies in an n-gram based speech understanding systemAtsuhiko Kai, Yoshifumi Hirose, Seiichi Nakagawa. [doi]
- Prosodic vs. segmental contributions to naturalness in a diphone synthesizerH. Timothy Bunnell, Steve R. Hoskins, Debra Yarrington. [doi]
- Speech driven 3-d face point trajectory synthesis algorithmLevent M. Arslan, David Talkin. [doi]
- Dovetailing of acoustics and prosody in spontaneous speech recognitionJan Buckow, Anton Batliner, Richard Huber, Elmar Nöth, Volker Warnke, Heinrich Niemann. [doi]
- A language modeling based on a hierarchical approach: m_n^vImed Zitouni. [doi]
- Articulability of two consecutive morae in Japanese speech production: evidence from sound exchange errors in spontaneous speechYasushi Terao, Tadao Murata. [doi]
- Wavelet transform-based speech enhancementEliathamby Ambikairajah, Graham Tattersall, Andrew Davis. [doi]
- Control of larynx height in vowel productionPhilip Hoole, Christian Kroos. [doi]
- Crosslinguistic disfluency modelling: a comparative analysis of Swedish and american English human-human and human-machine dialoguesRobert Eklund, Elizabeth Shriberg. [doi]
- Prosody and voice quality in the expression of emotionsElisabeth Zetterholm. [doi]
- The acquisition of Japanese compound accent ruleAyako Shirose, Haruo Kubozono, Shigeru Kiritani. [doi]
- Voice dictation in the secondary school classroomMichael F. McTear, Eamonn A. O Hare. [doi]
- Automated captioning of television programs: development and analysis of a soundtrack corpusIngrid Ahmer, Robin W. King. [doi]
- Language independent and language adaptive large vocabulary speech recognitionTanja Schultz, Alex Waibel. [doi]
- Smoothing and tying for Korean flexible vocabulary isolated word recognitionJae-Seung Choi, Jong-Seok Lee, Hee-Youn Lee. [doi]
- Nonreciprocal data sharing in estimating HMM parametersXiaoqiang Luo, Frederick Jelinek. [doi]
- A unified framework for sublexical and linguistic modelling supporting flexible vocabulary speech understandingRaymond Lau, Stephanie Seneff. [doi]
- Temporal organization of speech for normal and fast ratesGeetha Krishnan, Wayne Ward. [doi]
- Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processingNobuaki Minematsu, Seiichi Nakagawa. [doi]
- Recognition of vowels in fricative contextSantiago Fernández, Sergio Feijóo, Ramón Balsa, Nieves Barros. [doi]
- Phonetic alignment: speech synthesis based vs. hybrid HMM/ANNFabrice Malfrère, Olivier Deroo, Thierry Dutoit. [doi]
- Combination of confidence measures in isolated word recognitionJ. G. A. Dolfing, Andreas Wendemuth. [doi]
- A perceptive measure of pure prosody linguistic functions with reiterant sentencesAlbert Rilliard, Véronique Aubergé. [doi]
- What you see is (almost) what you hear: design principles for user interfaces for accessing speech archivesSteve Whittaker, John Choi, Julia Hirschberg, Christine H. Nakatani. [doi]
- Same news is good news: automatically collecting reoccurring radio news storiesStefan Rapp, Grzegorz Dogil. [doi]
- Real-time recognition of broadcast newsGary Cook, Tony Robinson, James Christie. [doi]
- A fast method of producing talking head mouth shapes from real speechAndrew P. Breen, O. Gloaguen, P. Stern. [doi]
- Forming generic models of speech for uniform database accessToomas Altosaar, Martti Vainio. [doi]
- Perceived Swedish vowel quantity: effects of postvocalic consonant durationDawn M. Behne, Peter E. Czigler, Kirk P. H. Sullivan. [doi]
- Generating emotional speech with a concatenative synthesizerErhard Rank, Hannes Pirker. [doi]
- A schema based approach to dialog controlPaul C. Constantinides, Scott Hansma, Chris Tchou, Alexander I. Rudnicky. [doi]
- Making the most of multiplicity: a multi-parser multi-strategy architecture for the robust processing of spoken languageTobias Ruland, C. J. Rupp, Jörg Spilker, Hans Weber, Karsten L. Worm. [doi]
- Keyword extraction of radio news using domain identification based on categories of an encyclopediaYoshimi Suzuki, Fumiyo Fukumoto, Yoshihiro Sekiguchi. [doi]
- Investigating the syntactic characteristics of English tone unitsAlex Chengyu Fang, Jill House, Mark Huckvale. [doi]
- Word-based acoustic confidence measures for large-vocabulary speech recognitionAsela Gunawardana, Hsiao-Wuen Hon, Li Jiang. [doi]
- A high-performance text-independent speaker identification system based on BCDMQin Jin, Luo Si, Qixiu Hu. [doi]
- On the influence of hyperarticulated speech on recognition performanceHagen Soltau, Alex Waibel. [doi]
- Multi-level rhythm control for speech synthesis using hybrid data driven and rule-based approachesOliver Jokisch, Diane Hirschfeld, Matthias Eichner, Rüdiger Hoffmann. [doi]
- Text-to-speech voice adaptation from sparse training dataAlexander Kain, Michael W. Macon. [doi]
- Phonological units in speech segmentation and phonological awarenessTakashi Otake, Kiyoko Yoneyama. [doi]
- The BBN single-phonetic-tree fast-match algorithmLong Nguyen, Richard M. Schwartz. [doi]
- Robust speech recognition using discriminative stream weighting and parameter interpolationStephen M. Chu, Yunxin Zhao. [doi]
- An instantaneous-frequency-based pitch extraction method for high-quality speech transformation: revised TEMPO in the STRAIGHT-suiteHideki Kawahara, Alain de Cheveigné, Roy D. Patterson. [doi]
- Data-driven PMC and Bayesian learning integration for fast model adaptation in noisy conditionsStefano Crafa, Luciano Fissore, Claudio Vair. [doi]
- A three-dimensional linear articulatory model based on MRI dataPierre Badin, Gérard Bailly, Monica Raybaudi, Christoph Segebarth. [doi]
- Non-linear probability estimation method used in HMM for modeling frame correlationQing Guo, Fang Zheng, Jian Wu, Wenhu Wu. [doi]
- Dual-route phonetic encoding: some acoustic evidenceSandra P. Whiteside, Rosemary A. Varley. [doi]
- Improving the noise and spectral robustness of an isolated-word recognizer using an auditory-model front endMartin Hunke, Meeran Hyun, Steve Love, Thomas Holton. [doi]
- Enhancement techniques to improve the intelligibility of consonants in noise : speaker and listener effectsValérie Hazan, Andrew Simpson, Mark Huckvale. [doi]
- Training speech through visual feedback patternsJan Nouza. [doi]
- A bimodal Korean address entry/retrieval systemHyun-Yeol Chung, Cheol-Jun Hwang, Shi-wook Lee. [doi]
- Trajectory formation of articulatory movements for a given sequence of phonemesTakeshi Okadome, Tokihiko Kaburagi, Masaaki Honda. [doi]
- MIMIC : a voice-adaptive phonetic-tree speech synthesiserAimin Chen, Saeed Vaseghi, Charles Ho. [doi]
- Fuzzy Gaussian mixture models for speaker recognitionDat Tran, Tu Van Le, Michael Wagner. [doi]
- Lexical access for large-vocabulary speech recognitionRoger Ho-Yin Leung, Hong C. Leung. [doi]
- HMM-based smoothing for concatenative speech synthesisMike Plumpe, Alex Acero, Hsiao-Wuen Hon, Xuedong Huang. [doi]
- Multi-lingual concatenative speech synthesisNick Campbell. [doi]
- Acoustic confidence measures for segmenting broadcast newsJon Barker, Gethin Williams, Steve Renals. [doi]
- Probabilistic dialogue act extraction for concept based multilingual translation systemsToshiaki Fukada, Detlef Koll, Alex Waibel, Kouichi Tanigaki. [doi]
- SHEEP, GOATS, LAMBS and WOLVES: a statistical analysis of speaker performance in the NIST 1998 speaker recognition evaluationGeorge R. Doddington, Walter Liggett, Alvin F. Martin, Mark A. Przybocki, Douglas A. Reynolds. [doi]
- Evaluation and implementation of a voice-activated dialing system with utterance verificationBeng Tiong Tan, Yong Gu, Trevor Thomas. [doi]
- Recognition of connected digit speech in Japanese collected over the telephone networkHisashi Kawai, Norio Higuchi. [doi]
- A flexible method of creating HMM using block-diagonalization of covariance matricesRyosuke Koshiba, Mitsuyoshi Tachimori, Hiroshi Kanazawa. [doi]
- Techniques for accurate automatic annotation of speech waveformsStephen Cox, Richard Brady, Peter Jackson. [doi]
- Reduction of English function words in switchboardDaniel Jurafsky, Alan Bell, Eric Fosler-Lussier, Cynthia Girand, William Raymond. [doi]
- On loops and articulatory biomechanicsPascal Perrier, Yohan Payan, Joseph S. Perkell, Frédéric Jolly, Majid Zandipour, Melanie Matthies. [doi]
- Time as a factor in the acoustic variation of schwaWilliam J. Barry. [doi]
- A new synthetic speech/sound control languageOsamu Mizuno, Shin ya Nakajima. [doi]
- Morphological modeling of word classes for language modelsUlla Uebler, Heinrich Niemann. [doi]
- Robust measurement of fundamental frequency and degree of voicingJohn N. Holmes. [doi]
- Speech perception and spoken language in children with impaired hearingPeter J. Blamey, Julia Sarant, Tanya Serry, Roger Wales, Christopher James, Johanna Barry, Graeme M. Clark, M. Wright, R. Tooher, C. Psarros, G. Godwin, M. Rennie, T. Meskin. [doi]
- Automatic generation of visual scenarios for spoken corpora acquisitionDemetrio Aiello, Cristina Delogu, Renato de Mori, Andrea Di Carlo, Marina Nisi, Silvia Tummeacciu. [doi]
- Phonetic investigation of boundary pitch movements in JapaneseKazuaki Maeda, Jennifer J. Venditti. [doi]
- Incremental on-line speaker adaptation in adverse conditionsOlli Viikki, Kari Laurila. [doi]
- The influence of accents in australian English vowels and their relation to articulatory tract parametersDominik R. Dersch, Christopher Cleirigh, Julie Vonwiller. [doi]
- Multi-channel pulsation strategy for electric stimulation of cochleaShigeyoshi Kitazawa, Hiroyuki Kirihata, Tatsuya Kitamura. [doi]
- Heads and tails in word perception: evidence for early-to-late processing in listening and readingSieb G. Nooteboom, Meinou van Dijk. [doi]
- Discriminative training of GMM using a modified EM algorithm for speaker recognitionKonstantin P. Markov, Seiichi Nakagawa. [doi]
- On the influence of the delta coefficients in a HMM-based speech recognition systemFabrice Lefèvre, Claude Montacié, Marie-José Caraty. [doi]
- Pronunciation modeling for large vocabulary conversational speech recognitionKristine W. Ma, George Zavaliagkos, Rukmini Iyer. [doi]
- On the convergence of Gaussian mixture models: improvements through vector quantizationJames Moody, Stefan Slomka, Jason W. Pelecanos, Sridha Sridharan. [doi]
- Improving speaker recognisability in phonetic vocodersCarlos M. Ribeiro, Isabel Trancoso. [doi]
- Improving speaker identification performance in reverberant conditions using lip informationTim Wark, Sridha Sridharan. [doi]
- The IBM trainable speech synthesis systemRobert E. Donovan, Ellen Eide. [doi]
- Automatic recognition of spontaneous speech dialoguesMauro Cettolo, Daniele Falavigna. [doi]
- Are you my little pussy-cat? acoustic, phonetic and affective qualities of infant- and pet-directed speechDenis Burnham, Elizabeth Francis, Ute Vollmer-Conna, Christine Kitamura, Vicky Averkiou, Amanda Olley, Mary Nguyen, Cal Paterson. [doi]
- Confidence measures for HMM-based speech recognitionDaniel Willett, Andreas Worm, Christoph Neukirchen, Gerhard Rigoll. [doi]
- Data-driven extensions to HMM statistical dependenciesJeff A. Bilmes. [doi]
- Improving posterior based confidence measures in hybrid HMM/ANN speech recognition systemsGiulia Bernardis, Hervé Bourlard. [doi]
- Now you hear it, now you don t: empirical studies of audio browsing behavior behaviorChristine H. Nakatani, Steve Whittaker, Julia Hirschberg. [doi]
- Efficient adaptation of TTS duration model to new speakersChilin Shih, Wentao Gu, Jan P. H. van Santen. [doi]
- Automatic detection of landmark for nasal consonants from speech waveformLimin Du, Kenneth N. Stevens. [doi]
- Towards speech understanding across multiple languagesTodd Ward, Salim Roukos, Chalapathy Neti, Jerome Gros, Mark Epstein, Satya Dharanipragada. [doi]
- Example-based error recovery method for speech translation: repairing sub-trees according to the semantic distanceKai Ishikawa, Eiichiro Sumita, Hitoshi Iida. [doi]
- An interlingua based on domain actions for machine translation of task-oriented dialoguesLori S. Levin, Donna Gates, Alon Lavie, Alex Waibel. [doi]
- Two automatic approaches for analyzing connected speech processes in dutchMirjam Wester, Judith M. Kessens, Helmer Strik. [doi]
- Periodicity emphasis of voice wave using nonlinear IIR digital filters and its applicationsHiroyuki Kamata, Akira Kaneko, Yoshihisa Ishida. [doi]
- Efficient computation of MMI neural networks for large vocabulary speech recognition systemsJörg Rottland, Andre Ludecke, Gerhard Rigoll. [doi]
- Phonological similarity effects in Cantonese spoken-word processingMichael C. W. Yip, Po-Yee Leung, Hsuan-Chih Chen. [doi]
- Automatic grammar induction from semantic parsingDebajit Ghosh, David Goddeau. [doi]
- Speaker-independent upfront dialect adaptation in a large vocabulary continuous speech recognizerVolker Fischer, Yuqing Gao, Eric Janke. [doi]
- Speech production of vowel sequences using a physiological articulatory modelJianwu Dang, Kiyoshi Honda. [doi]
- Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection techniqueYukiko Fujisawa, Nobuaki Minematsu, Seiichi Nakagawa. [doi]
- Enhanced ASR by acoustic feature filteringChristian Wellekens. [doi]
- Robust speech recognition using HMM s with toeplitz state covariance matricesWilliam J. J. Roberts, Yariv Ephraim. [doi]
- Perceptual properties of Russians with Japanese fricativesSeiya Funatsu, Shigeru Kiritani. [doi]
- Contextual effects on voicing profiles of German and Mandarin consonantsChilin Shih, Bernd Möbius. [doi]
- Articulatory analysis using a codebook for articulatory based low bit-rate speech codingCarlos Silva, Samir Chennoukh. [doi]
- Automatic rule generation for linguistic features analysis using inductive learning technique: linguistic features analysis in TOS drive TTS systemShigenobu Seto, Masahiro Morita, Takehiko Kagoshima, Masami Akamine. [doi]
- SEMOLE: a robust framework for gathering information from the world wide webHyung Jin Kim, I. Lee Hetherington. [doi]
- Efficient lattice representation and generationFuliang Weng, Andreas Stolcke, Ananth Sankar. [doi]
- Regional variation in the vowels of female adolescents from sydneyFelicity Cox, Sallyanne Palethorpe. [doi]
- The effect of fundamental frequency on Mandarin speech recognitionSharlene Liu, Sean Doyle, Allen Morris, Farzad Ehsani. [doi]
- A hierarchical language model for CSRFrancisco J. Valverde-Albacete, José Manuel Pardo. [doi]
- Grammar fragment acquisition using syntactic and semantic clusteringKazuhiro Arai, Jeremy H. Wright, Giuseppe Riccardi, Allen L. Gorin. [doi]
- Robust speech activity detection in the presence of noiseRuhi Sarikaya, John H. L. Hansen. [doi]
- Evaluation of model adaptation by HMM decomposition on telephone speech recognitionTetsuya Takiguchi, Satoshi Nakamura, Kiyohiro Shikano, Masatoshi Morishima, Toshihiro Isobe. [doi]
- An algorithm for choosing Japanese acknowledgments using prosodic cues and contextWataru Tsukahara. [doi]
- Reconstructing the tongue surface from six cross-sectional contours: ultrasound dataAndrew J. Lundberg, Maureen Stone. [doi]
- Robust automatic speech recognition by the application of a temporal-correlation-based recurrent multilayer neural network to the mel-based cepstral coefficientsMichel Héon, Hesham Tolba, Douglas D. O Shaughnessy. [doi]
- On the significance of temporal masking in speech codingJan Skoglund, W. Bastiaan Kleijn. [doi]
- Speech enhancement using STC-based bandwidth extensionJulien Epps, W. Harvey Holmes. [doi]
- An annotation system for melodic aspects of German spontaneous speechChristel Brindöpke, Brigitte Schaffranietz. [doi]
- A nonlinear unit selection strategy for concatenative speech synthesis based on syllable level featuresMartin Holzapfel, Nick Campbell. [doi]
- Usability evaluation of IVR systems with DTMF and ASRCristina Delogu, Andrea Di Carlo, Paolo Rotundi, Danilo Sartori. [doi]
- The differential status of semivowels in the acoustic phonetic realisation of tonePhil Rose. [doi]
- On robust speech analysis based on time-varying complex AR modelKeiichi Funaki, Yoshikazu Miyanaga, Koji Tochinai. [doi]
- The acquisition of putonghua phonologyLydia K. H. So, Zhou Jing. [doi]
- Speech recognition via phonetically featured syllablesSimon King, Todd A. Stephenson, Stephen Isard, Paul Taylor, Alex Strachan. [doi]
- Generating pitch accents in a concept-to-speech system using a knowledge baseSandra Williams. [doi]
- A kinematic analysis of new zealand and australian English vowel spacesCatherine I. Watson, Jonathan Harrington, Sallyanne Palethorpe. [doi]
- Acoustic and perceptual characteristic of Italian stop consonantsLoredana Cerrato, Mauro Falcone. [doi]
- Grammatical and statistical word prediction system for Spanish integrated in an aid for people with disabilitiesSira E. Palazuelos, Santiago Aguilera, José Rodrigo, Juan Ignacio Godino-Llorente. [doi]
- Speech communication profiles across the adult lifespan: persons without self-identified hearing impairmentM. F. Cheesman, K. L. Smilsky, T. M. Major, F. Lewis, L. M. Boorman. [doi]
- An analysis of the timing of turn-taking in a corpus of goal-oriented dialogueMatthew Bull, Matthew P. Aylett. [doi]
- Temporal variables in lectures in the Japanese languageMichiko Watanabe. [doi]
- Eigenvoices for speaker adaptationRoland Kuhn, Patrick Nguyen, Jean-Claude Junqua, Lloyd Goldwasser, Nancy Niedzielski, Steven Fincke, Ken Field, Matteo Contolini. [doi]
- A*-admissible key-phrase spotting with sub-syllable level utterance verificationBerlin Chen, Hsin-Min Wang, Lee-Feng Chien, Lin-Shan Lee. [doi]
- Do phonetic features help to improve consonant identification in ASR?Jacques C. Koreman, Bistra Andreeva, William J. Barry. [doi]
- The CHAM model of hyperarticulate adaptation during human-computer error resolutionSharon L. Oviatt. [doi]
- Initial speech recognition results using the multinet architectureEdnaldo Brigante Pizzolato, T. Jeff Reynolds. [doi]
- Multi-phone strings as subword units for speech recognitionPhilip O Neill, Saeed Vaseghi, Bernard Doherty, Wooi-Haw Tan, Paul M. McCourt. [doi]
- Robust interpretation for spoken dialogue systemsLena Strömbäck, Arne Jönsson. [doi]
- How a French TTS system can describe loanwordsFrédérique Sannier, Rabia Belrhali, Véronique Aubergé. [doi]
- Orthografik inkoncistensy ephekts in foneme detektion?Anne Cutler, Rebecca Treiman, Brit van Ooijen. [doi]
- FEM analysis of aspirated air flow in three-dimensional vocal tract during fricative consonant phonationTakuya Niikawa, Masafumi Matsumura, Takashi Tachimura, Takeshi Wada. [doi]
- Evaluation and integration of neural-network training techniques for continuous digit recognitionJohn-Paul Hosom, Ronald A. Cole, Piero Cosi. [doi]
- Training of context-dependent subspace distribution clustering hidden Markov modelBrian Mak, Enrico Bocchieri. [doi]
- A minimax search algorithm for CDHMM based robust continuous speech recognitionHui Jiang, Keikichi Hirose, Qiang Huo. [doi]
- A schema for illocutionary act identification with prosodic featureMasafumi Tamoto, Takeshi Kawabata. [doi]
- Towards a minimal standard for dialogue transcripts: a new SGML architecture for the HCRC map task corpusAmy Isard, David McKelvie, Henry S. Thompson. [doi]
- Stochastic calculus, non-linear filtering, and the internal model principle: implications for articulatory speech recognitionGordon Ramsay. [doi]
- Creating a mexican Spanish version of the CSLU toolkitBen Serridge, Alejandro Barbosa, Ronald A. Cole, Nora Munive, Alcira Vargas. [doi]
- Text-independent speaker verification using automatically labelled acoustic segmentsDijana Petrovska-Delacrétaz, Jan Cernocký, Jean Hennebert, Gérard Chollet. [doi]
- Designing a multimodal dialogue system for information retrievalSadaoki Furui, Koh ichiro Yamaguchi. [doi]
- A method for measuring the intelligibility and nonnativeness of phone quality in foreign language pronunciation trainingGoh Kawai, Keikichi Hirose. [doi]
- On-line hierarchical transformation of hidden Markov models for speaker adaptationJen-Tzung Chien. [doi]
- Letter to sound rules for accented lexicon compressionVincent Pagel, Kevin A. Lenzo, Alan W. Black. [doi]
- German regional variants - a problem for automatic speech recognition?Nicole Beringer, Florian Schiel, Peter Regel-Brietzmann. [doi]
- Recovering vocal tract shapes from MFCC parametersSorin Dusan, Li Deng. [doi]
- Toward Markov random field modeling of speechGuillaume Gravier, Marc Sigelle, Gérard Chollet. [doi]
- Speech recognition from GSM codec parametersJuan M. Huerta, Richard M. Stern. [doi]
- Prosodic structure in Japanese spontaneous speechYasuo Horiuchi, Akira Ichikawa. [doi]
- Analysis of effects of lexical accent, syntax, and global speech rate upon the local speech rateSumio Ohno, Hiroya Fujisaki, Hideyuki Taguchi. [doi]
- Unsupervised training of HMMs with variable number of mixture components per stateCesar Martín del Alamo, Luis Villarrubia, Francisco Javier Gonzalez, Luis A. Hernández Gómez. [doi]
- Improving accuracy of telephony-based, speaker-independent speech recognitionDaniel Azzopardi, Shahram Semnani, Ben Milner, Richard Wiseman. [doi]
- Spoken L2 teaching with contrastive visual and auditory feedbackAnne-Marie Öster. [doi]
- Acoustic qualities of IDS and ADS in ThaiChayada Thanavisuth, Sudaporn Luksaneeyanawin. [doi]
- Time dependent language model for broadcast news transcription and its post-correctionAkio Kobayashi, Kazuo Onoe, Toru Imai, Akio Ando. [doi]
- How to handle foreign sounds in Swedish text-to-speech conversion: approaching the xenophone problemRobert Eklund, Anders Lindström. [doi]
- Hierarchical temporal decomposition: a novel approach to efficient compression of spectral characteristics of speechShahrokh Ghaemmaghami, Mohamed Deriche, Sridha Sridharan. [doi]
- A thesaurus-based statistical language model for broadcast news transcriptionAkio Ando, Akio Kobayashi, Toru Imai. [doi]
- The influence of syllable structure on the timing of intonational events in GermanHansjörg Mixdorff, Hiroya Fujisaki. [doi]
- Suprasegmental duration modelling with elastic constraints in automatic speech recognitionLaurence Molloy, Stephen Isard. [doi]
- The maximum-based description of F0 contours and its application to EnglishThomas Portele, Barbara Heuft. [doi]
- An acoustic analysis of vowel production across tasks in a case of non-fluent progressive aphasiaKaren Croot. [doi]
- Context-dependent duration modelling for continuous speech recognitionTan Lee, Rolf Carlson, Björn Granström. [doi]
- On the interaction between time and frequency filtering of speech parameters for robust speech recognitionDusan Macho, Climent Nadeu. [doi]
- On the learnability of the voicing contrast for initial stopsRobert I. Damper, Steve R. Gunn. [doi]
- Source controlled variable bit-rate speech coder based on waveform interpolationF. Plante, Barry M. G. Cheetham, D. Marston, P. A. Barrett. [doi]
- The relation between vocal tract shape and formant frequencies can be described by means of a system of coupled differential equationsJean Schoentgen, Alain Soquet, Véronique Lecuit, Sorin Ciocea. [doi]
- A discourse coding scheme for conversational SpanishLori S. Levin, Ann E. Thymé-Gobbel, Alon Lavie, Klaus Ries, Klaus Zechner. [doi]
- New features for confidence annotationDhananjay Bansal, Mosur K. Ravishankar. [doi]
- Effects of contrastive focal accent on linguopalatal articulation and coarticulation in the French [kskl] clusterYohann Meynadier, Michel Pitermann, Alain Marchal. [doi]
- End-user driven dialogue system design: the reward experienceKlaus Failenschmid, J. H. Simon Thornton. [doi]
- Evidence for early effects of sentence context on word segmentationSaskia te Riele, Hugo Quené. [doi]
- Web-based educational tools for speech technologyKåre Sjölander, Jonas Beskow, Joakim Gustafson, Erland Lewin, Rolf Carlson, Björn Granström. [doi]
- Use of high-level linguistic constraints for constructing feature-based phonological model in speech recognitionJiping Sun, Li Deng. [doi]
- On the importance of components of the modulation spectrum for speaker verificationSarel Van Vuuren, Hynek Hermansky. [doi]
- A multilingual prosodic databaseEstelle Campione, Jean Véronis. [doi]
- More evidence for the perceptual basis of sound change? suprasegmental effects in the development of distinctive nasalizationJohn Hajek, Ian Watson. [doi]
- A computational memory and processing model for prosodyJanet E. Cahn. [doi]
- How effective is unsupervised data collection for children s speech recognition?Gregory Aist, Peggy Chan, Xuedong Huang, Li Jiang, Rebecca Kennedy, DeWitt Latimer IV, Jack Mostow, Calvin Yeung. [doi]
- Perception of tonal rises and falls for accentuation and phrasing in SwedishDavid House, Dik J. Hermes, Frédéric Beaugendre. [doi]
- Acoustic speech recognition model by neural net equation with competition and cooperationTetsuro Kitazoe, Tomoyuki Ichiki, Sung-Ill Kim. [doi]
- Speech perception in dyslexia: measurements from birth onwardsFlorien J. Koopmans-van Beinum, Caroline E. Schwippert, Cecile T. L. Kuijpers. [doi]
- Concept-driven speech understanding incorporated with a statistic language modelAkito Nagai, Yasushi Ishikawa. [doi]
- An effective quality evaluation protocol for speech enhancement algorithmsJohn H. L. Hansen, Bryan L. Pellom. [doi]
- Toward on-line learning of Chinese continuous speech recognition systemRong Zheng, Zuoying Wang. [doi]
- Automatic identification of command boundaries in a conversational natural language user interfaceGanesh N. Ramaswamy, Jan Kleindienst. [doi]
- Phonological rules for enhancing acoustic enrollment of unknown wordsBhuvana Ramabhadran, Abraham Ittycheriah. [doi]
- A nonstationary autoregressive HMM with gain adaptation for speech recognitionKi Yong Lee, Joohun Lee. [doi]
- Disambiguation of Korean utterances using automatic intonation recognitionTae-Yeoub Jang, Minsuck Song, Kiyeong Lee. [doi]
- Overview of the maya spoken language systemSimon Downey, Andrew P. Breen, Maria Fernández, Edward Kaneen. [doi]
- Situated dialogue coordination for spoken dialogue systemsMichio Okada, Noriko Suzuki, Jacques M. B. Terken. [doi]
- An undergraduate course on speech recognition based on the CSLU toolkitBen Serridge. [doi]
- Spoken dialogue system using corpus-based hidden Markov modelChung-Hsien Wu, Gwo-Lang Yan, Chien-Liang Lin. [doi]
- Signal extraction from noisy signal based on auditory scene analysisMasashi Unoki, Masato Akagi. [doi]
- Comparison of language modelling techniques for Russian and EnglishEdward W. D. Whittaker, Philip C. Woodland. [doi]
- A VQ based speaker recognition system based in histogram distances. text independent and for noisy environmentsEnric Monte, Ramon Arqué, Xavier Miró. [doi]
- Speech feature modeling for robust stressed speech recognitionSahar E. Bou-Ghazale, John H. L. Hansen. [doi]
- Compression algorithm of trigram language models based on maximum likelihood estimationNorimichi Yodo, Kiyohiro Shikano, Satoshi Nakamura. [doi]
- Design of cochlear implant device for transmitting voice pitch information in speech sound of asian languagesShizuo Hiki, Kazuya Imaizumi, Yumiko Fukuda. [doi]
- Effects of using speech in timetable information systems for WWWPernilla Qvarfordt, Arne Jönsson. [doi]
- A mixed-excitation frequency domain model for time-scale pitch-scale modification of speechAlex Acero. [doi]
- Word verification using confidence measures in speech recognitionM. Carmen Benítez, Antonio J. Rubio, Pedro García, Jesús E. Díaz-Verdejo. [doi]
- Telephone-based conversational speech recognition in the JUPITER domainJames R. Glass, Timothy J. Hazen. [doi]
- Language development after extreme childhood deprivation: a case studyLisa-Jane Brown, John Locke, Peter Jones, Sandra P. Whiteside. [doi]
- A bootstrap training approach for language model classifiersVolker Warnke, Elmar Nöth, Jan Buckow, Stefan Harbeck, Heinrich Niemann. [doi]
- Analytic generation of synthesis units by closed loop training for totally speaker driven text to speech system (TOS drive TTS)Masami Akamine, Takehiko Kagoshima. [doi]
- A speechreading aid based on phonetic ASRPaul Duchnowski, Louis Braida, Maroula Bratakos, David Lum, Matthew Sexton, Jean Krause. [doi]
- Noise robust two-stream auditory feature extraction method for speech recognitionJilei Tian, Ramalingam Hariharan, Kari Laurila. [doi]
- Some developmental patterns in the speech of 6-, 8- and 10-year old children: an acoustic phonetic studySandra P. Whiteside, Carolyn Hodgson. [doi]
- Factors affecting speech retrievalCorinna Ng, Ross Wilkinson, Justin Zobel. [doi]
- Growth transform of a sum of rational functions and its application in estimating HMM parametersXiaoqiang Luo. [doi]
- Assessing high-level language in individuals with multiple sclerosis: a pilot studyKarin Brunnegaard, Katja Laakso, Lena Hartelius, Elisabeth Ahlsen. [doi]
- A comparison of fusion techniques in mel-cepstral based speaker identificationStefan Slomka, Sridha Sridharan, Vinod Chandran. [doi]
- A pressure sensitive palatography: application of new pressure sensitive sheet for measuring tongue-palatal contact pressureMasahiko Wakumoto, Shinobu Masaki, Kiyoshi Honda, Toshikazu Ohue. [doi]
- Some acoustic characteristics of emotionCecile Pereira, Catherine I. Watson. [doi]
- Modelling spoken dialogues with state transition diagrams: experiences with the CSLU toolkitMichael F. McTear. [doi]
- Automatic recognition of Korean broadcast news speechHa-Jin Yu, Hoon Kim, Jae-Seung Choi, Joon-Mo Hong, Kew-Suh Park, Jong-Seok Lee, Hee-Youn Lee. [doi]
- Non-adjacent segmental effects in tonal realization of accentual phrase in seoul KoreanHyuck-Joon Lee. [doi]
- Speech analysis by subspace methods of spectral line estimationNajam Malik, W. Harvey Holmes. [doi]
- On the limitations of stochastic conceptual finite-state language models for speech understandingJosé Colás, Javier Ferreiros, Juan Manuel Montero, Julio Pastor, Ascensión Gallardo-Antolín, José Manuel Pardo. [doi]
- The voicing feature for stop consonants: acoustic phonetic analyses and automatic speech recognition experimentsPadma Ramesh, Partha Niyogi. [doi]
- Pacing spoken directions to suit the listenerTatsuya Iwase, Nigel Ward. [doi]
- A synthesis-oriented model of phrasal pitch movements in standard ChineseJinfu Ni, Goh Kawai, Keikichi Hirose. [doi]
- A generic algorithm for generating spoken monologuesEsther Klabbers, Emiel Krahmer, Mariët Theune. [doi]
- The design of a multi-domain Mandarin Chinese spoken dialogue systemYi-Chung Lin, Tung-Hui Chiang, Huei-Ming Wang, Chung-Ming Peng, Chao-Huang Chang. [doi]
- Spectral sequence compensation based on continuity of spectral sequenceMasato Akagi, Mamoru Iwaki, Noriyoshi Sakaguchi. [doi]
- Improved robust speech recognition considering signal correlation approximated by taylor seriesJia-lin Shen, Jeih-Weih Hung, Lin-Shan Lee. [doi]
- Probabilistic modeling with Bayesian networks for automatic speech recognitionGeoffrey Zweig, Stuart J. Russell. [doi]
- On the structure of vowel space: a genealogy of general phonetic conceptsHendrik F. V. Boshoff, Elizabeth C. Botha. [doi]
- Robust speaker verification insensitive to session-dependent utterance variation and handset-dependent distortionTomoko Matsui, Kiyoaki Aikawa. [doi]
- Assessment of dutch pronunciation by means of automatic speech recognition technologyCatia Cucchiarini, Febe de Wet, Helmer Strik, Lou Boves. [doi]
- Quantification of pharyngeal articulations using measurements from laryngoscopic imagesJohn H. Esling, Jocelyn Clayards, Jerold A. Edmondson, Qiu Fuyuan, Jimmy G. Harris. [doi]
- Phonetic and phonological characteristics of paralinguistic information in spoken JapaneseKikuo Maekawa. [doi]
- A duration-based confidence measure for automatic segmentation of noise corrupted speechBryan L. Pellom, John H. L. Hansen. [doi]
- Towards a reversible symbolic coding of intonationJean Véronis, Estelle Campione. [doi]
- Time shift invariant speech recognitionSankar Basu, Abraham Ittycheriah, Stéphane H. Maes. [doi]
- The automatic marking of prominence in spontaneous speech using duration and part of speech informationMatthew P. Aylett, Matthew Bull. [doi]
- Speech recognition in car noise environments using multiple models according to noise masking levelsMyung Gyu Song, Hoi In Jung, Kab-Jong Shim, Hyung Soon Kim. [doi]
- A new method to achieve fast acoustic matching for speech recognitionClark Z. Lee, Douglas D. O Shaughnessy. [doi]
- Automatic language recognition using high-order HMMsJohan A. du Preez, D. M. Weber. [doi]
- Acoustic observation context modeling in segment based speech recognitionMate Szarvas, Shoichi Matsunaga. [doi]
- Improving the generalization performance of the MCE/GPD learningHiroshi Shimodaira, Jun Rokui, Mitsuru Nakai. [doi]
- Speaker clustering using direct maximisation of the MLLR-adapted likelihoodSue E. Johnson, Philip C. Woodland. [doi]
- Statistical modeling of pronunciation and production variations for speech recognitionFilipp Korkmazskiy, Biing-Hwang Juang. [doi]
- Rejection in speech recognition systems with limited trainingAruna Bayya. [doi]
- The REWARD service creation environment. an overviewTom Brøndsted, Bo Nygaard Bai, Jesper Østergaard Olsen. [doi]
- Speech intelligibility derived from exceedingly sparse spectral informationSteven Greenberg, Takayuki Arai, Rosaria Silipo. [doi]
- Special speech registers: talking to australian and Thai infants, and to petsDenis Burnham. [doi]
- Interfacing acoustic models with natural language processing systemsMichael T. Johnson, Mary P. Harper, Leah H. Jamieson. [doi]
- Context sensitive generation of descriptionsEmiel Krahmer, Mariët Theune. [doi]
- High quality text-to-speech system in Spanish for handicapped peopleFernando Lacunza, Yolanda Blanco. [doi]
- Neural network motivation for segmental distributionEric Keller. [doi]
- Prosynth: an integrated prosodic approach to device-independent, natural-sounding speech synthesisSarah Hawkins, Jill House, Mark Huckvale, John Local, Richard Ogden. [doi]
- Transform coding of LSF parameters using waveletsDavor Petrinovic. [doi]
- STAMP: a suite of tools for analyzing multimodal system processingJosh Clow, Sharon L. Oviatt. [doi]
- Computer-based second language production training by using spectrographic representation and HMM-based speech recognition scoresReiko Akahane-Yamada, Erik McDermott, Takahiro Adachi, Hideki Kawahara, John S. Pruitt. [doi]
- Multi-resolution for speech analysisMarie-José Caraty, Claude Montacié. [doi]
- EGG model of ditoneme in MandarinJiangping Kong. [doi]
- Multi-dimensional scaling of listener responses to complex auditory stimuliZinny S. Bond, Donald Fucci, Verna Stockmal, Douglas McColl. [doi]
- The demiphone versus the triphone in a decision-tree state-tying frameworkJosé B. Mariño, Pau Pachès-Leal, Albino Nogueiras. [doi]
- A syllable-based Chinese spoken dialogue system for telephone directory services primarily trained with a corpusYen-Ju Yang, Lin-Shan Lee. [doi]
- Speech recognition using the probabilistic neural networkRaymond Low, Roberto Togneri. [doi]
- Estimating entropy of a language from optimal word insertion penaltyKazuya Takeda, Atsunori Ogawa, Fumitada Itakura. [doi]
- Relationship between lip shapes and acoustical characteristics during speechKeisuke Mori, Yorinobu Sonoda. [doi]
- Wavelet-based energy binning cepstral features for automatic speech recognitionSankar Basu, Stéphane H. Maes. [doi]
- Spectral noise subtraction with recursive gain curvesKlaus Linhard, Tim Haulick. [doi]
- Speech, silence, music and noise classification of TV broadcast materialAra Samouelian, Jordi Robert-Ribes, Mike Plumpe. [doi]
- An efficient mel-LPC analysis method for speech recognitionHiroshi Matsumoto, Yoshihisa Nakatoh, Yoshinori Furuhata. [doi]
- Effect of task complexity on search strategies for the motorola lexicus continuous speech recognition systemSreeram V. Balakrishnan. [doi]
- A 16 kbit/s wideband CELP coder using MEL-generalized cepstral analysis and its subjective evaluationKazuhito Koishida, Gou Hirabayashi, Keiichi Tokuda, Takao Kobayashi. [doi]
- Exploration of acoustic correlates in speaker selection for concatenative synthesisAnn K. Syrdal, Alistair Conkie, Yannis