EUROSPEECH 2001 Scandinavia, 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, Aalborg, Denmark, September 3-7, 2001

researchr

You are not signed in
Sign in
Sign up

Paul Dalsgaard, Børge Lindberg, Henrik Benner, Zheng-Hua Tan, editors, EUROSPEECH 2001 Scandinavia, 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, Aalborg, Denmark, September 3-7, 2001. ISCA, 2001.

Conference: interspeech2001

Abstract is missing.

Mobile futureYrjö Neuvo. [doi]

Acquiring and implementing phonetic knowledgeLouis C. W. Pols. [doi]

How visual co-presence and joint attention shape speakingSusan E. Brennan. [doi]

Universities and industry: marriage or co-operation between independent partners?Ilkka Niiniluoto. 11-12 [doi]

Considerations on what industry expects from universitiesYrjö Neuvo. 13-14 [doi]

A perspective on industry/university relationships in the USGary W. Strong. 15-16 [doi]

ELRA contribution to bridge the gap between industry and academiaKhalid Choukri. 17-18 [doi]

Combining word- and class-based language models: a comparative study in several languages using automatic and manual word-clustering techniquesGiulio Maltese, P. Bravetti, H. Crépy, B. J. Grainger, M. Herzog, Francisco Palou. 21-24 [doi]

Multi-class composite n-gram language model using multiple word clusters and word successionsShuntaro Isogai, Katsuhiko Shirai, Hirofumi Yamamoto, Yoshinori Sagisaka. 25-28 [doi]

Statistical language model based on a hierarchical approach: MCnvImed Zitouni, Kamel Smaïli, Jean-Paul Haton. 29-32 [doi]

Quantization-based language model compressionEdward W. D. Whittaker, Bhiksha Raj. 33-36 [doi]

Relations between vocal registers in voice breaksGerrit Bloothooft, Mieke van Wijck, Peter Pabon. 39-42 [doi]

A quasi-one-dimensional model of aerodynamic and acoustic flow in the time-varying vocal tract: source and excitation mechanismsGordon Ramsay. 43-46 [doi]

Spectral correlates of voice open quotient and glottal flow asymmetry : theory, limits and experimental dataNathalie Henrich, Christophe d Alessandro, Boris Doval. 47-50 [doi]

One-delayed-mass model for efficient synthesis of glottal flowFederico Avanzini, Paavo Alku, Matti Karjalainen. 51-54 [doi]

Modeling pronunciation variation using context-dependent weighting and b/s refined acoustic modelingThomas Fang Zheng, Zhanjiang Song, Pascale Fung, William Byrne. 57-60 [doi]

Learning units for domain-independent out-of- vocabulary word modellingIssam Bazzi, James R. Glass. 61-64 [doi]

Pronunciation variant analysis using speaking style parallel corpusHideharu Nakajima, Izumi Hirano, Yoshinori Sagisaka, Katsuhiko Shirai. 65-68 [doi]

Speech recognition for huge vocabularies by using optimized sub-word unitsJan Kneissler, Dietrich Klakow. 69-72 [doi]

Factors affecting schwa-insertion in final consonant clusters in standard dutchMarc Swerts, Hanne Kloots, Steven Gillis, Georges De Schutter. 75-78 [doi]

Vowel height is intimately associated with stress accent in spontaneous american English discourseLeah Hitchcock, Steven Greenberg. 79-82 [doi]

Finite state prosodic analysis of african corpus resourcesDafydd Gibbon. 83-86 [doi]

Acoustic correlates of emotion dimensions in view of speech synthesisMarc Schröder, Roddy Cowie, Ellen Douglas-Cowie, Machiel Westerdijk, Stan C. A. M. Gielen. 87-90 [doi]

Measuring pitch rangeHanny den Ouden, Jacques M. B. Terken. 91-94 [doi]

Measuring speech rhythmDafydd Gibbon, Ulrike Gut. 95-98 [doi]

Tonal alignment, scaling and slope in Italian question and statement tunesMariapaola D Imperio. 99-102 [doi]

Pragmatic temporal voice range profile as a tool in the research of speech stylesAntti Iivonen. 103-106 [doi]

Model based stress decision methodWooil Kim, Taeyun Kim, Sungjoo Ahn, Hanseok Ko. 107-110 [doi]

Reduction of alternative pronunciations in the norwegian computational lexicon norkompleksTorbjørn Nordgård, Arne Kjell Foldvik. 111-114 [doi]

The role of duration as a correlate of accent in lekeitio basqueGorka Elordieta, José Ignacio Hualde. 115-118 [doi]

Word final aspiration as a phrase boundary cue: data from spontaneous Swedish discourseVictoria Johansson, Merle Horne, Sven Strömqvist. 119-122 [doi]

Study and auto-detection of stress based on tonal pitch range in MandarinXipeng Shen, Bo Xu. 123-126 [doi]

Classifying emotions in speech: a comparison of methodsNoam Amir, Ori Kerret, Dimitry Karlinski. 127-130 [doi]

Development of vowel quantity perception in late childhoodDawn M. Behne, Peter E. Czigler, Kirk P. H. Sullivan. 133-136 [doi]

Japanese can be aware of syllables and morae: evidence from Japanese-English bilingual childrenTakashi Otake, Yuka Yamaguchi. 141-144 [doi]

Neural processes underlying perceptual learning of a difficult second language phonetic contrastDaniel Callan, Keiichi Tajima, Akiko Callan, Reiko Akahane-Yamada, Shinobu Masaki. 145-148 [doi]

Human language identification with reduced segmental information: comparison between monolinguals and bilingualsMasahiko Komatsu, Kazuya Mori, Takayuki Arai, Yuji Murahara. 149-152 [doi]

Coarticulatory effects in perceptionSantiago Fernández, Sergio Feijóo. 155-158 [doi]

A case for multi-resolution auditory scene analysisSue Harding, Georg Meyer. 159-162 [doi]

Perceptual identification and normalization of synthesized French vowels from birth to adulthoodLucie Ménard, Jean-Luc Schwartz, Louis-Jean Boë, Sonia Kandel, Nathalie Vallée. 163-166 [doi]

Perceptual categorization of maximal vowel spaces from birth to adulthood simulated by an articulatory modelLucie Ménard, Louis-Jean Boë. 167-170 [doi]

A study on speech over the telephone and agingMaxine Eskenazi, Alan W. Black. 171-174 [doi]

On the perception of voicing for plosives in noiseMarcia Chen, Abeer Alwan. 175-178 [doi]

Predicting visual consonant perception from physical measuresJintao Jiang, Abeer Alwan, Edward T. Auer, Lynne E. Bernstein. 179-182 [doi]

Noise robust feature extraction for ASR using the Aurora 2 databaseQifeng Zhu, Markus Iseli, Xiaodong Cui, Abeer Alwan. 185-188 [doi]

Investigations into tandem acoustic modeling for the Aurora taskDaniel P. W. Ellis, Manuel J. Reyes Gomez. 189-192 [doi]

Recognition performance of the siemens front-end with and without frame dropping on the Aurora 2 databaseBernt Andrassy, Damjan Vlaj, Christophe Beaugeant. 193-196 [doi]

A multiconditional robust front-end feature extraction with a noise reduction procedure based on improved spectral subtraction algorithmBojan Kotnik, Zdravko Kacic, Bogomir Horvat. 197-200 [doi]

Feature vector selection to improve ASR robustness in noisy conditionsJohan de Veth, Laurent Mauuary, Bernhard Noé, Febe de Wet, Jürgen Sienel, Lou Boves, Denis Jouvet. 201-204 [doi]

Comparison of spectral derivative parameters for robust speech recognitionDusan Macho, Climent Nadeu. 205-208 [doi]

Robust digit recognition in noise: an evaluation using the AURORA corpusUmit H. Yapanel, John H. L. Hansen, Ruhi Sarikaya, Bryan L. Pellom. 209-212 [doi]

Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noiseJon Barker, Martin Cooke, Phil D. Green. 213-217 [doi]

Evaluation of the SPLICE algorithm on the Aurora2 databaseJasha Droppo, Li Deng, Alex Acero. 217-220 [doi]

Model-based compensation of the additive noise for continuous speech recognition. experiments using the Aurora II database and tasksJosé C. Segura, Ángel de la Torre, M. Carmen Benítez, Antonio M. Peinado. 221-224 [doi]

MAP combination of multi-stream HMM or HMM/ANN expertsAndrew C. Morris, Astrid Hagen, Hervé Bourlard. 225-228 [doi]

Second order statistics spectrum estimation method for robust speech recognitionBojan Jarc, Rudolf Babic. 229-232 [doi]

Feature extraction and model-based noise compensation for noisy speech recognition evaluated on AURORA 2 taskKaisheng Yao, Jingdong Chen, Kuldip K. Paliwal, Satoshi Nakamura. 233-236 [doi]

Broadcast news LM adaptation using contemporary textsMarcello Federico, Nicola Bertoldi. 239-242 [doi]

Topic detection for language model adaptation of highly-inflected languages by using a fuzzy comparison functionMirjam Sepesy Maucec, Zdravko Kacic. 243-246 [doi]

Efficient stochastic finite-state networks for language modelling in spoken dialogue systemsKallirroi Georgila, Nikos Fakotakis, George K. Kokkinakis. 247-250 [doi]

Language models conditioned on dialog stateKarthik Visweswariah, Harry Printz. 251-254 [doi]

Using information retrieval methods for language model adaptationLangzhou Chen, Jean-Luc Gauvain, Lori Lamel, Gilles Adda, Martine Adda-Decker. 255-258 [doi]

Making the tongue model talk: merging MRI & EMA measurementsOlov Engwall. 261-264 [doi]

The relationship between intraoral air pressure and tongue/palate contact during the articulation of norwegian /t/ and /d/Inger Moen, Hanne Gram Simonsen, Morten Huseby, John Grue. 265-268 [doi]

Mechanical versus perceptual constraints as determinants of articulatory strategyAhmed M. Elgendy, Louis C. W. Pols. 269-272 [doi]

Pre-liquid excrescent schwa: what happens when vocalic targets conflictBryan Gick, Ian Wilson. 273-276 [doi]

Exploring the null space of the acoustic-to- articulatory inversion using a hypercube codebookSlim Ouni, Yves Laprie. 277-280 [doi]

Phoneme-based topic spotting on the switchboard corpusM. W. Theunissen, K. Scheffler, Johan A. du Preez. 283-286 [doi]

Topic styles in IR and TDT: effect on system behaviorMartin Franz, J. Scott McCarley, Todd Ward, Wei-Jing Zhu. 287-290 [doi]

Extracting caller information from voicemailGeoffrey Zweig, Jing Huang, Mukund Padmanabhan. 291-294 [doi]

A portability study on natural language call steeringHong-Kwang Jeff Kuo, Chin-Hui Lee. 295-298 [doi]

Improved spoken document retrieval by exploring extra acoustic and linguistic cuesBerlin Chen, Hsin-Min Wang, Lin-Shan Lee. 299-302 [doi]

Native vs non-native production of English vowels in spontaneous speech: an acoustic phonetic studyKimiko Tsukada. 305-308 [doi]

Is non-native pronunciation modelling necessary ?Silke Goronzy, Marina Sahakyan, Wolfgang Wokurek. 309-312 [doi]

Burst segmentation and evaluation of acoustic cuesYves Laprie, Anne Bonneau. 313-316 [doi]

A testbed for developing multilingual phonotactic descriptionsSimone Ashby, Julie Carson-Berndsen, Gina Joue. 321-324 [doi]

A component by component listening test analysis of the IBM trainable speech synthesis systemRobert E. Donovan. 329-332 [doi]

Semantic abnormality and its realization in spoken languageShimei Pan, Kathleen McKeown, Julia Hirschberg. 333-336 [doi]

TALKING FOREIGN - concatenative speech synthesis and the language barrierNick Campbell. 337-340 [doi]

Schwa-assimilation in danish synthetic speechChristian Jensen. 341-344 [doi]

Text-to-speech synthesis with arbitrary speaker s voice from average voiceMasatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi. 345-348 [doi]

High quality voice conversion based on Gaussian mixture model with dynamic frequency warpingTomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano. 349-352 [doi]

Voice transformations: from speech synthesis to mammalian vocalizationsMin Tang, Chao Wang, Stephanie Seneff. 353-356 [doi]

A new multi-speaker formant synthesizer that applies voice conversion techniquesJuana M. Gutiérrez-Arriola, Juan Manuel Montero, José A. Vallejo, Ricardo de Córdoba, Rubén San Segundo, Juan Manuel Pardo. 357-360 [doi]

Evaluation of cross-language voice conversion based on GMM and straightMikiko Mashimo, Tomoki Toda, Kiyohiro Shikano, Nick Campbell. 361-364 [doi]

Ejective reduction in chaha is conditioned by more than prosodic positionRachel Coulston. 365-368 [doi]

Effects of noise adaptation on the perception of voiced plosives in isolated syllablesWilliam A. Ainsworth, T. Cervera. 371-374 [doi]

On differential limen of word-based local speechrate variation in Japanese expressed by duration ratioMakoto Hiroshige, Kenji Araki, Koji Tochinai. 375-378 [doi]

Reconstructing dialogue historyMarc Swerts, Emiel Krahmer. 383-386 [doi]

Timing and interaction of visual cues for prominence in audiovisual speech perceptionDavid House, Jonas Beskow, Björn Granström. 387-390 [doi]

Modelling the perceptual identification of Japanese consonants from LPC cepstral distancesMasahiko Komatsu, Shinichi Tokuma, Won Tokuma, Takayuki Arai. 391-394 [doi]

Auditory-visual perception of lexical toneDenis Burnham, Valter Ciocca, Stephanie Stokes. 395-398 [doi]

Syllable prominence: a matter of vocal effort, phonetic distinct-ness and top-down processingAnders Eriksson, Gunilla C. Thunberg, Hartmut Traunmller. 399-402 [doi]

Perceived prominence in terms of a linguistically motivated quantitative intonation modelHansjörg Mixdorff, Christina Widera. 403-406 [doi]

Perception of coda voicing from properties of the onset and nucleus of led and let Sarah Hawkins, Noël Nguyen. 407-410 [doi]

Auditory filter bank design using masking curvesL. Lin, Eliathamby Ambikairajah, W. H. Holmes. 411-414 [doi]

A new feature driven cochlear implant speech processing strategyDashtseren Erdenebat, Shigeyoshi Kitazawa, Tatsuya Kitamura. 415-418 [doi]

Acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environmentsHong Kook Kim, Richard C. Rose, Hong-Goo Kang. 421-424 [doi]

A robust front-end algorithm for distributed speech recognitionYan Ming Cheng, Dusan Macho, Yuanjun Wei, Douglas Ealey, Holly Kelleher, David Pearce, William Kushner, Tenkasi Ramabadran. 425-428 [doi]

Robust ASR front-end using spectral-based and discriminant features: experiments on the Aurora tasksM. Carmen Benítez, Lukas Burget, Barry Y. Chen, Stéphane Dupont, Harinath Garudadri, Hynek Hermansky, Pratibha Jain, Sachin S. Kajarekar, Nelson Morgan, Sunil Sivadas. 429-432 [doi]

Noise reduction for noise robust feature extraction for distributed speech recognitionBernhard Noé, Jürgen Sienel, Denis Jouvet, Laurent Mauuary, Johan de Veth, Lou Boves, Febe de Wet. 433-436 [doi]

Harmonic tunnelling: tracking non-stationary noises during speechDouglas Ealey, Holly Kelleher, David Pearce. 437-440 [doi]

Resource-limited sentence boundary detectionDavid Carter, Ian Gransden. 443-446 [doi]

Metrics for measuring domain independence of semantic classesAndrew N. Pargellis, Eric Fosler-Lussier, Alexandros Potamianos, Chin-Hui Lee. 447-450 [doi]

Context-dependent probabilistic hierarchical sublexical modelling using finite state transducersXiaolong Mou, Stephanie Seneff, Victor Zue. 451-454 [doi]

Data-driven semantic inference for unconstrained desktop command and controlJerome R. Bellegarda, Kim E. A. Silverman. 455-458 [doi]

Information extraction via heuristics for a movie showtime query systemMartin Jansche. 459-462 [doi]

Recognition of (almost) spoken words: evidence from word play in JapaneseTakashi Otake, Anne Cutler. 465-468 [doi]

Perceptual experiments on enhanced and slowed down speech sentences for second language acquisitionVincent Colotte, Yves Laprie, Anne Bonneau. 469-473 [doi]

The relation between speech intelligibility and the complex modulation spectrumSteven Greenberg, Takayuki Arai. 473-476 [doi]

Envelope information in speech processing: acoustic-phonetic analysis vs. auditory figure-ground segregationOlivier Crouzet, William A. Ainsworth. 477-480 [doi]

A comparison between human vowel normalization strategies and acoustic vowel transformation techniquesPatti Adank, Roeland Van Hout, Roel Smits. 481-484 [doi]

On large vocabulary continuous speech recognition of highly inflectional language - czechPavel Ircing, Pavel Krbec, Jan Hajic, Josef Psutka, Sanjeev Khudanpur, Frederick Jelinek, William Byrne. 487-490 [doi]

Towards automatic transcription of spontaneous presentationsTakahiro Shinozaki, Chiori Hori, Sadaoki Furui. 491-494 [doi]

A real-time Japanese broadcast news closed-captioning systemOlivier Siohan, Akio Ando, Mohamed Afify, Hui Jiang, Chin-Hui Lee, Qi Li, Feng Liu, Kazuo Onoe, Frank K. Soong, Qiru Zhou. 495-498 [doi]

Investigations on conversational speech recognitionPeter Beyerlein, Xavier L. Aubert, Matthew Harris, Carsten Meyer, Hauke Schramm. 499-502 [doi]

Recent advances in speech recognition system for IBM DARPA communicatorYuqing Gao, Hakan Erdogan, Yongxin Li, Vaibhava Goel, Michael Picheny. 503-506 [doi]

Festival speaks Italian!Piero Cosi, Fabio Tesser, Roberto Gretter, Cinzia Avesani, Mike Macon. 509-512 [doi]

Multilingual TTS for computer telephony: the aculab approachAlex I. C. Monaghan, Mahmoud Kassaei, Mark Luckin, Mariscela Amador-Hernandez, Andrew Lowry, Daniel Faulkner, Fred Sannier. 513-516 [doi]

A flexible multilingual TTS development and speech research toolGéza Kiss, Géza Németh, Gábor Olaszy, Géza Gordos. 517-520 [doi]

Speech synthesis development made easy: the bonn open synthesis systemEsther Klabbers, Karlheinz Stöber, Raymond N. J. Veldhuis, Petra Wagner, Stefan Breuer. 521-525 [doi]

Automatic prosody generation - a model for hungarianGábor Olaszy, Géza Németh, Péter Olaszi. 525-528 [doi]

Evaluation of PROS-3 for the assignment of prosodic structure, compared to assignment by human expertsOlga van Herwijnen, Jacques M. B. Terken. 529-532 [doi]

Stochastic F0 contour model based on the clustering of F0 shapes of a syntactic unitYoichi Yamashita, Tomoyoshi Ishida. 533-536 [doi]

Intonational phrase break prediction using decision tree and n-gram modelXuejing Sun, Ted H. Applebaum. 537-540 [doi]

Synthesizing intonation of standard arabic languageA. Zaki, A. Rajouani, M. Najim. 541-545 [doi]

Invariance of relative F0 change field of Chinese disyllabic wordsDawei Xu, Hiroki Mori, Hideki Kasuya. 545-548 [doi]

Accent label prediction by time delay neural networks using gating clustersAchim F. Müller, Rüdiger Hoffmann. 549-553 [doi]

Transformation-based learning of danish stress assignmentPeter Juel Henrichsen. 553-556 [doi]

On the prosody of German telephone numbersStefan Baumann, Jürgen Trouvain. 557-560 [doi]

Emotional speech synthesis: a reviewMarc Schröder. 561-564 [doi]

Fun or boring? a web-based evaluation of expressive synthesis for childrenKjell Gustafson, David House. 565-568 [doi]

Sub-band based additive noise removal for robust speech recognitionJingdong Chen, Kuldip K. Paliwal, Satoshi Nakamura. 571-574 [doi]

Development of an asynchronous multi-band system for continuous speech recognitionYik-Cheung Tam, Brian Kan-Wing Mak. 575-578 [doi]

A multi-band approach based on the probabilistic union model and frequency-filtering features for robust speech recognitionPeter Jancovic, Ji Ming. 579-582 [doi]

Split-band perceptual harmonic cepstral coefficients as acoustic features for speech recognitionLiang Gu, Kenneth Rose. 583-586 [doi]

Error correcting posterior combination for robust multi-band speech recognitionAstrid Hagen, Hervé Bourlard. 587-590 [doi]

Robust parameters for speech recognition based on subband spectral centroid histogramsBojana Gajic, Kuldip K. Paliwal. 591-594 [doi]

Pseudo-articulatory representations and the recognition of syllable patterns in speechWilliam H. Edmondson, Li Zhang. 595-598 [doi]

ASR - articulatory speech recognitionJoe Frankel, Simon King. 599-602 [doi]

Efficient decoding strategy for conversational speech recognition using state-space models for vocal-tract-resonance dynamicsJeff Z. Ma, Li Deng. 603-606 [doi]

HMM2- extraction of formant structures and their use for robust ASRKatrin Weber, Samy Bengio, Hervé Bourlard. 607-610 [doi]

Auditory model based speech recognition in noisy environmentXiaoqing Yu, Wanggen Wan, Daniel P. K. Lun. 611-614 [doi]

Forward masking for increased robustness in automatic speech recognitionSascha Wendt, Gernot A. Fink, Franz Kummert. 615-618 [doi]

An auditory system-based feature for robust speech recognitionQi Li, Frank K. Soong, Olivier Siohan. 619-622 [doi]

Experiments with the philips continuous ASR system on the AURORA noisy digits databaseMarkus Lieb, Alexander Fischer. 625-628 [doi]

Robust digit recognition in noisy environments: the IBM Aurora 2 systemGeorge Saon, Juan M. Huerta, Ea-Ee Jan. 629-632 [doi]

Evaluating the Aurora connected digit recognition task - a bell labs approachMohamed Afify, Hui Jiang, Filipp Korkmazskiy, Chin-Hui Lee, Qi Li, Olivier Siohan, Frank K. Soong, Arun C. Surendran. 633-636 [doi]

Liaison and schwa deletion in French: an effect of lexical frequency and competition?Cécile Fougeron, J. P. Goldman, Ulrich H. Frauenfelder. 639-642 [doi]

An acoustical analysis of the vowels in beijing MandarinEric Zee, Wai-Sum Lee. 643-646 [doi]

Discriminant analysis of nasal vs. oral vowels in French: comparison between different parametric representationsVéronique Delvaux, Alain Soquet. 647-650 [doi]

Whispery voiced nasal stops in rwandaDidier Demolin, Véronique Delvaux. 651-654 [doi]

Prominence correlates. a study of SwedishGunnar Fant, Anita Kruckenberg, Johan Liljencrants, Antonis Botinis. 657-660 [doi]

Quantitative analysis of the effects of emphasis upon prosodic features of speechSumio Ohno, Hiroya Fujisaki. 661-664 [doi]

Towards a model of target oriented production of prosodyGrzegorz Dogil, Bernd Möbius. 665-668 [doi]

Prosody control for speaking and singing stylesChilin Shih, Greg Kochanski. 669-672 [doi]

A mixture of Gaussians front end for speech recognitionMatthew N. Stuttle, M. J. F. Gales. 675-678 [doi]

Improved maximum mutual information estimation training of continuous density HMMsJing Zheng, John Butzberger, Horacio Franco, Andreas Stolcke. 679-682 [doi]

Maximum-likelihood training of a bipartite acoustic model for speech recognitionFlorent Perronnin, Roland Kuhn, Patrick Nguyen, Jean-Claude Junqua. 683-686 [doi]

Analysis of the root-cepstrum for acoustic modeling and fast decoding in speech recognitionRuhi Sarikaya, John H. L. Hansen. 687-690 [doi]

Structured language model for class identification of out-of-vocabulary words arising from multiple wordclassesShigehiko Onishi, Hirofumi Yamamoto, Yoshinori Sagisaka. 693-696 [doi]

New language models using phrase structures extracted from parse treesTakatoshi Jitsuhiro, Hirofumi Yamamoto, Setsuo Yamada, Yoshinori Sagisaka. 697-700 [doi]

Triggering individual word domains in n-gram language modelsElvira I. Sicilia-Garcia, Ji Ming, F. Jack Smith. 701-704 [doi]

A structured statistical language model conditioned by arbitrarily abstracted grammatical categories based on GLR parsingTomoyosi Akiba, Katunobu Itou. 705-708 [doi]

Speech recognition of broadcast sports newsAtsushi Matsui, Hiroyuki Segi, Akio Kobayashi, Toru Imai, Akio Ando. 709-712 [doi]

Improvement of a structured language model: arbori-context treeShinsuke Mori, Masafumi Nishimura, Nobuyasu Itoh. 713-716 [doi]

Smoothing issues in the structured language modelWoosung Kim, Sanjeev Khudanpur, Jun Wu. 717-720 [doi]

The study of the effect of training set on statistical language modelingXipeng Shen, Bo Xu. 721-724 [doi]

Stochastic finite state automata language model triggered by dialogue statesYannick Estève, Frédéric Béchet, Alexis Nasr, Renato de Mori. 725-728 [doi]

A baseline method for compiling typed unification grammars into context free language modelsManny Rayner, John Dowding, Beth Ann Hockey. 729-732 [doi]

Comparison of width-wise and length-wise language model compressionEdward W. D. Whittaker, Bhiksha Raj. 733-736 [doi]

Large vocabulary statistical language modeling for continuous speech recognition in finnishVesa Siivola, Mikko Kurimo, Krista Lagus. 737-740 [doi]

A new technique based on augmented language models to improve the performance of spoken dialogue systemsRamón López-Cózar, Diego H. Milone. 741-744 [doi]

The influence of vocal effort on human speaker identificationDouglas Brungart, Kimberly R. Scott, Brian D. Simpson. 747-750 [doi]

Improving speaker recognition using phonetically structured Gaussian mixture modelsRobert Faltlhauser, Günther Ruske. 751-754 [doi]

Information fusion for robust speaker verificationConrad Sanderson, Kuldip K. Paliwal. 755-758 [doi]

A robust speaker verification system against imposture using an HMM-based speech synthesis systemTakayuki Satoh, Takashi Masuko, Takao Kobayashi, Keiichi Tokuda. 759-762 [doi]

Sequential decisions for faster and more flexible verificationArun C. Surendran. 763-766 [doi]

Background learning of speaker voices for textindependent speaker identificationWei-Ho Tsai, Y. C. Chu, Chao-Shih Huang, Wen-Whei Chang. 767-771 [doi]

Explicit exploitation of stochastic characteristics of test utterance for text-independent speaker identificationWei-Ho Tsai, Wen-Whei Chang, Chao-Shih Huang. 771-774 [doi]

Improvement of speaker verification for Thai languageChai Wutiwiwatchai, Varin Achariyakulporn, Sawit Kasuriya. 775-778 [doi]

Speaker identification for car infotainment applicationsJavier Rodríguez Saeta, Christian Koechling, Javier Hernando. 779-782 [doi]

A system for text dependent speaker verification - field trial evaluation and simulation resultsHolger Schalk, Herbert Reininger, Stephan Euler. 783-786 [doi]

Speaker recognition in a multi-speaker environmentAlvin F. Martin, Mark A. Przybocki. 787-790 [doi]

A new DP-like speaker clustering algorithmZhijian Ou, Zuoying Wang. 791-794 [doi]

On the use of the Bayesian information criterion in multiple speaker detectionP. Sivakumaran, J. Fortuna, Aladdin M. Ariyaeeinia. 795-798 [doi]

Preliminary experiments on language identification using broadcast news recordingsLaurent Benarousse, Edouard Geoffrois. 799-802 [doi]

Multi-stream statistical n-gram modeling with application to automatic language identificationKatrin Kirchhoff, Sonia Parandekar. 803-806 [doi]

Up to what level can acoustical and textual features predict prominenceBarbertje M. Streefkerk, Louis C. W. Pols, Louis ten Bosch. 811-814 [doi]

Linguistic factors affecting timing in Korean with application to speech synthesisHyunsong Chung, Mark Huckvale. 815-818 [doi]

Measuring rhythmic deviation in second language speechFelix Schaeffler. 819-822 [doi]

Good timing: place-dependent voice onset time in ejective stopsIan Maddieson. 823-826 [doi]

Design of an optimal continuous speech database for text-to-speech synthesis considered as a set covering problemHélène Francois, Olivier Boëffard. 829-832 [doi]

Use of clustering information for coarticulation compensation in speech synthesis by word concatenationChristos Vosnidis, Vassilios Digalakis. 833-836 [doi]

Reducing spectral mismatches in concatenative speech synthesis via systematic database enrichmentMaria Founda, George Tambouratzis, Aimilios Chalamandaris, George Carayannis. 837-840 [doi]

Hansori 2001 - corpus-based implementation of the Korean hansori text-to-speech synthesizerAttila Ferencz, Sung Woo Choi, Ho-Eun Song, Myoung-Wan Koo. 841-844 [doi]

Time and memory efficient viterbi decoding for LVCSR using a precompiled search networkDaniel Willett, Erik McDermott, Yasuhiro Minami, Shigeru Katagiri. 847-850 [doi]

A new verification-based fast match approach to large vocabulary speech recognitionFeng Liu, Mohamed Afify, Hui Jiang, Olivier Siohan. 851-854 [doi]

A fast calculation method in LVCSRS by time-skipping and clustering of probability density distributionsSeiichi Nakagawa, Yukihisa Horibe. 855-858 [doi]

Speech recognition of Japanese news commentaryShinichi Homma, Akio Kobayashi, Shoei Sato, Toru Imai, Akio Ando. 859-862 [doi]

A comparison of LPC and FFT-based acoustic features for noise robust ASRFebe de Wet, Bert Cranen, Johan de Veth, Lou Boves. 865-868 [doi]

Unsupervised noisy environment adaptation algorithm using MLLR and speaker selectionMiichi Yamada, Akira Baba, Shinichi Yoshizawa, Yuichiro Mera, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano. 869-872 [doi]

Applying parallel model compensation with mel-frequency discrete wavelet coefficients for noise-robust speech recognitionZekeriya Tufekci, John N. Gowdy, Sabri Gurbuz, Eric K. Patterson. 873-876 [doi]

Linear interpolation of cepstral variance for noisy speech recognitionTai-Hwei Hwang, Kuo-Hwei Yuo, Hsiao-Chuan Wang. 877-880 [doi]

Evaluation of a generalized dynamic cepstrum in distant speech recognitionHiroshi Matsumoto, Akihiko Shimizu, Kazumasa Yamamoto. 881-884 [doi]

Robust speech/non-speech detection using LDA applied to MFCC for continuous speech recognitionArnaud Martin, Géraldine Damnati, Laurent Mauuary. 885-888 [doi]

Toward noise-tolerant acoustic modelsEdmondo Trentin, Marco Gori. 889-892 [doi]

Noise estimation without explicit speech, non-speech detection: a comparison of mean, modal and median based approachesNicholas W. D. Evans, John S. D. Mason. 893-896 [doi]

Evaluation of front-end features and noise compensation methods for robust Mandarin speech recognitionRathi Chengalvarayan. 897-900 [doi]

ALGONQUIN: iterating laplace s method to remove multiple types of acoustic distortion for robust speech recognitionBrendan J. Frey, Li Deng, Alex Acero, Trausti T. Kristjansson. 901-904 [doi]

Robust speech recognition in noise: an evaluation using the SPINE corpusJohn H. L. Hansen, Ruhi Sarikaya, Umit H. Yapanel, Bryan L. Pellom. 905-908 [doi]

Automated modeling of Chinese intonation in continuous speechGreg Kochanski, Chilin Shih. 911-914 [doi]

Prediction of intonation patterns of accented words in a corpus of read Swedish news through pitch contour stylizationJohan Frid. 915-918 [doi]

The use of fundamental frequency raising as a strategy for increasing vocal intensity in soft, normal, and loud phonationPaavo Alku, Juha Vintturi, Erkki Vilkman. 919-922 [doi]

Prosodic interactions on segmental durations ingreekAntonis Botinis, Marios Fourakis, Robert Bannert. 923-926 [doi]

Study on factors influencing durations of syllables in MandarinMin Chu, Yongqiang Feng. 927-930 [doi]

A comparative study of pauses in dialogues and read speechSofia Gustafson-Capková, Beáta Megyesi. 931-934 [doi]

Detecting Japanese local speech rate deceleration in spontaneous conversational speech using a variable thresholdKeiichi Takamaru, Makoto Hiroshige, Kenji Araki, Koji Tochinai. 935-938 [doi]

Modelling fundamental frequency in first post-tonic syllables in danish sentencesNiels Reinholt Petersen. 939-942 [doi]

Non-finality and pre-finality in bari Italian intonation: a preliminary accountMichelina Savino. 943-946 [doi]

Building an integrated prosodic model of GermanHansjörg Mixdorff, Oliver Jokisch. 947-950 [doi]

A model of F0 contour for arabic affirmative and interrogative sentencesOmar A. G. Ibrahim, S. H. El-Ramly, Nemat S. Abdel Kader. 951-954 [doi]

Variation in final lengthening as a function of topic structureCaroline L. Smith, Lisa A. Hogan. 955-958 [doi]

Do speakers realize the prosodic structure they say they do?Olga van Herwijnen, Jacques M. B. Terken. 959-962 [doi]

Coarticulatory effects at prosodic boundaries: some acoustic resultsMarija Tabain, Guillaume Rolland, Christophe Savariaux. 963-966 [doi]

Generating duration from a cognitively plausible model of rhythm productionPlínio A. Barbosa. 967-970 [doi]

Must diphone synthesis be so unnatural?William J. Barry, Claus Nielsen, Ove Andersen. 975-978 [doi]

Phonetic effects on listener detection of vowel concatenationAnn K. Syrdal. 979-982 [doi]

Variable-length acoustic units inference for text-to-speech synthesisOlivier Boëffard. 983-986 [doi]

Unit selection for speech synthesis using splicing costs with weighted finite state transducersIvan Bulyko, Mari Ostendorf. 987-990 [doi]

Cantonese text-to-speech synthesis using sub-syllable unitsKa Man Law, Tan Lee, Wai H. Lau. 991-994 [doi]

Blind speech separation of moving speakers using hybrid neural networksAthanasios Koutras, Evangelos Dermatas, George K. Kokkinakis. 997-1000 [doi]

Computationally efficient frequency-domain combination of acoustic echo cancellation and robust adaptive beamformingWolfgang Herbordt, Herbert Buchner, Walter Kellermann. 1001-1004 [doi]

Calibration of microphone arrays for improved speech recognitionMichael L. Seltzer, Bhiksha Raj. 1005-1008 [doi]

Improving simultaneous speech recognition in real room environments using overdetermined blind source separationAthanasios Koutras, Evangelos Dermatas, George K. Kokkinakis. 1009-1012 [doi]

Real-time sound source localization and separation system and its application to automatic speech recognitionFutoshi Asano, Masataka Goto, Katunobu Itou, Hideki Asoh. 1013-1016 [doi]

An efficient lipreading method using the symmetry of lipJoohun Lee, Jin Young Kim. 1019-1022 [doi]

Comparing audio- and a-posteriori-probability-based stream confidence measures for audio-visual speech recognitionMartin Heckmann, Thorsten Wild, Frédéric Berthommier, Kristian Kroschel. 1023-1026 [doi]

Large-vocabulary audio-visual speech recognition by machines and humansGerasimos Potamianos, Chalapathy Neti, Giridharan Iyengar, Eric Helmuth. 1027-1030 [doi]

Evaluation of an automatically obtained shape and appearance model for automatic audio visual speech recognitionPhilippe Daubias, Paul Deléglise. 1031-1034 [doi]

An approach to an Italian talking headCatherine Pelachaud, Emanuela Magno Caldognetto, Claudio Zmarich, Piero Cosi. 1035-1038 [doi]

Pause information for dependency analysis of read Japanese sentencesKazuyuki Takagi, Kazuhiko Ozeki. 1041-1044 [doi]

An HMM/n-gram-based linguistic processing approach for Mandarin spoken document retrievalBerlin Chen, Hsin-Min Wang, Lin-Shan Lee. 1045-1048 [doi]

Probabilistic concept verification for language understanding in spoken dialogue systemsYi-Chung Lin, Huei-Ming Wang. 1049-1052 [doi]

Turkish word segmentation using morphological analyzerM. Oguzhan Külekci, Mehmed Özkan. 1053-1056 [doi]

Thai grapheme-to-phoneme using probabilistic GLR parserPongthai Tarsaku, Virach Sornlertlamvanich, Rachod Thongprasirt. 1057-1060 [doi]

Aligning prosody and syntax in property grammarsPhilippe Blache, Daniel Hirst. 1061-1064 [doi]

From perceptual designs to linguistic typology and automatic language identification : overview and perspectivesMelissa Barkat, Ioana Vasilescu. 1065-1068 [doi]

Morphological approaches for an English pronunciation lexiconSusan Fitt. 1069-1072 [doi]

An embodiment paradigm for speech recognition systemsGina Joue, Julie Carson-Berndsen. 1073-1076 [doi]

Multi-parser architecture for query processingKui Xu, Fuliang Weng, Helen M. Meng, Po-Chui Luk. 1077-1080 [doi]

Two-stage probabilistic approach to text segmentationYi-Chia Chen, Yi-Chung Lin. 1081-1084 [doi]

Lexicon optimization for dutch speech recognition in spoken document retrievalRoeland Ordelman, Arjan van Hessen, Franciska de Jong. 1085-1088 [doi]

Evaluation of recent speech grammar standardization effortsTom Brøndsted. 1089-1092 [doi]

Robust speech recognition against packet lossMan-Hung Siu, Yu-Chung Chan. 1095-1098 [doi]

Rapid CODEC adaptation for cellular phone speech recognitionMasaki Naito, Shingo Kuroiwa, Tsuneo Kato, Tohru Shimizu, Norio Higuchi. 1099-1102 [doi]

A robust front-end for ASR over IP snd GSM networks: an integrated scenarioAscensión Gallardo-Antolín, Carmen Peláez-Moreno, Fernando Díaz-de-María. 1103-1106 [doi]

Robust speech recognition using missing feature theory and vector quantizationPhilippe Renevey, Rolf Vetter, Jens Krauss. 1107-1110 [doi]

Modeling the mixtures of known noise and unknown unexpected noise for robust speech recognitionJi Ming, Peter Jancovic, Philip Hanna, Darryl Stewart. 1111-1114 [doi]

Robust speech recognition based on selective use of missing frequency band HMMsTakayoshi Kawamura, Kazuya Takeda, Fumitada Itakura. 1115-1118 [doi]

A new method for speech recognition in the presence of non-stationary, unpredictable and high-level noiseIkuyo Masuda-Katsuse. 1119-1122 [doi]

A computational efficient real time noise robust speech recognition based on improved spectral subtraction methodBojan Kotnik, Zdravko Kacic, Bogomir Horvat. 1123-1126 [doi]

The use of noisy frame elimination and frequency spectrum magnitude reduction in noise robust speech recognitionDamjan Vlaj, Zdravko Kacic, Bogomir Horvat. 1127-1130 [doi]

Combined linear regression adaptation and Bayesian predictive classification for robust speech recognitionJen-Tzung Chien. 1131-1134 [doi]

Quantile based histogram equalization for noise robust speech recognitionFlorian Hilger, Hermann Ney. 1135-1138 [doi]

Sequential noise compensation by a sequential kullback proximal algorithmKaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura. 1139-1142 [doi]

SPeaker and language characterization (spLC): a special interest group (SIG) of ISCAJean-François Bonastre, Ivan Magrin-Chagnolleau, Stephan Euler, François Pellegrino, Régine André-Obrecht, John S. D. Mason, Frédéric Bimbot. 1145-1148 [doi]

The ISCA special interest group on speech synthesisNick Campbell, Wolfgang Hess, Bernd Möbius, Jan P. H. van Santen. 1149-1152 [doi]

Auditory visual speech processingDominic W. Massaro. 1153-1156 [doi]

Training prosodic phrasing rules for Chinese TTS systemsWeijun Chen, Fuzong Lin, Jianmin Li, Bo Zhang. 1159-1162 [doi]

Intonation modelling with a lexicon of natural F0 contoursPer Olav Heggtveit, Jon Emil Natvig. 1163-1166 [doi]

Smooth contour estimation in data-driven pitch modellingKim E. A. Silverman, Jerome R. Bellegarda, Kevin A. Lenzo. 1167-1170 [doi]

Generating F0 contours by statistical manipulation of natural F0 shapesTakashi Saito, Masaharu Sakamoto. 1171-1174 [doi]

Learning prosodic features using a tree representationJulia Hirschberg, Owen Rambow. 1175-1178 [doi]

Lip-reading from parametric lip contours for audio- visual speech recognitionSabri Gurbuz, Eric K. Patterson, Zekeriya Tufekci, John N. Gowdy. 1181-1184 [doi]

An investigation of HMM classifier combination strategies for improved audio-visual speech recognitionSimon Lucey, Sridha Sridharan, Vinod Chandran. 1185-1188 [doi]

Combining multi-party speech and text exchanges over the internetNiels Ole Bernsen, Laila Dybkjær. 1189-1192 [doi]

Real-time multiple speaker tracking by multi-modal integration for mobile robotsKazuhiro Nakadai, Ken-ichi Hidai, Hiroshi G. Okuno, Hiroaki Kitano. 1193-1196 [doi]

XISL: an attempt to separate multimodal interactions from XML contentsTsuneo Nitta, Kouichi Katsurada, Hirobumi Yamada, Yusaku Nakamura, Satoshi Kobayashi. 1197-1200 [doi]

Discriminative speaker adaptation with conditional maximum likelihood linear regressionAsela Gunawardana, William Byrne. 1203-1206 [doi]

What is the best type of prior distribution for EMAP speaker adaptation?Patrick Kenny, Gilles Boulianne, Pierre Dumouchel. 1207-1210 [doi]

Maximum-likelihood affine cepstral filtering (MLACF) technique for speaker normalizationYoon Kim. 1211-1214 [doi]

A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mappingBowen Zhou, John H. L. Hansen. 1215-1218 [doi]

Evaluation on unsupervised speaker adaptation based on sufficient HMM statictics of selected speakersShinichi Yoshizawa, Akira Baba, Kanako Matsunami, Yuichiro Mera, Miichi Yamada, Akinobu Lee, Kiyohiro Shikano. 1219-1222 [doi]

A novel target-driven MLLR adaptation algorithm with multi-layer structureJia Lei, Xu Bo. 1225-1228 [doi]

Scaled likelihood linear regression for hidden Markov model adaptationFrank Wallhoff, Daniel Willett, Gerhard Rigoll. 1229-1232 [doi]

Fast adaptation using constrained affine transformations with hierarchical priorsTor André Myrvoll, Kuldip K. Paliwal, Torbjørn Svendsen. 1233-1236 [doi]

A context adaptation approach for building context dependent models in LVCSRXiaoxing Liu, Baosheng Yuan, YongHong Yan. 1237-1240 [doi]

Improving genericity for task-independent speech recognitionFabrice Lefevre, Jean-Luc Gauvain, Lori Lamel. 1241-1244 [doi]

A posteriori and a priori transformations for speaker adaptation in large vocabulary speech recognition systemsDriss Matrouf, Olivier Bellot, Pascal Nocera, Georges Linares, Jean-François Bonastre. 1245-1248 [doi]

Bayesian methods for HMM speech recognition with limited training dataDarryl W. Purnell, Elizabeth C. Botha. 1249-1252 [doi]

Rapid speaker adaptation using MLLR and subspace regression classesKwok-Man Wong, Brian Kan-Wing Mak. 1253-1256 [doi]

Speaker adaptation of output probabilities and state duration distributions for speech recognitionNéstor Becerra Yoma, Jorge Silva. 1257-1260 [doi]

Cohorts based custom models for rapid speaker and dialect adaptationJian Wu, Eric Chang. 1261-1264 [doi]

Speaker adaptation of quantized parameter HMMsMarcel Vasilache, Olli Viikki. 1265-1268 [doi]

Segmental eigenvoice for rapid speaker adaptationYu Tsao, Shang-Ming Lee, Fu-Chiang Chou, Lin-Shan Lee. 1269-1272 [doi]

Speaker adaptation in an ASR system based on nonlinear dynamical systemsNarada D. Warakagoda, Magne Hallstein Johnsen. 1273-1276 [doi]

An interactive directory assistance service for Spanish with large-vocabulary recognitionRicardo de Córdoba, Rubén San Segundo, Juan Manuel Montero, José Colás, Javier Ferreiros, Javier Macías Guarasa, Juan Manuel Pardo. 1279-1282 [doi]

A multilingual-supporting dialog system using a common dialog controllerYunbiao Xu, Masahiro Araki, Yasuhisa Niimi. 1283-1286 [doi]

Graphic platform for designing and developing practical voice interaction systemsTomas Nouza, Jan Nouza. 1287-1290 [doi]

Speech translation for French in the NESPOLE! European projectLaurent Besacier, Hervé Blanchon, Y. Fouquet, J. P. Guilbaud, S. Helme, S. Mazenot, Daniel Moraru, Dominique Vaufreydaz. 1291-1294 [doi]

Lessons from the development of a conversational interfaceMarianne Hickey, Paul St John Brittan. 1295-1298 [doi]

SCANMail: browsing and searching speech data by contentJulia Hirschberg, Michiel Bacchiani, Donald Hindle, Philip L. Isenhour, Aaron E. Rosenberg, Litza A. Stark, Larry Stead, Steve Whittaker, Gary Zamchick. 1299-1302 [doi]

Multi-scale retrieval in MEI: an English-Chinese translingual speech retrieval systemWai Kit Lo, Patrick Schone, Helen M. Meng. 1303-1306 [doi]

Compact word graph in spoken dialogue systemShih-Chieh Chien, Sen-Chia Chang. 1307-1310 [doi]

MINOS-II: a prototype car navigation system with mixed initiative turn taking dialogueMunehiko Sasajima, Takehide Yano, Taishi Shimomori, Tatsuya Uehara. 1311-1314 [doi]

Use of topic knowledge in spoken dialogue information retrieval system for academic documentsShinya Kiriyama, Keikichi Hirose, Nobuaki Minematsu. 1315-1318 [doi]

Domain-independent spoken dialogue platform using key-phrase spotting based on combined language modelKazunori Komatani, Katsuaki Tanaka, Hiroaki Kashima, Tatsuya Kawahara. 1319-1322 [doi]

OASIS natural language call steering trialPeter J. Durston, Mark Farrell, David Attwater, James Allen, Hong-Kwang Jeff Kuo, Mohamed Afify, Eric Fosler-Lussier, Chin-Hui Lee. 1323-1326 [doi]

First steps toward an adaptive spoken dialogue system in medical domainIvano Azzini, Daniele Falavigna, Roberto Gretter, Giordano Lanzola, Marco Orlandi. 1327-1330 [doi]

Mokusei: a telephone-based Japanese conversational system in the weather domainMikio Nakano, Yasuhiro Minami, Stephanie Seneff, Timothy J. Hazen, D. Scott Cyphers, James R. Glass, Joseph Polifroni, Victor Zue. 1331-1334 [doi]

Speechbuilder: facilitating spoken dialogue system developmentJames R. Glass, Eugene Weinstein. 1335-1338 [doi]

Voice-IF: a mixed-initiative spoken dialogue system for AT&t conference servicesMazin G. Rahim, Giuseppe Di Fabbrizio, Candace A. Kamm, Marilyn A. Walker, A. Pokrovsky, P. Ruscitti, Esther Levin, Sungbok Lee, Ann K. Syrdal, K. Schlosser. 1339-1342 [doi]

SIGdial - special interest group on discourse and dialogueLaila Dybkjær. 1345-1348 [doi]

ISCA SALTMIL SIG: speech and language technology for minority languagesCliment Nadeu, Donncha Cróinín, Bojan Petek, Kepa Sarasola, Briony Williams. 1353-1556 [doi]

Observations on overlap: findings and implications for automatic processing of multi-party conversationElizabeth Shriberg, Andreas Stolcke, Don Baron. 1359-1362 [doi]

Towards SMIL as a foundation for multimodal, multimedia applicationsJennifer L. Beckham, Giuseppe Di Fabbrizio, Nils Klarlund. 1363-1366 [doi]

ANVIL - a generic annotation tool for multimodal dialogueMichael Kipp. 1367-1370 [doi]

DARPA communicator dialog travel planning systems: the june 2000 data collectionMarilyn A. Walker, John S. Aberdeen, Julie E. Boland, Elizabeth Owen Bratt, John S. Garofolo, Lynette Hirschman, Audrey N. Le, Sungbok Lee, Shrikanth Narayanan, K. Papineni, Bryan L. Pellom, Joseph Polifroni, Alexandros Potamianos, P. Prabhu, Alexander I. Rudnicky, Gregory A. Sanders, Stephanie Seneff, David Stallard, Steve Whittaker. 1371-1374 [doi]

Analysis of speaker variabilityChao Huang, Tao Chen, Stan Z. Li, Eric Chang, Jian-Lai Zhou. 1377-1380 [doi]

Speaker recognition by separating phonetic space and speaker spaceMasafumi Nishida, Yasuo Ariki. 1381-1384 [doi]

Eigen-MLLR coefficients as new feature parameters for speaker identificationNick J.-C. Wang, Wei-Ho Tsai, Lin-Shan Lee. 1385-1388 [doi]

Speaker verification using target and background dependent linear transforms and multi-system fusionJiri Navratil, Upendra V. Chaudhari, Ganesh N. Ramaswamy. 1389-1392 [doi]

Testing the perceptual relevance of syntactic completion and melodic configuration for turn-taking in dutchJohanneke Caspers. 1395-1398 [doi]

Cues for perceived pitch registerToni C. M. Rietveld, Patricia Vermillion. 1399-1402 [doi]

Language-specific effects of pitch range on the perception of universal intonational meaningAoju Chen, Toni C. M. Rietveld, Carlos Gussenhoven. 1403-1406 [doi]

Comparing word-level intelligibility after linear vs. non-linear time-compressionEsther Janse. 1407-1410 [doi]

Dynamic lexicon using phonetic featuresKyung Tak Lee, Christian Wellekens. 1413-1416 [doi]

Triphone tying techniques combining a-priori rules and data driven methodsUte Ziegenhain, Josef G. Bauer. 1417-1420 [doi]

Pronunciation modeling and lexical adaptation in midsize vocabulary ASRLouis ten Bosch, Nick Cremelie. 1421-1424 [doi]

Estimating pronunciation variations from acoustic likelihood score for HMM reconstructionYi Liu, Pascale Fung. 1425-1428 [doi]

Breadth-first search for finding the optimal phonetic transcription from multiple utterancesMaximilian Bisani, Hermann Ney. 1429-1432 [doi]

Improved data-driven generation of pronunciation dictionaries using an adapted word listMatthias Wolff, Matthias Eichner, Rüdiger Hoffmann. 1433-1436 [doi]

Segment-based recognition on the phonebook task: initial results and observations on duration modelingKaren Livescu, James R. Glass. 1437-1440 [doi]

Multilingual text-to-phoneme mappingSøren Kamaric Riis, Morten With Pedersen, Kåre Jean Jensen. 1441-1444 [doi]

Pronunciation variation analysis with respect to various linguistic levels and contextual conditions for Mandarin ChineseMing-Yi Tsai, Fu-Chiang Chou, Lin-Shan Lee. 1445-1448 [doi]

Hypothesis-driven accent discriminationLaura Mayfield Tomokiyo. 1449-1452 [doi]

An approach to automatic phonetic baseform generation based on Bayesian networksChangxue Ma, Mark A. Randolph. 1453-1457 [doi]

Towards discriminative lexicon optimizationHauke Schramm, Peter Beyerlein. 1457-1460 [doi]

Model complexity optimization for nonnative English speakersXiaodong He, Yunxin Zhao. 1461-1464 [doi]

Pronunciation modeling in hungarian number recognitionTibor Fegyó, Péter Mihajlik, Péter Tatai, Géza Gordos. 1465-1468 [doi]

AMSTIVOC (AMsterdam system for transcription of infant VOCalizations) applied to utterances of deaf and normally hearing infantsFlorien J. Koopmans-van Beinum, Chris J. Clement, Ineke Van den Dikkenberg-Pot. 1471-1474 [doi]

Using linguopalatal contact patterns to tune a 3d tongue modelOlov Engwall. 1475-1478 [doi]

Electromagnetic articulograph (EMA) based on a nonparametric representation of tthe magnetic fieldTokihiko Kaburagi, Masaaki Honda. 1479-1482 [doi]

European portuguese nasal vowels: an EMMA studyAntónio J. S. Teixeira, Francisco A. C. Vaz. 1483-1486 [doi]

The role of the palate in tongue kinematics: an experimental assessment in v sequences from EPG and EMMA dataSusanne Fuchs, Pascal Perrier, Christine Mooshammer. 1487-1490 [doi]

Modelling care of articulation with HMMs is dangerousMatthew P. Aylett. 1491-1494 [doi]

Spectral tilt as a perturbation-free measurement of noise levels in voice signalsPeter J. Murphy. 1495-1498 [doi]

Estimation of the modulation frequency and modulation depth of the fundamental frequency owing to vocal micro-tremor of the voice source signalJean Schoentgen. 1499-1502 [doi]

The perceptual relevance of glottal-pulse parameter variationsRalph van Dinther, Raymond N. J. Veldhuis, Armin Kohlrausch. 1503-1506 [doi]

Speaker normalization based on test to reference speaker mappingMarcel Ogner, Zdravko Kacic. 1507-1510 [doi]

A face-to-muscle inversion of a biomechanical face model for audiovisual and motor control researchMichel Pitermann, Kevin G. Munhall. 1511-1514 [doi]

A model of vowel production under positive pressure breathingAllan J. South. 1515-1518 [doi]

Helium speech normalisation by codebook mappingAdam Podhorski, Marek Czepulonis. 1519-1523 [doi]

Building a corpus of natural speech - and tools for the processing of expressive speechNick Campbell. 1525-1528 [doi]

Aspects of modern multi-modal/multi-media corpora exploitation environmentsDaan Broeder, Hennie Brugman, Peter Wittenburg. 1529-1532 [doi]

Emerging requirements for multi-modal annotation and analysis toolsTony Bigbee, Dan Loehr, Lisa Harper. 1533-1536 [doi]

Three-dimensional modelling of speech corpora: added value through visualisationToomas Altosaar, Matti Karjalainen, Martti Vainio. 1537-1540 [doi]

Smartkom: multimodal communication with a life- like characterWolfgang Wahlster, Norbert Reithinger, Anselm Blocher. 1547-1550 [doi]

ISIS: a learning system with combined interaction and delegation dialogsHelen M. Meng, Shuk Fong Chan, Yee Fong Wong, Cheong Chat Chan, Yiu Wing Wong, Tien Ying Fung, Wai Ching Tsui, Ke Chen 0001, Lan Wang, Ting-Yao Wu, Xiaolong Li, Tan Lee, Wing Nin Choi, P. C. Ching, Huisheng Chi. 1551-1554 [doi]

Robust language understanding in mipadYe-Yi Wang. 1555-1558 [doi]

The WITAS multi-modal dialogue system IOliver Lemon, Anne Bracy, Alexander Gruenstein, Stanley Peters. 1559-1562 [doi]

Universalizing speech: notes from the USI projectStefanie Shriver, Roni Rosenfeld, Xiaojin Zhu, Arthur R. Toth, Alexander I. Rudnicky, Markus Flueckiger. 1563-1566 [doi]

Use of real and contaminated speech for training of a hands-free in-car speech recognizerMarco Matassoni, Maurizio Omologo, Piergiorgio Svaizer. 1569-1572 [doi]

Combined front-end signal processing for in-vehicle speech systemsJay P. Plucienkowski, John H. L. Hansen, Pongtep Angkititrakul. 1573-1576 [doi]

Robust automatic speech recognition in low-SNR car environments by the application of a connectionist subspace-based approach to the melbased cepstral coefficientsSid-Ahmed Selouani, Hesham Tolba, Douglas D. O Shaughnessy. 1577-1580 [doi]

Recognition of spelled city names in automotive environmentsAndreas Korthauer. 1581-1584 [doi]

Acoustic echo control and noise reduction for cabin car communicationEduardo Lleida, Enrique Masgrau, Alfonso Ortega. 1585-1588 [doi]

FST-based recognition techniques for multi-lingual and multi-domain spontaneous speechTimothy J. Hazen, I. Lee Hetherington, Alex Park. 1591-1594 [doi]

A transducer approach to word graph generationGilles Boulianne, Pierre Ouellet, Pierre Dumouchel. 1595-1598 [doi]

An efficient implementation of phonological rules using finite-state transducersI. Lee Hetherington. 1599-1602 [doi]

A weight pushing algorithm for large vocabulary speech recognitionMehryar Mohri, Michael Riley. 1603-1606 [doi]

Transducer optimizations for tight-coupled decodingAlexander Seward. 1607-1612 [doi]

Distinctive features for use in an automatic speech recognition systemEllen Eide. 1613-1616 [doi]

Improved context-dependent acoustic modeling for continuous Chinese speech recognitionJiyong Zhang, Thomas Fang Zheng, Jing Li, Chunhua Luo, Guoliang Zhang. 1617-1620 [doi]

Class definition in discriminant feature analysisJacques Duchateau, Kris Demuynck, Dirk Van Compernolle, Patrick Wambacq. 1621-1624 [doi]

Feature extraction from time-frequency matrices for robust speech recognitionJosé C. Segura, M. Carmen Benítez, Ángel de la Torre, Antonio J. Rubio. 1625-1628 [doi]

Using spatial correlation information in speech recognitionPeng Yu, Zuoying Wang. 1629-1632 [doi]

On the choice of classes in MCE based discriminative HMM-training for speech recognizers used in the telephone environmentJosef G. Bauer. 1633-1636 [doi]

Plosive spotting with margin classifiersJoseph Keshet, Dan Chazan, Ben-Zion Bobrovsky. 1637-1640 [doi]

Model agglomeration for context-dependent acoustic modelingFabio Brugnara. 1641-1644 [doi]

Multipass algorithm for acquisition of salient acoustic morphemesMichael Levit, Allen L. Gorin, Jeremy H. Wright. 1645-1648 [doi]

Rapid vocal tract length normalization using maximum likelihood estimationTadashi Emori, Koichi Shinoda. 1649-1652 [doi]

Towards the creation of acoustic models for stressed Japanese speechKozo Okuda, Tomoko Matsui, Satoshi Nakamura. 1653-1656 [doi]

Elderly acoustic model for large vocabulary continuous speech recognitionAkira Baba, Shinichi Yoshizawa, Miichi Yamada, Akinobu Lee, Kiyohiro Shikano. 1657-1660 [doi]

A hybrid approach to enhance task portability of acoustic models in Chinese speech recognitionJin-Song Zhang, Shuwu Zhang, Yoshinori Sagisaka, Satoshi Nakamura. 1661-1664 [doi]

Evaluation of sublexical and lexical models of acoustic disfluencies for spontaneous speech recognition in SpanishLuis Javier Rodríguez, Inés Torres, Amparo Varona. 1665-1668 [doi]

Structural learning of dynamic Bayesian networks in speech recognitionMurat Deviren, Khalid Daoudi. 1669-1672 [doi]

A new method for testing communication efficiency and user acceptability of speech communication channelsSander J. van Wijngaarden, Paula M. T. Smeele, Herman J. M. Steeneken. 1675-1678 [doi]

Phonetic transcriptions in the spoken dutch corpus: how to combine efficiency and good transcription qualityCatia Cucchiarini, Diana Binnenpoorte, Simo M. A. Goddijn. 1679-1682 [doi]

A functional approach to speech recognition evaluationBen Hutchinson. 1683-1686 [doi]

Instrumental derivation of equipment impairment factors for describing telephone speech codec degradationsSebastian Möller, Jens Berger. 1687-1690 [doi]

Julius - an open source real-time large vocabulary recognition engineAkinobu Lee, Tatsuya Kawahara, Kiyohiro Shikano. 1691-1694 [doi]

Local refinement of phonetic boundaries: a general framework and its application using different transition modelsDoroteo Torre Toledano, Luis A. Hernández Gómez. 1695-1698 [doi]

Detection of digital transmission systems for voice quality measurementsThorsten Ludwig, Ulrich Heute. 1699-1702 [doi]

Automatic segmentation of recorded speech into syllables for speech synthesisEric Lewis, Mark Tatham. 1703-1706 [doi]

Phonetic events from the labeling the european portuguese database for speech synthesis, FEUP/IPBDBJoão Paulo Teixeira, Diamantino Freitas, Daniela Braga, Maria João Barros, Vagner Latsch. 1707-1710 [doi]

Acoustical and topological experiments for an HMM-based speech segmentation systemSamir Nefti, Olivier Boëffard. 1711-1714 [doi]

TclBLASR: an automatic speech recognition extension for tclQiru Zhou, Jinsong Zheng, Chin-Hui Lee. 1715-1718 [doi]

Lower WERs do not guarantee better transcriptionsJudith M. Kessens, Helmer Strik. 1721-1724 [doi]

An elitist approach to articulatory-acoustic feature classificationShuangyu Chang, Steven Greenberg, Mirjam Wester. 1725-1728 [doi]

A dutch treatment of an elitist approach to articulatory-acoustic feature classificationMirjam Wester, Steven Greenberg, Shuangyu Chang. 1729-1732 [doi]

Hybrid natural language generation for spoken dialogue systemsMichel Galley, Eric Fosler-Lussier, Alexandros Potamianos. 1735-1738 [doi]

The generation of speech for a search guideNicholas J. Cook, Ian D. Benest. 1739-1742 [doi]

An automatic dialogue system generator from the internet information contentsMasahiro Araki, Tasuku Ono, Kiyoshi Ueda, Takuya Nishimoto, Yasuhisa Niimi. 1743-1746 [doi]

Training a sentence planner for spoken dialog: the impact of syntactic and planning featuresMonica Rogati, Marilyn A. Walker, Owen Rambow. 1747-1750 [doi]

A comparative study of MLP-based artificial neural networks in text-independent speaker verification against GMM-based systemsCarlos E. Vivaracho, Javier Ortega-Garcia, Luis Alonso, Q. Isaac Moro. 1753-1757 [doi]

Enhancing GMM scores using SVM hints Shai Fine, Jiri Navratil, Ramesh A. Gopinath. 1757-1760 [doi]

Combining GMM s with suport vector machines for text-independent speaker verificationJamal Kharroubi, Dijana Petrovska-Delacrétaz, Gérard Chollet. 1761-1764 [doi]

A text-independent speaker verification system using support vector machines classifierYong Gu, Trevor Thomas. 1765-1768 [doi]

Advances in automatic speech summarizationChiori Hori, Sadaoki Furui. 1771-1774 [doi]

A word graph interface for a flexible concept based speech understanding frameworkKadri Hacioglu, Wayne Ward. 1775-1778 [doi]

Comparing grammar-based and robust approaches to speech understanding: a case studySylvia Knight, Genevieve Gorrell, Manny Rayner, David Milward, Rob Koeling, Ian Lewin. 1779-1782 [doi]

Integrating multiple knowledge sources for improved speech understandingSherif Abdou, Michael S. Scordilis. 1783-1786 [doi]

Classification of transition sounds with application to automatic speech recognitionZeev Litichever, Dan Chazan. 1789-1792 [doi]

Gaussian subtraction (GS) algorithms for word spotting in continuous speechAvi Faizakov, Arnon Cohen, Tzur Vaich. 1793-1796 [doi]

Relating frame accuracy with word error in hybrid ANN-HMM ASRMichael L. Shire. 1797-1800 [doi]

A two-layer lexical tree based beam search in continuous Chinese speech recognitionGuoliang Zhang, Thomas Fang Zheng, Wenhu Wu. 1801-1804 [doi]

Automatic labeling and digesting for lecture speech utilizing repeated speech by shift CDPYoshiaki Itoh, Kazuyo Tanaka. 1805-1808 [doi]

Improved phoneme-history-dependent search for large-vocabulary continuous-speech recognitionTakaaki Hori, Yoshiaki Noda, Shoichi Matsunaga. 1809-1813 [doi]

Comparison of MFCC and PLP parameterizations in the speaker independent continuous speech recognition taskJosef Psutka, Ludek Müller, Josef V. Psutka. 1813-1816 [doi]

N-best list generation using word and phoneme recognition fusionErnest Pusateri, Jean-Manuel Van Thong. 1817-1820 [doi]

A one pass semi-dynamic network decoder based on language model networkDong-Hoon Ahn, Minhwa Chung. 1821-1824 [doi]

Improving automatic speech recognition using tangent distanceWolfgang Macherey, Daniel Keysers, Jörg Dahmen, Hermann Ney. 1825-1828 [doi]

N-best speech hypotheses reordering using linear regressionAnanlada Chotimongkol, Alexander I. Rudnicky. 1829-1832 [doi]

Low-resource hidden Markov model speech recognitionSabine Deligne, Ellen Eide, Ramesh A. Gopinath, Dimitri Kanevsky, Benoît Maison, Peder A. Olsen, Harry Printz, Jan Sedivý. 1833-1836 [doi]

Speech recognition at multiple sampling ratesH. G. Hirsch, K. Hellwig, Stefan Dobler. 1837-1840 [doi]

Support vector machine with dynamic time-alignment kernel for speech recognitionHiroshi Shimodaira, Ken-ichi Noma, Mitsuru Nakai, Shigeki Sagayama. 1841-1844 [doi]

Efficient scalable speech compression for scalable speech recognitionNaveen Srinivasamurthy, Antonio Ortega, Shrikanth Narayanan. 1845-1848 [doi]

Voice activity detection in noisy environmentsJan Stadermann, V. Stahl, G. Rose. 1851-1854 [doi]

An improved wavelet-based speech enhancement systemHamid Sheikhzadeh, Hamid Reza Abutalebi. 1855-1858 [doi]

Enhancing distributed speech recognition with back- end speech reconstructionTenkasi Ramabadran, Jeff Meunier, Mark Jasiuk, Bill Kushner. 1859-1862 [doi]

Implementation effective one-channel noise reduction systemJiri Tihelka, Pavel Sovka. 1863-1866 [doi]

Efficient speech enhancement by diffusive gain factors (DGF)Hyoung-Gook Kim, Klaus Obermayer, Mathias Bode, Dietmar Ruwisch. 1867-1870 [doi]

Correction of the voice timbre distortions on telephone networkGaël Mahé, André Gilloire. 1871-1874 [doi]

Speech enhancement based on IMM with NPHMMYunjung Lee, Joohun Lee, Ki Yong Lee, Katsuhiko Shirai. 1875-1878 [doi]

Speech recognition under musical environments using kalman filter and iterative MLLR adaptationMasakiyo Fujimoto, Yasuo Ariki. 1879-1882 [doi]

Dual channel speech enhancement using coherence function and MDL-based subspace approach in bark domainRolf Vetter, Philippe Renevey, Jens Krauss. 1883-1886 [doi]

Entropy based voice activity detection in very noisy conditionsPhilippe Renevey, Andrzej Drygajlo. 1887-1890 [doi]

Credibility proof for speech content and speaker verification by fragile watermarking with consecutive frame-based processingYiou-Wen Cheng, Lin-Shan Lee. 1895-1898 [doi]

Map estimation for on-line noise compensation of time trajectories of spectral coefficientsIlyas Potamitis, Nikos Fakotakis, George K. Kokkinakis. 1899-1902 [doi]

A new method for speech denoising and robust speech recognition using probabilistic models for clean speech and for noiseHagai Attias, Li Deng, Alex Acero, John C. Platt. 1903-1906 [doi]

Designing very compact decision trees for grapheme-to-phoneme transcriptionAnne K. Kienappel, Reinhard Kneser. 1911-1914 [doi]

Using machine learning techniques for grapheme to phoneme transcriptionFranco Mana, Paolo Massimino, Alberto Pacchiotti. 1915-1918 [doi]

Knowledge of language origin improves pronunciation accuracy of proper namesAriadna Font Llitjós, Alan W. Black. 1919-1922 [doi]

On the pronunciation of acronyms in French and in ItalianPhilippe Boula de Mareüil, Franck Floricic. 1923-1926 [doi]

Enhancement of noisy speech by using improved global soft decisionVladimir I. Shin, Doh-Suk Kim, Moo-young Kim, Jeongsu Kim. 1929-1932 [doi]

Enhancement of speech using bark-scaled wavelet packet decompositionIsrael Cohen. 1933-1936 [doi]

A new approach for wavelet speech enhancementMohammed Bahoura, Jean Rouat. 1937-1940 [doi]

Speech/noise-dominant decision for speech enhancementSukhyun Yoon, Chang D. Yoo. 1941-1944 [doi]

An MCE based classification tree using hierarchical feature-weighting in speech recognitionFan Wang, Thomas Fang Zheng, Wenhu Wu. 1947-1950 [doi]

Selective MCE training strategy in Mandarin speech recognitionJian-Lai Zhou, Eric Chang, Chao Huang. 1951-1954 [doi]

Discriminative disfluency modeling for spontaneous speech recognitionChung-Hsien Wu, Gwo-Lang Yan. 1955-1958 [doi]

Comparative analysis for data-driven temporal filters obtained via principal component analysis (PCA) and linear discriminant analysis (LDA) in speech recognitionJeih-Weih Hung, Hsin-Min Wang, Lin-Shan Lee. 1959-1962 [doi]

Coding method for successive pitch periodsAri Heikkinen, Vesa T. Ruoppila, Samuli Pietilä. 1965-1968 [doi]

Objective evaluation of methods for quantization of variable-dimension spectral vectors in WI speech codingJani Nurminen, Ari Heikkinen, Jukka Saarinen. 1969-1972 [doi]

Squared error as a measure of phase distortionHarald Pobloth, W. Bastiaan Kleijn. 1973-1976 [doi]

Non-linear predictive vector quantization of speechMarcos Faúndez-Zanuy. 1977-1980 [doi]

A variable rate hybrid coder based on a synchronized harmonic excitationNilantha Katugampala, Ahmet M. Kondoz. 1981-1984 [doi]

A hybrid sub-band sinusoidal coding schemeMeau Shin Ho, Derek J. Molyneux, Barry M. G. Cheetham. 1985-1988 [doi]

Low rate speech coding incorporating simultaneously masked spectrally weighted linear predictionJason Lukasiak, Ian S. Burnett, Christian H. Ritz. 1989-1992 [doi]

Narrowband perceptual audio coding: enhancements for speechHossein Najaf-Zadeh, Peter Kabal. 1993-1996 [doi]

Techniques for high-quality ACELP coding of wideband speechBruno Bessette, Roch Lefebvre, Redwan Salami, Milan Jelinek, J. Vainio, J. Rotola-Pukkila, H. Mikkola, K. Järvinen. 1997-2000 [doi]

Wideband ACELP at 16 kb/s with multi-band excitationSílvia Pujalte, Asunción Moreno. 2001-2004 [doi]

Wideband speech coding algorithm with application of discrete wavelet transform to upper bandSeung-Won Lee, Keun-Sung Bae. 2005-2008 [doi]

A switched DPCM/subband coder for pre-echo reductionS. Satheesh, T. V. Sreenivas. 2009-2012 [doi]

A generalized multistage VQ approach for spectral magnitude quantizationCagri Özgenc Etemoglu, Vladimir Cuperman. 2013-2016 [doi]

Efficient implementation of ITU-t g.723.1 speech coder for multichannel voice transmission and storageSung-Kyo Jung, Young-Cheol Park, Sung-Wan Yoon, Kyung Tae Kim, Dae Hee Youn. 2017-2020 [doi]

CU-move : analysis & corpus development for interactive in-vehicle speech systemsJohn H. L. Hansen, Pongtep Angkititrakul, Jay P. Plucienkowski, Stephen Gallant, Umit H. Yapanel, Bryan L. Pellom, Wayne Ward, Ronald Cole. 2023-2026 [doi]

Multimedia data collection of in-car speech communicationNobuo Kawaguchi, Shigeki Matsubara, Kazuya Takeda, Fumitada Itakura. 2027-2030 [doi]

The u.s. speechdat-car data collectionPeter A. Heeman, David Cole, Andrew Cronk. 2031-2034 [doi]

Word unit based multilingual comparative analysis of text corporaGéza Németh, Csaba Zainkó. 2035-2038 [doi]

Creating a european English broadcast news transcription corpus and systemGerhard Backfried, Robert Hecht, Sabine Loots, Norbert Pfannerer, Jürgen Riedler, Christian Schiefer. 2039-2042 [doi]

The nespole! voIP dialogue databaseSusanne Burger, Laurent Besacier, Paolo Coletti, Florian Metze, Céline Morel. 2043-2046 [doi]

Design of speech corpus for text-to-speech synthesisJindrich Matousek, Josef Psutka, Jiri Kruta. 2047-2050 [doi]

The IFA corpus: a phonemically segmented dutch open source speech databaseR. J. J. H. van Son, Diana Binnenpoorte, Henk van den Heuvel, Louis C. W. Pols. 2051-2054 [doi]

African speech technology (AST) telephone speech databases: corpus design and contentsPhilippa H. Louw, Justus C. Roux, Elizabeth C. Botha. 2055-2058 [doi]

Speechdat-e: five eastern european speech databases for voice-operated teleservices completedHenk van den Heuvel, Jérôme Boudy, Zsolt Bakcsi, Jan Cernocký, Valery Galunov, Julia Kochanina, Wojciech Majewski, Petr Pollák, Milan Rusko, Jerzy Sadowski, Piotr Staroniewicz, Herbert S. Tropf. 2059-2062 [doi]

Concordancing for parallel spoken language corporaDafydd Gibbon, Thorsten Trippel, Serge Sharoff. 2063-2066 [doi]

Large broadcast news and read speech corpora of spoken czechJosef Psutka, Vlasta Radová, Ludek Müller, Jindrich Matousek, Pavel Ircing, David Graff. 2067-2070 [doi]

Development of Russian lexical databases, corpora and supporting tools for speech productsSerge A. Yablonsky. 2071-2074 [doi]

Constructing a segment database for greek time domain speech synthesisStavroula-Evita Fotinea, George Tambouratzis, George Carayannis. 2075-2078 [doi]

Subjective assessment of speech-system interface usabilityKate S. Hone, Robert Graham. 2083-2086 [doi]

An objective measure for estimating MOS of synthesized speechMin Chu, Hu Peng. 2087-2090 [doi]

Comparing the performance of two CSRs: how to determine the significance level of the differencesHelmer Strik, Catia Cucchiarini, Judith M. Kessens. 2091-2094 [doi]

Prediction of low recognition rate words for isolated word recognition systemRyuta Terashima, Hiroyuki Hoshino, Toshihiro Wakita. 2095-2098 [doi]

An objective measure for assessment of the concatenative TTS segment inventoriesRobert Batusek. 2099-2102 [doi]

Word level confidence annotation using combinations of featuresRong Zhang, Alexander I. Rudnicky. 2105-2108 [doi]

A boosting approach for confidence scoringPedro J. Moreno, Beth Logan, Bhiksha Raj. 2109-2112 [doi]

On combining confidence measures for improved rejection of incorrect dataDelphine Charlet, Guy Mercier, Denis Jouvet. 2113-2116 [doi]

Improved word confidence estimation using long range featuresDavid D. Palmer, Mari Ostendorf. 2117-2120 [doi]

Is this conversation on track?Paul Carpenter, Chun Jin, Daniel Wilson, Rong Zhang, Dan Bohus, Alexander I. Rudnicky. 2121 [doi]

Automatic n-gram language model creation from web resourcesRyuichi Nisimura, Kumiko Komatsu, Yuka Kuroda, Kentaro Nagatomo, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano. 2127-2130 [doi]

On integrating the lexicon with the language modelDiamantino Caseiro, Isabel Trancoso. 2131-2134 [doi]

Back-off smoothing evaluation over syntactic language modelsAmparo Varona, Inés Torres. 2135-2138 [doi]

An online incremental language model adaptation methodGenqing Wu, Thomas Fang Zheng, Ling Jin, Wenhu Wu. 2139-2142 [doi]

Using boosting and POS word graph tagging to improve speech recognitionChrister Samuelsson, James Hieronymus. 2143-2146 [doi]

Robust parsing in spoken dialogue systemsPengju Yan, Thomas Fang Zheng, Mingxing Xu. 2149-2152 [doi]

A theme structure method for the ellipsis resolutionYinfei Huang, Thomas Fang Zheng, Yi Su, Fang Li, Wenhu Wu. 2153-2156 [doi]

Deriving document structure from prosodic cuesMartin Haase, Werner Kriechbaum, Gregor Möhler, Gerhard Stenzel. 2157-2160 [doi]

Design of a semantic parser with support to ellipsis resolution in a Chinese spoken language dialogue systemYi Su, Thomas Fang Zheng, Yinfei Huang. 2161-2164 [doi]

Methodology for dialogue design in telephone-based spoken dialogue systems: a Spanish train information systemRubén San Segundo, Juan Manuel Montero, José Colás, Juana M. Gutiérrez, J. M. Ramos, Juan Manuel Pardo. 2165-2168 [doi]

Spoken dialogue management as planning and acting under uncertaintyBo Zhang, Qingsheng Cai, Jianfeng Mao, Eric Chang, Baining Guo. 2169-2172 [doi]

Modeling of conversational strategy for the robot participating in the group conversationYosuke Matsusaka, Shinya Fujie, Tetsunori Kobayashi. 2173-2176 [doi]

Supporting the construction of a user model in speech-only interfaces by adding multi-modalityJacques M. B. Terken, Saskia te Riele. 2177-2180 [doi]

A word- and turn-oriented approach to exploring the structure of Mandarin dialoguesShu-Chuan Tseng. 2181-2184 [doi]

A rule based approach to extraction of topics and dialog acts in a spoken dialog systemYasuhisa Niimi, Tomoki Oku, Takuya Nishimoto, Masahiro Araki. 2185-2188 [doi]

Agent-based error handling in spoken dialogue systemsMarkku Turunen, Jaakko Hakulinen. 2189-2192 [doi]

Iterative implementation of dialogue system modulesLars Degerstedt, Arne Jönsson. 2193-2196 [doi]

Off-talk - a problem for human-machine-interaction?Daniela Oppermann, Florian Schiel, Silke Steininger, Nicole Beringer. 2197-2200 [doi]

Automatic analysis of real dialogues and generating of training corporaJana Schwarz, Václav Matousek. 2201-2204 [doi]

Natural language understanding using statistical machine translationKlaus Macherey, Franz Josef Och, Hermann Ney. 2205-2208 [doi]

Improvements in audio processing and language modeling in the CU communicatorJianping Zhang, Wayne Ward, Bryan L. Pellom, Xiuyang Yu, Kadri Hacioglu. 2209-2212 [doi]

Dialogue session: management using voiceXMLAugustine Tsai, Andrew N. Pargellis, Chin-Hui Lee, Joseph P. Olive. 2213-2216 [doi]

Ambiguity representation and resolution in spoken dialogue systemsEgbert Ammicht, Alexandros Potamianos, Eric Fosler-Lussier. 2217-2220 [doi]

Feature extraction by auditory modeling for unit selection in concatenative speech synthesisMinoru Tsuzaki. 2223-2226 [doi]

Perceptual cost functions for unit searching in large corpus-based text-to-speechMinkyu Lee. 2227-2230 [doi]

Pruning of redundant synthesis instances based on weighted vector quantizationSanghun Kim, YoungJik Lee, Keikichi Hirose. 2231-2234 [doi]

Using real words for recording diphonesSusan Fitt. 2235-2238 [doi]

Application of the trended hidden Markov model to speech synthesisJohn Dines, Sridha Sridharan, Miles Moody. 2239-2242 [doi]

Two features to check phonetic transcriptions in text to speech systemsStefano Sandri, Enrico Zovato. 2243-2246 [doi]

Text-to-speech scripting interface for appropriate vocalisation of e-textsGerasimos Xydas, Georgios Kouroupetroglou. 2247-2250 [doi]

Representation of large lexica using finite-state transducers for the multilingual text-to-speech synthesis systemsMatej Rojc, Zdravko Kacic. 2251-2254 [doi]

Corpus-based synthesis of fundamental frequency contours based on a generation process modelKeikichi Hirose, Masaya Eto, Nobuaki Minematsu, Atsuhiro Sakurai. 2255-2258 [doi]

Corpus-based database of residual excitations used for speech reconstruction from MFCCsZbynek Tychtl, Josef Psutka. 2259-2262 [doi]

Mixed excitation for HMM-based speech synthesisTakayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura. 2263-2266 [doi]

Aperiodicity control in ARX-based speech analysis-synthesis methodTakahiro Ohtsuka, Hideki Kasuya. 2267-2270 [doi]

Generalized source-filter structures for speech synthesisMatti Karjalainen, Tuomas Paatero. 2271-2274 [doi]

The speech synthesis environment and parametric modeling of coarticulationMikolaj Wypych. 2275-2278 [doi]

Defining constraints for multilinear speech processingJulie Carson-Berndsen, Michael Walsh. 2281-2284 [doi]

Prosodic models, automatic speech understanding, and speech synthesis: towards the common groundAnton Batliner, Bernd Möbius, Gregor Möhler, Antje Schweitzer, Elmar Nöth. 2285-2288 [doi]

Introducing phonetically motivated information into ASRHeidi Christensen, Børge Lindberg, Ove Andersen. 2289-2292 [doi]

Integrating contextual phonological rules in a large vocabulary decoderGuillaume Gravier, François Yvon, Bruno Jacob, Frédéric Bimbot. 2293-2296 [doi]

Automatic learning of finite state automata for pronunciation modelingMoisés Pastor-i-Gadea, Francisco Casacuberta. 2297-2300 [doi]

AMR wideband codec - leap in mobile communication voice qualityJ. Rotola-Pukkila, J. Vainio, H. Mikkola, K. Järvinen, Bruno Bessette, Roch Lefebvre, Redwan Salami, M. Jeline. 2303-2306 [doi]

Combined speech and audio coding with bit rate and bandwidth scalabilityMaria Farrugia, Ahmet M. Kondoz. 2307-2310 [doi]

Joint speech and audio coding combining sinusoidal modeling and wavelet packetsMárk Fék, Annamária R. Várkonyi-Kóczy, Jean-Marc Boucher. 2311-2314 [doi]

Temporal decomposition: a promising approach to low rate wideband speech compressionChristian H. Ritz, Ian S. Burnett. 2315-2318 [doi]

Wideband LSF quantization by generalized voronoi codesStéphane Ragot, Hassan Lahdili, Roch Lefebvre. 2319-2322 [doi]

Learning of user formulations for business listings in automatic directory assistanceCosmin Popovici, M. Andorno, Pietro Laface, Luciano Fissore, M. Nigra, Claudio Vair. 2325-2328 [doi]

Mathematical modeling of spoken human - machine dialogues including erroneous confirmationsD. Louloudis, Anastasios Tsopanoglou, Nikos Fakotakis, George K. Kokkinakis. 2329-2332 [doi]

Limited enquiry negotiation dialoguesIan Lewin. 2333-2336 [doi]

A comparison of some different techniques for vector based call-routingStephen Cox, Ben Shahshahani. 2337-2340 [doi]

Architecture for adaptive multimodal dialog systems based on voiceXMLGeorg Niklfeld, Robert Finan, Michael Pucher. 2341-2344 [doi]

Separating speaker and environment variabilities for improved recognition in non-stationary conditionsLuca Rigazio, Patrick Nguyen, David Kryze, Jean-Claude Junqua. 2347-2350 [doi]

Robust speech recognition techniques applied to a speech in noise taskRichard C. Rose, Hong Kook Kim, Donald Hindle. 2351-2355 [doi]

Minimax classification with parametric neighborhoods for noisy speech recognitionMohamed Afify, Olivier Siohan, Chin-Hui Lee. 2355-2358 [doi]

Maximum likelihood non-linear transformation for environment adaptation in speech recognitionMukund Padmanabhan, Satya Dharanipragada. 2359-2362 [doi]

A study of speech coding parameters in speech recognitionJari Turunen, Damjan Vlaj. 2363-2366 [doi]

Some practical considerations in the deployment of a wireless-communication interactive voice response systemCarmen García-Mateo, Laura Docío Fernández, Antonio Cardenal López. 2369-2372 [doi]

Caller identification for the SCANMail voicemail browserAaron E. Rosenberg, Julia Hirschberg, Michiel Bacchiani, S. Parthasarathy, Philip L. Isenhour, Larry Stead. 2373-2376 [doi]

Extractive summarization of voicemail using lexical and prosodic feature subset selectionKonstantinos Koumpis, Steve Renals, Mahesan Niranjan. 2377-2380 [doi]

Business listings in automatic directory assistanceOdette Scharenborg, Janienke Sturm, Lou Boves. 2381-2384 [doi]

Eutrans: a speech-to-speech translator prototypeMoisés Pastor-i-Gadea, Alberto Sanchís, Francisco Casacuberta, Enrique Vidal. 2385-2388 [doi]

Speech recognition over netmeeting connectionsFlorian Metze, John W. McDonough, Hagen Soltau. 2389-2392 [doi]

DIARCA: a component approach to voice recognitionJuan Carlos Díaz Martín, Juan-Luis García Zapata, José Manuel Rodríguez García, José F. Álvarez Salgado, Pablo Espada Bueno, Pedro Gómez Vilda. 2393-2396 [doi]

The mvprotek : m-commerce voice verification systemY. J. Kyung, J. O. Jung, S. M. Sohn, H. J. Chun, S. Y. Moon, M. H. Kim, W. H. Sull. 2397-2400 [doi]

Real-time multilingual communication by means of prestored conversational unitsNorman Alm, Mamoru Iwabuchi, Peter N. Andreasen, Kenryu Nakamura, Iain R. Murray. 2401-2404 [doi]

Writing script-based dialogues for AACIain R. Murray, John L. Arnott, Norman Alm, Richard Dye, Gillian Harper. 2405-2409 [doi]

Communication aid for non-vocal people using corpusbased concatenative speech synthesisAkemi Iida, Yosuke Sakurada, Nick Campbell, Michiaki Yasumura. 2409-2412 [doi]

Social effects on vocal rate with echoic mimicry using prosody-only voiceNoriko Suzuki, Kazuhiko Kakehi, Yugo Takeuchi, Michio Okada. 2413-2416 [doi]

Everyday life sounds and speech analysis for a medical telemonitoring systemEric Castelli, Dan Istrate. 2417-2420 [doi]

Speaking while driving - preliminary results on spellings in the German speechdat-car databaseChristoph Draxler, Klaus Bengler, Cristina Olaverri-Monreal. 2421-2424 [doi]

Efficient periodicity extraction based on sine-wave representation and its application to pitch determination of speech signalsDan Chazan, Meir (Zibulski) Tzur, Ron Hoory, Gilad Cohen. 2427-2430 [doi]

Viseme recognition using multiple feature matchingIslam Shdaifat, Rolf-Rainer Grigat, Stefan Lütgert. 2431-2434 [doi]

The fundamental frequency of cough by autocorrelation analysisAnnemie Van Hirtum, Daniel Berckmans. 2435-2438 [doi]

A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequencyYuichi Ishimoto, Masashi Unoki, Masato Akagi. 2439-2442 [doi]

Robust LP analysis using glottal source HMM with application to high-pitched and noise corrupted speechAkira Sasou, Kazuyo Tanaka. 2443-2446 [doi]

Fast harmonic estimation using a low resolution pitch for low bit rate harmonic codingYong-Soo Choi, Dae Hee Youn. 2447-2450 [doi]

Comparative evaluation of F0 estimation algorithmsAlain de Cheveigné, Hideki Kawahara. 2451-2454 [doi]

Identification of accent and intonation in sentences for CALL systemsCarlos Toshinori Ishi, Nobuaki Minematsu, Ryuji Nishide, Keikichi Hirose. 2455-2458 [doi]

Systematic F0 glitches around nasal-vowel transitionsHideki Kawahara, Parham Zolfaghari. 2459-2462 [doi]

Using aerial and geometric features in automatic lip-readingJacek C. Wojdel, Léon J. M. Rothkrantz. 2463-2466 [doi]

Inverse filtering of tube models with frequency dependent tube terminationsKarl Schnell, Arild Lacroix. 2467-2470 [doi]

Formant estimation using gammachirp filterbankKaïs Ouni, Zied Lachiri, Noureddine Ellouze. 2471-2474 [doi]

Autoregressive time-frequency interpolation in the context of missing data theory for impulsive noise compensationIlyas Potamitis, Nikos Fakotakis. 2475-2478 [doi]

Analysis of the voiced speech using the generalized fourier transform with quadratic phaseD. Petrinovic, Vladimir Cuperman. 2479-2482 [doi]

From here to utility - melding phonetic insight with speech technologySteven Greenberg. 2485-2488 [doi]

Speech quality measure for voIP using wavelet based bark coherence functionSang Wook Park, Young-Cheol Park, Dae Hee Youn. 2491-2494 [doi]

A proposed method for measuring language dependency of narrow band voice codersSander J. van Wijngaarden, Herman J. M. Steeneken. 2495-2498 [doi]

An efficient transcoding algorithm for g.723.1 and g.729a speech codersSung-Wan Yoon, Sung-Kyo Jung, Young-Cheol Park, Dae Hee Youn. 2499-2502 [doi]

Joint source-channel coding for low bit-rate coding of LSP parametersJosé L. Pérez-Córdoba, Antonio J. Rubio, Antonio M. Peinado, Ángel de la Torre. 2503-2506 [doi]

A segmental mixture model for speaker recognitionRobert P. Stapert, John S. D. Mason. 2509-2512 [doi]

Tree based score computation for speaker verificationRaphaël Blouet, Frédéric Bimbot. 2513-2516 [doi]

Phonetic speaker recognitionWalter D. Andrews, Mary A. Kohler, Joseph P. Campbell. 2517-2520 [doi]

Speaker recognition based on idiolectal differences between speakersGeorge R. Doddington. 2521-2524 [doi]

An investigation of modelling aspects for ratedependent speech recognitionBritta Wrede, Gernot A. Fink, Gerhard Sagerer. 2527-2530 [doi]

Speaking rate dependent acoustic modeling for spontaneous lecture speech recognitionHiroaki Nanjo, Kazuomi Kato, Tatsuya Kawahara. 2531-2534 [doi]

Analysis of n-best output hypotheses for fast speech in large vocabulary continuous speech recognitionTibor Fábián, Thilo Pfau, Günther Ruske. 2535-2538 [doi]

Automatic rhythm modeling for language identificationJérôme Farinas, François Pellegrino. 2539-2542 [doi]

Confidence measure (CM) estimation for large vocabulary speaker-independent continuous speech recognition systemYaxin Zhang, Raymond Lee, Anton Madievski. 2545-2548 [doi]

Experimental evaluation on confidence of agreement among multiple Japanese LVCSR modelsYasuhiro Kodama, Takehito Utsuro, Hiromitsu Nishizaki, Seiichi Nakagawa. 2549-2552 [doi]

Detection of recognition errors and out of the spelling dictionary names in a spelled name recognizer for SpanishRubén San Segundo, Javier Macías Guarasa, Javier Ferreiros, P. Martín, Juan Manuel Pardo. 2553-2556 [doi]

Use of acoustic prior information for confidence measure in ASR applicationsErhan Mengusoglu, Christophe Ris. 2557-2560 [doi]

Improving performance of a keyword spotting system by using a new confidence measureLuciana Ferrer, Claudio Estienne. 2561-2564 [doi]

Word level confidence measures using n-best sub-hypotheses likelihood ratioBeng Tiong Tan, Yong Gu, Trevor Thomas. 2565-2568 [doi]

Confidence based lattice segmentation and minimum Bayes-risk decodingVaibhava Goel, Shankar Kumar, William Byrne. 2569-2572 [doi]

A data selection strategy for utterance verification in continuous speech recognitionHui Jiang, Frank K. Soong, Chin-Hui Lee. 2573-2576 [doi]

Improved speech recognition using iterative decoding based on confidence measuresJun Ogata, Yasuo Ariki. 2577-2580 [doi]

Detection of OOV words using generalized word models and a semantic class language modelThomas Schaaf. 2581-2584 [doi]

Effects of OOV rates on keyphrase rejection schemesGies Bouwman, Janienke Sturm, Lou Boves. 2585-2588 [doi]

A new auditory based microphone array and objective evaluation using e-RASTIJ. L. Sánchez-Bote, Joaquin Gonzalez-Rodriguez, Danilo Simon-Zorita. 2591-2594 [doi]

Equivalence between frequency domain blind source separation and frequency domain adaptive null beamformersShoko Araki, Shoji Makino, Ryo Mukai, Hiroshi Saruwatari. 2595-2598 [doi]

Separation and dereverberation performance of frequency domain blind source separation for speech in a reverberant environmentRyo Mukai, Shoko Araki, Shoji Makino. 2599-2602 [doi]

Blind source separation for speech based on fast-convergence algorithm with ICA and beamformingHiroshi Saruwatari, Toshiya Kawamura, Kiyohiro Shikano. 2603-2606 [doi]

Noise reduction using paired-microphones for both far-field and near-field sound sourcesMitsunori Mizumachi, Satoshi Nakamura. 2607-2610 [doi]

Statistical sound source identification in a real acoustic environment for robust speech recognition using a microphone arrayTakanobu Nishiura, Satoshi Nakamura, Kiyohiro Shikano. 2611-2614 [doi]

Speech enhancement and source separation based on binaural negative beamformingAgustín Álvarez Marquina, Pedro Gómez Vilda, Rafael Martínez-Olalla, Victor Nieto Lluis, María Victoria Rodellar Biarge. 2615-2618 [doi]

Multiple source separation in the frequency domain using negative beamformingPedro Gómez Vilda, Agustín Álvarez Marquina, Victor Nieto Lluis, María Victoria Rodellar Biarge, Rafael Martínez-Olalla. 2619-2622 [doi]

Planar superdirective microphone arrays for speech acquisition in the carRainer Martin, Alexey Petrovsky, Thomas Lotter. 2623-2626 [doi]

Is speech data clustered? - statistical analysis of cepstral featuresTomi Kinnunen, Ismo Kärkkäinen, Pasi Fränti. 2627-2630 [doi]

Maximum likelihood adaptation for distant speech recognition of stationary and moving speakers in reverberant environmentsGeorge Nokas, Evangelos Dermatas, George K. Kokkinakis. 2631-2634 [doi]

Model-based blind estimation of reverberation time: application to robust ASR in reverberant environmentsLaurent Couvreur, Christophe Ris, Christophe Couvreur. 2635-2638 [doi]

Using the modulation complex wavelet transform for feature extraction in automatic speech recognitionYasunori Momomura, Kenji Okada, Takayuki Arai, Noboru Kanedera, Yuji Murahara. 2639-2642 [doi]

Separating three simultaneous speeches with two microphones by integrating auditory and visual processingHiroshi G. Okuno, Kazuhiro Nakadai, Tino Lourens, Hiroaki Kitano. 2643-2646 [doi]

A time-varying complex AR speech analysis based on GLS and ELS methodKeiichi Funaki. 2649-2652 [doi]

Vocal tract normalization equals linear transformation in cepstral spaceMichael Pitz, Sirko Molau, Ralf Schlüter, Hermann Ney. 2653-2656 [doi]

An algorithm for finding line spectrum frequencies of added speech signals and its application to robust speech recognitionAn-Tze Yu, Hsiao-Chuan Wang. 2657-2660 [doi]

Improved entropic gain for speech signals analysis/synthesis based on an adaptive time-frequency segmentation schemeGilles Gonon, Silvio Montrésor, Marc Baudry. 2661-2664 [doi]

Automatic word acquisition from continuous speechHelmut Lucke, Masanori Omote. 2667-2670 [doi]

Why is automatic recognition of children s speech difficult?Qun Li, Martin J. Russell. 2671-2674 [doi]

Politeness and frustration language in child-machine interactionsSudha Arunachalam, Dylan Gould, Elaine Andersen, Dani Byrd, Shrikanth Narayanan. 2675-2678 [doi]

Speech emotion recognition using hidden Markov modelsAlbino Nogueiras, Asunción Moreno, Antonio Bonafonte, José B. Mariño. 2679-2682 [doi]

Speech enhanced remote control for media terminalAseel Ibrahim, Jonas Lundberg, Jenny Johansson. 2685-2688 [doi]

The development of a portuguese version of a media watch systemRui Amaral, Thibault Langlois, Hugo Meinedo, João Paulo Neto, Nuno Souto, Isabel Trancoso. 2689-2692 [doi]

Classification of video genre using audioMatthew Roach, John S. D. Mason. 2693-2696 [doi]

Prosody in finger braille and teletext receiver for finger brailleYasuo Horiuchi, Akira Ichikawa. 2697-2702 [doi]

Joint channel decoding - Viterbi recognition for wireless applicationsAlexis Bernard, Abeer Alwan. 2703-2706 [doi]

MMSE-based channel error mitigation for distributed speech recognitionAntonio M. Peinado, Victoria E. Sánchez, José C. Segura, José L. Pérez-Córdoba. 2707-2710 [doi]

Distributed speech recognition using traditional and hybrid modeling techniquesJan Stadermann, R. Meermeier, Gerhard Rigoll. 2711-2714 [doi]

Graceful degradation of speech recognition performance over lossy packet networksEve A. Riskin, Constantinos Boulis, Scott Otterson, Mari Ostendorf. 2715-2718 [doi]

Experiments on cross-language acoustic modelingTanja Schultz, Alex Waibel. 2721-2724 [doi]

Crosslingual speech recognition with multilingual acoustic models based on agglomerative and tree-based triphone clusteringAndrej Zgank, Bojan Imperl, Finn Tore Johansen, Zdravko Kacic, Bogomir Horvat. 2725-2729 [doi]

Comparing parameter tying methods for multilingual acoustic modellingMikko Harju, Petri Salmela, Jussi Leppänen, Olli Viikki, Jukka Saarinen. 2729-2732 [doi]

Accent-independent universal HMM-based speech recognizer for american, australian and british EnglishRathinavelu Chengalvarayan. 2733-2736 [doi]

The effect of time stress on automatic speech recognition accuracy when using second languageFang Chen, Jonas Sääv. 2737-2740 [doi]

The effect of pitch and lexical tone on different Mandarin speech recognition tasksYiu Wing Wong, Eric Chang. 2741-2744 [doi]

Acoustic modeling of foreign words in a German speech recognition systemGeorg Stemmer, Elmar Nöth, Heinrich Niemann. 2745-2748 [doi]

Semi-automatic grammar induction for bi-directional English-Chinese machine translationKai-Chung Siu, Helen M. Meng. 2749-2752 [doi]

F0 feature extraction by polynomial regression function for monosyllabic Thai tone recognitionPatavee Charnvivit, Somchai Jitapunkul, Visarut Ahkuputra, Ekkarit Maneenoi, Umavasee Thathong, Boonchai Thampanitchawong. 2753-2756 [doi]

The use of prosody in a combined system for punctuation generation and speech recognitionJi-Hwan Kim, Philip C. Woodland. 2757-2760 [doi]

Lexical stress modeling for improved speech recognition of spontaneous telephone speech in the jupiter domainChao Wang, Stephanie Seneff. 2761-2765 [doi]

Modeling auxiliary information in Bayesian network based ASRTodd A. Stephenson, M. Mathew, Hervé Bourlard. 2765-2768 [doi]

A new dynamic HMM model for speech recognitionFeili Chen, Eric Chang. 2769-2772 [doi]

Multi-keyword spotting of telephone speech using orthogonal transform-based SBR and RNN prosodic modelWern-Jun Wang, Chun-Jen Lee, Eng-Fong Huang, Sin-Horng Chen. 2773-2776 [doi]

Recognition of slovenian speech: within and cross-language experiments on monophones using the speechdat(II)Andrej Iskra, Bojan Petek, Tom Brøndsted. 2777-2780 [doi]

Boiling down prosody for the classification of boundaries and accents in German and EnglishAnton Batliner, Jan Buckow, Richard Huber, Volker Warnke, Elmar Nöth, Heinrich Niemann. 2781-2784 [doi]

Javaspeakerrecognition - interactive workbench for visualizing speaker recognition concepts on the WWWAndrzej Drygajlo, Gary Garcia Molina. 2787-2790 [doi]

Prototype of a vocal-tract model for vowel production designed for education in speech scienceTakayuki Arai, Nobuyuki Usuki, Yuji Murahara. 2791-2794 [doi]

A tool for automatic feedback on phonemic transcriptionMartin Cooke, Maria Luisa Garcia Lecumberri, John A. Maidment. 2795-2798 [doi]

Speech lab in a box: a Mandarin speech toolbox to jumpstart speech related researchEric Chang, Yu Shi, Jian-Lai Zhou, Chao Huang. 2799-2802 [doi]

Relating phonepass scores overall scores to the council of europe framework level descriptorsJohn H. A. L. de Jong, Jared Bernstein. 2803-2806 [doi]

A multilingual, multimodal, speech training system, SPECOKlára Vicsi, Peter Roach, Anne-Marie Öster, Zdravko Kacic, F. Csatári, A. Sfakianaki, R. Veronik, Géza Gordos. 2807-2810 [doi]

Instantaneous estimation of accentuation habits for Japanese students to learn English pronunciationNaoki Nakamura, Nobuaki Minematsu, Seiichi Nakagawa. 2811-2814 [doi]

Automatic construction of CALL system from TV news program with captionsTakashi Tanaka, Kazumasa Mori, Satoshi Kobayashi, Seiichi Nakagawa. 2815-2818 [doi]

Pitch-dependent GMMs for text-independent speaker recognition systemsMijail Arcienega, Andrzej Drygajlo. 2821-2825 [doi]

Towards combining pitch and MFCC for speaker recognition systemsHassan Ezzaidi, Jean Rouat, Douglas D. O Shaughnessy. 2825-2828 [doi]

Formant-broadened CMS using peak-picking in LOG spectrumYu-Jin Kim, Hea-Kyoung Jung, Jae-Ho Chung. 2829-2832 [doi]

Improvements in the speaker identification rate using feature-setsDaniel J. Mashao, N. Tinyiko Baloyi. 2833-2836 [doi]

Minimum classification error training for speaker identification using Gaussian mixture models based on multi-space probability distributionChiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura. 2837-2840 [doi]

Speaker recognition based on feature space traceYadong Wu, Zhizhu Li. 2841-2844 [doi]

Additive and convolutional noise canceling in speaker verification using a stochastic weighted viterbi algorithmNéstor Becerra Yoma, Miguel Villar Fernandez. 2845-2848 [doi]

A multi-SNR subband model for speaker identification under noisy environmentsKenichi Yoshida, Kazuyuki Takagi, Kazuhiko Ozeki. 2849-2852 [doi]

External Links

Cite Key

Statistics

PDF

Researchr

EUROSPEECH 2001 Scandinavia, 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, Aalborg, Denmark, September 3-7, 2001

Abstract

Table of Contents