The 4th International Conference on Spoken Language Processing, Philadelphia, PA, USA, October 3-6, 1996

researchr

You are not signed in
Sign in
Sign up

The 4th International Conference on Spoken Language Processing, Philadelphia, PA, USA, October 3-6, 1996. ISCA, 1996.

Conference: interspeech1996

Abstract is missing.

Learning non-native vowel categoriesJohn Kingston, Christine Bartels, José Benkí, Deanna Moore, Jeremy Rice, Rachel Thorburn, Neil Macmillan. [doi]

The comparative study of spoken-language processingAnne Cutler. 1 [doi]

New developments in the INRS continuous speech recognition systemZ. Li, Michel Héon, Douglas D. O Shaughnessy. 2-5 [doi]

On designing pronunciation lexicons for large vocabulary, continuous speech recognitionLori Lamel, Gilles Adda. 6-9 [doi]

Word graph rescoring using confidence measuresPablo Fetter, Frédéric Dandurand, Peter Regel-Brietzmann. 10-13 [doi]

A bottom-up approach for handling unseen triphones in large vocabulary continuous speech recognitionXavier L. Aubert, Peter Beyerlein, Meinhard Ullrich. 14-17 [doi]

Discriminative optimisation of large vocabulary recognition systemsV. Valtchev, Philip C. Woodland, Steve J. Young. 18-21 [doi]

Japanese large-vocabulary continuous-speech recognition using a business-newspaper corpusTatsuo Matsuoka, Katsutoshi Ohtsuki, Takeshi Mori, Sadaoki Furui, Katsuhiko Shirai. 22-25 [doi]

Handling compound nouns in a Swedish speech-understanding systemDavid M. Carter, Jaan Kaja, Leonardo Neumeyer, Manny Rayner, Fuliang Weng, Mats Wirén. 26-29 [doi]

Initial evaluation of a preselection module for a flexible large vocabulary speech recognition system inJavier Macías Guarasa, Ascensión Gallardo-Antolín, Javier Ferreiros, José Manuel Pardo, Luis Villarrubia Grande. 30-33 [doi]

Asynchronous integration of visual information in an automatic speech recognition systemMamoun Alissali, Paul Deléglise, Alexandrina Rogozan. 34-37 [doi]

Audiovisual speech recognition using multiscale nonlinear image decompositionIain A. Matthews, J. Andrew Bangham, S. J. Cox. 38-41 [doi]

Robust audiovisual integration using semicontinuous hidden Markov modelsQin Su, Peter L. Silsbee. 42-45 [doi]

The effect of visual information on word initial consonant perception of dysarthric speechRichard P. Schumeyer, Kenneth E. Barner. 46-49 [doi]

A multiple deformable template approach for visual speech recognitionDevi Chandramohan, Peter L. Silsbee. 50-53 [doi]

Speaker independent bimodal phonetic recognition experimentsPiero Cosi, Emanuela Magno Caldognetto, Franco Ferrero, M. Dugatto, Kyriaki Vagges. 54-57 [doi]

Speechreading using shape and intensity informationJuergen Luettin, Neil A. Thacker, Steve W. Beet. 58-61 [doi]

Speaker identification by lipreadingJuergen Luettin, Neil A. Thacker, Steve W. Beet. 62-65 [doi]

How word onsets drive lexical access and segmentation: evidence from acoustics, phonology and processingDavid W. Gow Jr., Janis Melvold, Sharon Manuel. 66-69 [doi]

RAW: a real-speech model for human word recognitionDavid van Kuijk, Peter Wittenburg, Ton Dijkstra. 70-73 [doi]

How facilitatory can lexical information be during word recognition? evidence from moroccan arabicMehdi Meftah, Sami Boudelaa. 74-77 [doi]

Phonotactic and metrical influences on adult ratings of spoken nonsense wordsMichael S. Vitevitch, Paul A. Luce, Jan Charles-Luce, David Kemmerer. 82-85 [doi]

Lipreading supplemented by voice fundamental frequency: to what extent does the addition of voicing increase lexical uniqueness for the lipreader?Edward T. Auer, Lynne E. Bernstein. 86-89 [doi]

Strategies used in rhyme-monitoringSaskia te Riele, Sieb G. Nooteboom, Hugo Quené. 90-93 [doi]

How do dutch listeners process words with epenthetic schwa?Wilma van Donselaar, Cecile T. L. Kuijpers, Anne Cutler. 94-97 [doi]

Whole-word phonetic distances and the PGPfone alphabetPatrick Juola, Philip Zimmermann. 98-101 [doi]

Automatic vowel quality description using a variable mapping to an eight cardinal vowel reference setShuping Ran, J. Bruce Millar, Phil Rose. 102-105 [doi]

Automatic detection and segmentation of pronunciation variants in German speech corporaAndreas Kipp, Maria-Barbara Wesenick, Florian Schiel. 106-109 [doi]

ANGIE: a new framework for speech analysis based on morpho-phonological modellingStephanie Seneff, Raymond Lau, Helen M. Meng. 110-113 [doi]

On phonetic characteristics of pause in the Korean read speechYong-Ju Lee, Sook-Hyang Lee. 118-120 [doi]

Cross-language effects of lexical stress in word recognition: the case of Arabic English bilingualsSami Boudelaa, Mehdi Meftah. 121-124 [doi]

Automatic generation of German pronunciation variantsMaria-Barbara Wesenick. 125-128 [doi]

Estimating the quality of phonetic transcriptions and segmentations of speech signalsMaria-Barbara Wesenick, Andreas Kipp. 129-132 [doi]

An acoustic analysis of contemporary vowels of the standard slovenian languageBojan Petek, Rastislav Sustarsic, Smiljana Komar. 133-136 [doi]

Using decision trees to construct optimal acoustic cuesSandrine Robbe, Anne Bonneau, Sylvie Coste, Yves Laprie. 137-140 [doi]

Maximum jaw displacement in contrastive emphasisDonna Erickson, Osamu Fujimura. 141-144 [doi]

Subglottal pressure and final lowering in EnglishRebecca Herman, Mary E. Beckman, Kiyoshi Honda. 145-148 [doi]

Phonological variation: epenthesis and deletion of schwa in DutchCecile T. L. Kuijpers, Wilma van Donselaar, Anne Cutler. 149-152 [doi]

Feedback considerations for speech training systemsJames J. Mahshie. 153-156 [doi]

Clinical applications of computer-based speech training for children with hearing impairmentAnne-Marie Öster. 157-160 [doi]

Enhancing information-rich regions of natural VCV and sentence materials presented in noiseValérie Hazan, Andrew Simpson. 161-164 [doi]

Speech perceptual abilities of children with specific reading difficulty (dyslexia)Valérie Hazan, Alan Adlard. 165-168 [doi]

Bimodal perception of spectrum compressed speechLarry D. Paarmann, Michael K. Wynne. 169-172 [doi]

Applications of automatic speech recognition to speech and language development in young childrenMartin Russell, Catherine Brown, Adrian Skilling, Robert W. Series, Julie L. Wallace, Bill Bohnam, Paul Barker. 176-179 [doi]

Sub-band adaptive speech enhancement for hearing aidsD. R. Campbell. 180-183 [doi]

Adapting a TTS system to a reading machine for the blindThomas Portele, Jürgen Krämer. 184-187 [doi]

Modeling of spoken dialogue with and without visual informationKatsuhiko Shirai. 188-191 [doi]

Multimodal discourse modelling in a multi-user multi-domain environmentStephanie Seneff, David Goddeau, Christine Pao, Joseph Polifroni. 192-195 [doi]

Automatic acquisition of probabilistic dialogue modelsKenji Kita, Yoshikazu Fukui, Masaaki Nagata, Tsuyoshi Morimoto. 196-199 [doi]

Units of dialogue management: an examplePaul Heisterkamp, Scott McGlashan. 200-203 [doi]

Error resolution during multimodal human-computer interactionSharon L. Oviatt, Robert VanGent. 204-207 [doi]

Improved spontaneous dialogue recognition using dialogue and utterance triggers by adaptive probability boostingRamesh R. Sarukkai, Dana H. Ballard. 208-211 [doi]

Speech recognition for spontaneously spoken German dialoguesKai Hbener, Uwe Jost, Henrik Heine. 212-215 [doi]

Using prosodic information to constrain language models for spoken dialoguePaul Taylor, Hiroshi Shimodaira, Stephen Isard, Simon King, Jacqueline C. Kowtko. 216-219 [doi]

Combination of word-based and category-based language modelsThomas Niesler, Philip C. Woodland. 220-223 [doi]

A multi-level lexical-semantics based language model design for guided integrated continuous speech recognitionFrancisco J. Valverde-Albacete, José Manuel Pardo. 224-227 [doi]

A category based approach for recognition of out-of-vocabulary wordsFlorian Gallwitz, Elmar Nöth, Heinrich Niemann. 228-231 [doi]

Scalable backoff language modelsKristie Seymore, Ronald Rosenfeld. 232-235 [doi]

Modeling long distance dependence in language: topic mixtures vs. dynamic cache modelsRukmini Iyer, Mari Ostendorf. 236-239 [doi]

Bayesian estimation methods for n-gram language model adaptationMarcello Federico. 240-243 [doi]

Feature dimension reduction using reduced-rank maximum likelihood estimation for hidden Markov modelsDon X. Sun. 244-247 [doi]

Using multi-level segmentation coefficients to improve HMM speech recognitionKai Hbener. 248-251 [doi]

A comparative study of linear feature transformation techniques for automatic speech recognitionThomas Eisele, Reinhold Haeb-Umbach, Detlev Langmann. 252-255 [doi]

Inclusion of temporal information into features for speech recognitionBen Milner. 256-259 [doi]

New cepstral representation using wavelet analysis and spectral transformation for robust speech recognitionHubert Wassner, Gérard Chollet. 260-263 [doi]

Extraction of tongue contours in x-ray images with minimal user interactionYves Laprie, Marie-Odile Berger. 268-271 [doi]

Three-dimensional measurement of the vocal tract by MRIDidier Demolin, Thierry Metens, Alain Soquet. 272-275 [doi]

Syllable affiliation of final consonant clusters undergoes a phase transition over speaking ratesPhilip Gleason, Betty Tuller, J. A. Scott Kelso. 276-278 [doi]

Towards a biomechanical model of the larynxArthur Lobo, Michael H. O Malley. 279-282 [doi]

Generating intonation by superposing gesturesYann Morlec, Gérard Bailly, Véronique Aubergé. 283-286 [doi]

Effects of auditory feedback on F0 trajectory generationHideki Kawahara, Hiroko Kato, J. C. Williams. 287-290 [doi]

On the effects of accent and language on low rate speech codersIan S. Burnett, John J. Parry. 291-294 [doi]

VQ codevector index assignment using genetic algorithms for noisy channelsJeng-Shyang Pan, Fergus R. McInnes, Mervyn A. Jack. 295-298 [doi]

An improved vector quantization algorithm for speech transmission over noisy channelsGavin C. Cawley. 299-301 [doi]

Very low delay and high quality coding of 20 hz-15 khz speech signals at 64 kbit/sC. Murgia, Gang Feng, Alain Le Guyader, Catherine Quinquis. 302-305 [doi]

Application of speaker modification techniques to phonetic vocodingCarlos M. Ribeiro, Isabel Trancoso. 306-309 [doi]

Entropy coded vector quantization with hidden Markov modelsTadashi Yonezaki, Kiyohiro Shikano. 310-313 [doi]

An application of recurrent neural networks to low bit rate speech codingMinoru Kohata. 314-317 [doi]

CELP coding system based on mel-generalized cepstral analysisKazuhito Koishida, Keiichi Tokuda, Takao Kobayashi, Satoshi Imai. 318-321 [doi]

Wideband re-synthesis of narrowband CELP-coded speech using multiband excitation modelCheung-fat Chan, Wai-Kwong Hui. 322-325 [doi]

Recurrent neural networks for phoneme recognitionTakuya Koizumi, Mikio Mori, Shuji Taniguchi, Mitsutoshi Maruya. 326-329 [doi]

A model for the acoustic phonetic structure of arabic language using a single ergodic hidden Markov modelM. A. Mokhtar, A. Zein-el-Abddin. 330-333 [doi]

Modelling long term variability information in mixture stochastic trajectory frameworkYifan Gong, Irina Illina, Jean-Paul Haton. 334-337 [doi]

Segmental phonetic features recognition by means of neural-fuzzy networks and integration in an n-best solutions post-processingThierry Moudenc, Robert Sokol, Guy Mercier. 338-341 [doi]

Stochastic trajectory model with state-mixture for continuous speech recognitionIrina Illina, Yifan Gong. 342-345 [doi]

Recognition of spelled names over the telephoneHermann Hild, Alex Waibel. 346-349 [doi]

Optimal tying of HMM mixture densities using decision treesGilles Boulianne, Patrick Kenny. 350-353 [doi]

Speech recognition using an enhanced FVQ based on a codeword dependent distribution normalization and codeword weighting by fuzzy objective functionHwan Jin Choi, Yung-Hwan Oh. 354-357 [doi]

Using the self-organizing map to speed up the probability density estimation for speech recognition with mixture density HMMsMikko Kurimo, Panu Somervuo. 358-361 [doi]

Combining the detection and correction of speech repairsPeter A. Heeman, Kyung-ho Loken-Kim, James F. Allen. 362-365 [doi]

Generating spontaneous elliptical utteranceYuji Sagawa, Wataru Sugimoto, Noboru Ohnishi. 366-369 [doi]

Developing the modelling of Swedish prosody in spontaneous dialogueGösta Bruce, Marcus Filipsson, Johan Frid, Björn Granström, Kjell Gustafson, Merle Horne, David House, Birgitta Lastow, Paul Touati. 370-373 [doi]

Spoken language generation in a multimedia systemShimei Pan, Kathleen McKeown. 374-377 [doi]

Synthesizing dialogue speech of Japanese based on the quantitative analysis of prosodic featuresKeikichi Hirose, Mayumi Sakata, Hiromichi Kawanami. 378-381 [doi]

Spoken dialogue interface in a dual task situationShuichi Tanaka, Shu Nakazato, Keiichiro Hoashi, Katsuhiko Shirai. 382-385 [doi]

Modeling disfluencies in conversational speechMan-Hung Siu, Mari Ostendorf. 386-389 [doi]

Evaluation of a language model using a clustered model backoffJohn Miller, Fil Alleva. 390-393 [doi]

Language modeling using x-gramsAntonio Bonafonte, José B. Mariño. 394-397 [doi]

Class phrase models for language modellingKlaus Ries, Finn Dag Buø, Alex Waibel. 398-401 [doi]

Introducing linguistic constraints into statistical language modelingPetra Geutner. 402-405 [doi]

Language modeling with stochastic automataJianying Hu, William Turin, Michael K. Brown. 406-409 [doi]

New fast wavelet packet transform algorithms for frame synchronized speech processingAndrzej Drygajlo. 410-413 [doi]

Frequency-warping in speechS. Umesh, L. Cohen, N. Marinovic, D. Nelson. 414-417 [doi]

Extracting speech features from human speech-like noiseDaisuke Kobayashi, Shoji Kajita, Kazuya Takeda, Fumitada Itakura. 418-421 [doi]

Subband-crosscorrelation analysis for robust speech recognitionShoji Kajita, Kazuya Takeda, Fumitada Itakura. 422-425 [doi]

A new ASR approach based on independent processing and recombination of partial frequency bandsHervé Bourlard, Stéphane Dupont. 426-429 [doi]

Frequency and time filtering of filter-bank energies for HMM speech recognitionCliment Nadeu, José B. Mariño, Javier Hernando, Albino Nogueiras. 430-433 [doi]

Temporal cues for vowels and universals of vowel inventoriesCarrie E. Lang, John J. Ohala. 434-437 [doi]

Acoustic variability in spontaneous conversational speech of american English talkersAnn K. Syrdal. 438-441 [doi]

Cross-language speech perception: Swedish, English, and Spanish speakers perception of front rounded vowelsRaquel Willerman, Patricia K. Kuhl. 442-445 [doi]

Inter-language vowel perception and production by Korean and Japanese listenersJohn C. L. Ingram, See-Gyoon Park. 446-449 [doi]

Intelligibility and acoustic correlates of Japanese accented English vowelsDiane Kewley-Port, Reiko Akahane-Yamada, Kiyoaki Aikawa. 450-453 [doi]

Segmentation strategies for spoken language recognition: evidence from semi-bilingual Japanese speakers of EnglishKiyoko Yoneyama. 454-457 [doi]

Integrating connectionist, statistical and symbolic approaches for continuous spoken Korean processingGeunbae Lee, Jong-Hyeok Lee, Kyubong Park, Byung-Chang Kim. 458-461 [doi]

Towards ASR on partially corrupted speechHynek Hermansky, Sangita Timberwala, Misha Pavel. 462-465 [doi]

Parametric trajectory models for speech recognitionHerbert Gish, Kenney Ng. 466-469 [doi]

Use of Gaussian selection in large vocabulary continuous speech recognition using HMMsKate Knill, M. J. F. Gales, Steve J. Young. 470-473 [doi]

Cross phone state clustering using lexical stress and contextJ. Hogberg, Kåre Sjölander. 474-477 [doi]

Likelihood ratio decoding and confidence measures for continuous speech recognitionEduardo Lleida-Solano, Richard C. Rose. 478-481 [doi]

A study on continuous Chinese speech recognition based on stochastic trajectory modelsXiaohui Ma, Yifan Gong, Yuqing Fu, Jiren Lu, Jean-Paul Haton. 482-485 [doi]

A proposal for a new algorithm of reference interval-free continuous DP for real-time speech or text retrievalYoshiaki Itoh, Jiro Kiyama, Hiroshi Kojima, Susumu Seki, Ryuichi Oka. 486-489 [doi]

Language modeling by string pattern n-gram for Japanese speech recognitionAkinori Ito, Masaki Kohda. 490-493 [doi]

Statistical language modeling using a variable context lengthReinhard Kneser. 494-497 [doi]

A comparison of hybrid HMM architectures using global discriminative trainingFinn Tore Johansen. 498-501 [doi]

Improved probability estimation with neural network modelsWei Wei, Etienne Barnard, Mark A. Fanty. 502-505 [doi]

A neural network using acoustic sub-word units for continuous speech recognitionHa-Jin Yu, Yung-Hwan Oh. 506-509 [doi]

On the error criteria in neural networks as a tool for human classification modellingLouis ten Bosch, Roel Smits. 510-513 [doi]

A non-linear filtering approach to stochastic training of the articulatory-acoustic mapping using the EM algorithmGordon Ramsay. 514-517 [doi]

A tool for automated design of language modelsY. P. Yang, John R. Deller Jr.. 518-521 [doi]

Acoustic-phonetic decoding based on elman predictive neural networksFelix Freitag, Enric Monte. 522-525 [doi]

On improving discrimination capability of an RNN based recognizerTan Lee, P. C. Ching. 526-529 [doi]

An evaluation of statistical language modeling for speech recognition using a mixed category of both words and parts-of-speechYumi Wakita, Jun Kawai, Hitoshi Iida. 530-533 [doi]

A dialogue control strategy based on the reliability of speech recognitionYasuhisa Niimi, Yutaka Kobayashi. 534-537 [doi]

Speechwear: a mobile speech systemAlexander I. Rudnicky, Stephen Reed, Eric H. Thayer. 538-541 [doi]

WHEELS: a conversational system in the automobile classifieds domainHelen M. Meng, Senis Busayapongchai, James R. Glass, David Goddeau, I. Lee Hetherington, Edward Hurley, Christine Pao, Joseph Polifroni, Stephanie Seneff, Victor Zue. 542-545 [doi]

Effective human-computer cooperative spoken dialogue: the AGS demonstratorM. David Sadek, A. Ferrieux, A. Cozannet, Philippe Bretier, Franck Panaget, J. Simonin. 546-549 [doi]

Dialog in the RAILTEL telephone-based systemSamir Bennacef, Laurence Devillers, Sophie Rosset, Lori Lamel. 550-553 [doi]

Dialogue processing in a conversational speech translation systemAlon Lavie, Lori S. Levin, Yan Qu, Alex Waibel, Donna Gates, Marsal Gavaldà, Laura Mayfield, Maite Taboada. 554-557 [doi]

Novel speech processing mechanism derived from auditory neocortical circuit analysisBoris Aleksandrovsky, James Whitson, Gretchen Andes, Gary Lynch, Richard Granger. 558-561 [doi]

Modeling neurons in the anteroventral cochlear nucleus for amplitude modulation (AM) processing: application to speech soundPing Tang, Jean Rouat. 562-565 [doi]

Noise suppression and loudness normalization in an auditory model-based acoustic front-endHalewijn Vereecken, Jean-Pierre Martens. 566-569 [doi]

A psychoacoustic model for the noise masking of voiceless plosive burstsJames J. Hant, Brian Strope, Abeer Alwan. 570-573 [doi]

Training machine classifiers to match the performance of human listeners in a natural vowel classification taskMartin Hunke, Thomas Holton. 574-577 [doi]

A neural matrix model for active tracking of frequency-modulated tonesKiyoaki Aikawa, Hideki Kawahara, Minoru Tsuzaki. 578-581 [doi]

A user-configurable system for voice label recognitionRichard C. Rose, Eduardo Lleida-Solano, G. W. Erhart, R. V. Grubbe. 582-585 [doi]

Keyword spotting enhancement for video soundtrack indexingPhilippe Gelin, Christian Wellekens. 586-589 [doi]

New efficient fillers for unlimited word recognition and keyword spottingRachida El Méliani, Douglas D. O Shaughnessy. 590-593 [doi]

Automatic transcription of general audio data: preliminary analysesMichelle S. Spina, Victor Zue. 594-597 [doi]

Transcribing radio newsFrancis Kubala, Tasos Anastasakos, Hubert Jin, Long Nguyen, Richard M. Schwartz. 598-601 [doi]

Correcting recognition errors via discriminative utterance verificationAnand R. Setlur, Rafid A. Sukkar, John Jacob. 602-605 [doi]

Does training in speech perception modify speech production?Reiko Akahane-Yamada, Yoh ichi Tohkura, Ann R. Bradlow, David B. Pisoni. 606-609 [doi]

Phrase-final lengthening and stress-timed shortening in the speech of native speakers and Japanese learners of EnglishMotoko Ueyama. 610-613 [doi]

Devoicing of Japanese vowels by taiwanese learners of JapaneseJ. Kevin Varden, Tsutomu Sato. 618-621 [doi]

Fluency and use of segmental dialect features in the acquisition of a second language (French) by English speakersDanièle Archambault, Catherine Foucher, Blagovesta Maneva. 622-625 [doi]

Estimating child and adolescent formant frequency values from adult dataP. Martland, Sandra P. Whiteside, Steve W. Beet, Ladan Baghai-Ravary. 626-629 [doi]

Acoustic correlates of linguistic stress and accent in dutch and american EnglishAgaath M. C. Sluijter, Vincent J. van Heuven. 630-633 [doi]

On the levels of accentuation in spoken JapaneseHiroya Fujisaki, Sumio Ohno, Osamu Tomita. 634-637 [doi]

Tonal distinctions between emphatic stress and pretonic lengthening in quebec FrenchLinda Thibault, Marise Ouellet. 638-641 [doi]

Perception of tonal accent by americans learning JapaneseYukihiro Nishinuma, Masako Arai, Takako Ayusawa. 646-649 [doi]

Modeling intra-speaker pitch range variation: predicting F0 targets when speaking up Elizabeth Shriberg, D. Robert Ladd, Jacques M. B. Terken. 650-653 [doi]

Predicting dialogue acts for a speech-to-speech translation systemNorbert Reithinger, Ralf Engel, Michael Kipp, Martin Klesen. 654-657 [doi]

Automatic speech translation based on the semantic structureJohannes Müller, Holger Stahl, Manfred Lang. 658-661 [doi]

A methodology for application development for spoken language systemsLewis M. Norton, Carl Weir, K. W. Scholz, Deborah A. Dahl, Ahmed Bouzid. 662-664 [doi]

A new restaurant guide conversational system: issues in rapid prototyping for specialized domainsStephanie Seneff, Joseph Polifroni. 665-668 [doi]

Semantic interpretation of a Japanese complex sentence in an advisory dialogue - focused on the postpositional word KEDO, which works as a conjunction between clausesTadahiko Kumamoto, Akira Ito. 669-672 [doi]

A Korean morphological analyzer for speech translation systemYoungkuk Hong, Myoung-Wan Koo, Gijoo Yang. 673-676 [doi]

Generic and domain-specific aspects of the waxholm NLP and dialog modulesRolf Carlson, Sheri Hunnicutt. 677-680 [doi]

A real-time system for summarizing human-human spontaneous spoken dialoguesMegumi Kameyama, Goh Kawai, Isao Arima. 681-684 [doi]

Evaluation of spoken language understanding and dialogue systemsBernd Hildebrandt, Heike Rautenstrauch, Gerhard Sagerer. 685-688 [doi]

Inter-speaker interaction of F0 in dialogsKuniko Kakita. 689-692 [doi]

A robust dialogue system for making an appointmentHans Brandt-Pook, Gernot A. Fink, Bernd Hildebrandt, Franz Kummert, Gerhard Sagerer. 693-696 [doi]

Segmentation of spoken dialogue by interjections, disfluent utterances and pausesKazuyuki Takagi, Shuichi Itahashi. 697-700 [doi]

A form-based dialogue manager for spoken language applicationsDavid Goddeau, Helen M. Meng, Joseph Polifroni, Stephanie Seneff, Senis Busayapongchai. 701-704 [doi]

The design of complex telephony applications using large vocabulary speech technologySteve Whittaker, David Attwater. 705-708 [doi]

Building 10, 000 spoken dialogue systemsStephen Sutton, David G. Novick, Ronald A. Cole, Pieter J. E. Vermeulen, Jacques de Villiers, Johan Schalkwyk, Mark A. Fanty. 709-712 [doi]

Speaker intention modeling for large vocabulary Mandarin spoken dialoguesYen-Ju Yang, Lee-Feng Chien, Lin-Shan Lee. 713-716 [doi]

Hybrid language models and spontaneous legal discourseP. E. Kenne, Mary O Kane. 717-720 [doi]

Topic change and local perplexity in spoken legal dialogueP. E. Kenne, Mary O Kane. 721-724 [doi]

Intonational cues to discourse structure in JapaneseJennifer J. Venditti, Marc Swerts. 725-728 [doi]

Principles for the design of cooperative spoken human-machine dialogueNiels Ole Bernsen, Hans Dybkjær, Laila Dybkjær. 729-732 [doi]

Development and comparison of three syllable stress classifiersKaren L. Jenkin, Michael S. Scordilis. 733-736 [doi]

Interaction of speech disorders with speech coders: effects on speech intelligibilityDonald G. Jamieson, Li Deng, M. Price, Vijay Parsa, J. Till. 737-740 [doi]

Detecting arytenoid cartilage misplacement through acoustic and electroglottographic jitter analysisMaurílio Nunes Vieira, Arnold G. D. Maran, Fergus R. McInnes, Mervyn A. Jack. 741-744 [doi]

Robust F0 and jitter estimation in pathological voicesMaurílio Nunes Vieira, Fergus R. McInnes, Mervyn A. Jack. 745-748 [doi]

Speech monitoring of infective laryngitisF. Plante, H. Kessler, Barry M. G. Cheetham, J. Earis. 749-752 [doi]

Searching for nonlinear relations in whitened jitter time seriesJean Schoentgen, Raoul De Guchteneere. 753-756 [doi]

Vocal fold pathology assessment using AM autocorrelation analysis of the teager energy operatorLiliana Gavidia-Ceballos, John H. L. Hansen, James F. Kaiser. 757-760 [doi]

Continuous positive airway pressure (CPAP) in the treatment of hypernasalityDavid P. Kuehn. 761-763 [doi]

Enhancement of alaryngeal speech by adaptive filteringCarol Y. Espy-Wilson, Venkatesh R. Chari, Caroline B. Huang. 764-767 [doi]

Simulation of disordered speech using a frequency-domain vocal tract modelLi Deng, Xuemin Shen, Donald G. Jamieson, J. Till. 768-771 [doi]

A stochastic model of fundamental period perturbation and its application to perception of pathological voice qualityYasuo Endo, Hideki Kasuya. 772-775 [doi]

A screening test for speech pathology assessment using objective quality measuresEric J. Wallen, John H. L. Hansen. 776-779 [doi]

Recent advances in hypernasal speech detection using the nonlinear teager energy operatorDouglas A. Cairns, John H. L. Hansen, James F. Kaiser. 780-783 [doi]

Human palate and related structures: their articulatory consequencesKiyoshi Honda, Shinji Maeda, Michiko Hashi, Jim Dembowski, John R. Westbury. 784-787 [doi]

A continuum mechanics representation of tongue deformationEdward P. Davis, Andrew Douglas, Maureen Stone. 788-792 [doi]

From MRI and acoustic data to articulatory synthesis: a case study of the lateral approximants in american EnglishPhilbert Bangayan, Abeer Alwan, Shrikanth Narayanan. 793-796 [doi]

Liquids in tamilShrikanth Narayanan, Abigail Kaun, Dani Byrd, Peter Ladefoged, Abeer Alwan. 797-800 [doi]

Modeling hyperarticulate speech during human-computer error resolutionSharon L. Oviatt, Gina-Anne Levow, Margaret MacEachern, Karen Kuhn. 801-804 [doi]

Using stress to disambiguate spoken Thai sentences containing syntactic ambiguitySiripong Potisuk, Mary P. Harper, Jackson T. Gandour. 805-808 [doi]

Use of prosodic information to integrate acoustic and linguistic knowledge in continuous Mandarin speech recognition with very large vocabularyHung-Yun Hsieh, Ren-Yuan Lyu, Lin-Shan Lee. 809-812 [doi]

Word boundary detection using pitch variationsG. V. Ramana Rao, J. Srichand. 813-816 [doi]

Detection of phrase boundaries in Japanese by low-pass filtering of fundamental frequency contoursAtsuhiro Sakurai, Keikichi Hirose. 817-820 [doi]

A new method for speech delexicalization, and its application to the perception of French prosodyVincent Pagel, Noelle Carbonell, Yves Laprie. 821-824 [doi]

Task adaptation for dialogues via telephone linesUdo Bub. 825-828 [doi]

The influence of bigram constraints on word recognition by humans: implications for computer speech recognitionRonald A. Cole, YongHong Yan, Troy Bailey. 829-832 [doi]

ALICE: acquisition of language in conversational environment - an approach to weakly supervised training of spoken language system for language portingTetsunori Kobayashi. 833-836 [doi]

Pitch pattern clustering of user utterances in human-machine dialogueTakashi Yoshimura, Satoru Hayamizu, Hiroshi Ohmura, Kazuyo Tanaka. 837-840 [doi]

Simplifying language through error-correcting decodingJuan-Carlos Amengual, Enrique Vidal, José-Miguel Benedí. 841-844 [doi]

A mixed approach to speech understandingMauro Cettolo, Anna Corazza, Renato de Mori. 845-848 [doi]

Speech recognition for an information kioskJean-Luc Gauvain, J. J. Gangolf, Lori Lamel. 849-852 [doi]

Localizing an automatic inquiry system for public transport informationHelmer Strik, Albert Russel, Henk van den Heuvel, Catia Cucchiarini, Lou Boves. 853-856 [doi]

Prompt constrained natural language - evolving the next generation of telephony servicesStephen M. Marcus, Deborah W. Brown, Randy G. Goldberg, Max S. Schoeffler, William R. Wetzel, Richard R. Rosinski. 857-860 [doi]

Key-phrase detection and verification for flexible speech understandingTatsuya Kawahara, Chin-Hui Lee, Biing-Hwang Juang. 861-864 [doi]

Interactive recovery from speech recognition errors in speech user interfacesBernhard Suhm, Brad A. Myers, Alex Waibel. 865-868 [doi]

Estimation of language models for new spoken language applicationsSunil Issar. 869-872 [doi]

H-infinity filtering for speech enhancementXuemin Shen, Li Deng, Anisa Yasmin. 873-876 [doi]

A comparitive analysis of channel-robust features and channel equalization methods for speech recognitionSaeed Vaseghi, Ben P. Milner. 877-880 [doi]

Robust speech recognition features based on temporal trajectory filtering of frequency band spectrumJia-lin Shen, Wen-Liang Hwang, Lin-Shan Lee. 881-884 [doi]

Durational modelling for improved connected digit recognitionKevin Power. 885-888 [doi]

Study on the dereverberation of speech based on temporal envelope filteringCarlos Avendaño, Hynek Hermansky. 889-892 [doi]

Estimating Markov model structuresThorsten Brants. 893-896 [doi]

A fertility channel model for post-correction of continuous speech recognitionEric K. Ringger, James F. Allen. 897-900 [doi]

Restoration of wide band signal from telephone speech using linear prediction error processingHiroshi Yasukawa. 901-904 [doi]

Smoothed spectral subtraction for a frequency-weighted HMM in noisy speech recognitionHiroshi Matsumoto, Noboru Naitoh. 905-908 [doi]

A simple architecture for using multiple cues in sound separationWilliam S. Woods, Martin Hansen, Thomas Wittkop, Birger Kollmeier. 909-912 [doi]

On the robust automatic segmentation of spontaneous speechBojan Petek, Ove Andersen, Paul Dalsgaard. 913-916 [doi]

Bayesian adaptation of speech recognizers to field speech dataC. G. Miglietta, Chafic Mokbel, Denis Jouvet, Jean Monné. 917-920 [doi]

Sub-band adaptive filtering applied to speech enhancementA. J. Darlington, D. J. Campbell. 921-924 [doi]

Noise robust estimate of speech dynamics for speaker recognitionJ. P. Openshaw, John S. Mason. 925-928 [doi]

Overview of speech enhancement techniques for automatic speaker recognitionJavier Ortega-Garcia, Joaquin Gonzalez-Rodriguez. 929-932 [doi]

Dynamic features for segmental speech recognitionNaomi Harte, Saeed Vaseghi, Ben P. Milner. 933-936 [doi]

Speech recognition based on a model of human auditory systemTakuya Koizumi, Mikio Mori, Shuji Taniguchi. 937-940 [doi]

APVQ encoder applied to wideband speech codingJosep M. Salavedra, Enrique Masgrau. 941-944 [doi]

Simple fast vector quantization of the line spectral frequenciesJin Zhou, Yair Shoham, Ali N. Akansu. 945-948 [doi]

Speaker individualities of vocal tract shapes of Japanese vowels measured by magnetic resonance imagesChang-Sheng Yang, Hideki Kasuya. 949-952 [doi]

Vocal tract acoustics using the transmission line matrix (TLM) methodS. El-Masri, Xavier Pelorson, P. Saguet, Pierre Badin. 953-956 [doi]

Building sensori-motor prototypes from audiovisual exemplarsGérard Bailly. 957-960 [doi]

Parameterized VT area function inversionMats Båvegård, Gunnar Fant. 961-964 [doi]

An improved vocal tract model of vowel production implementing piriform resonance and transvelar nasal couplingJianwu Dang, Kiyoshi Honda. 965-968 [doi]

Pseudo-articulatory speech synthesis for recognition using automatic feature extraction from x-ray dataC. S. Blackburn, Steve J. Young. 969-972 [doi]

N-best-based instantaneous speaker adaptation method for speech recognitionTomoko Matsui, Sadaoki Furui. 973-976 [doi]

Mixture splitting technic and temporal control in a HMM-based recognition systemClaude Montacié, Marie-José Caraty, Claude Barras. 977-980 [doi]

A unified spectral transformation adaptation approach for robust speech recognitionLei Yao, Dong Yu, Taiyi Huang. 981-984 [doi]

On-line adaptive learning of the correlated continuous density hidden Markov models for speech recognitionQiang Huo, Chin-Hui Lee. 985-988 [doi]

Speaker adaptation by modeling the speaker variation in a continuous speech recognition systemNikko Ström. 989-992 [doi]

An enquiring system of unknown words in TV news by spontaneous repetition (application of speaker normalization by speaker subspace projection)Yasuo Ariki, Shigeaki Tagashira. 993-996 [doi]

Language understanding using hidden understanding modelsRichard M. Schwartz, Scott Miller, David Stallard, John Makhoul. 997-1000 [doi]

Processing of semantic information in fluently spoken languageAllen L. Gorin. 1001-1004 [doi]

Automatic linguistic segmentation of conversational speechAndreas Stolcke, Elizabeth Shriberg. 1005-1008 [doi]

Towards understanding spontaneous speech: word accuracy vs. concept accuracyManuela Boros, Wieland Eckert, Florian Gallwitz, Günther Görz, Gerhard Hanrieder, Heinrich Niemann. 1009-1012 [doi]

A stochastic case frame approach for natural language understandingWolfgang Minker, Samir Bennacef, Jean-Luc Gauvain. 1013-1016 [doi]

Improving speech understanding by incorporating database constraints and dialogue historyFrank Seide, Bernhard Rueber, Andreas Kellner. 1017-1020 [doi]

A new discourse structure model for spontaneous spoken dialogueTetsuro Chino, Hiroyuki Tsuboi. 1021-1024 [doi]

An architecture for spoken dialogue managementDavid Duff, Barbara Gates, Susann LuperFoy. 1025-1028 [doi]

Pausing strategies in discourse in dutchMonique E. van Donzel, Florien J. Koopmans-van Beinum. 1029-1032 [doi]

Filled pauses as markers of discourse structureMarc Swerts, Anne Wichmann, Robbert-Jan Beun. 1033-1036 [doi]

The prosodic analysis of Korean dialogue speech - through a comparative study with read speechCheol-jae Seong, Minsoo Hahn. 1037-1040 [doi]

Changing the topic: how long does it take?Mary O Kane, P. E. Kenne. 1041-1044 [doi]

Learning pronunciation dictionary from speech dataChristian-Michael Westendorf, Jens Jelitto. 1045-1048 [doi]

The trended HMM with discriminative training for phonetic classificationC. Rathinavelu, Li Deng. 1049-1052 [doi]

Improving decision trees for acoustic modelingAriane Lazaridès, Yves Normandin, Roland Kuhn. 1053-1056 [doi]

An improved training algorithm in HMM-based speech recognitionGongjun Li, Taiyi Huang. 1057-1060 [doi]

Speech recognition using a strong correlation assumption for the instantaneous spectraJi Ming, Peter O Boyle, John G. McMahon, F. Jack Smith. 1061-1064 [doi]

On parameter filtering in continuous subword-unit-based speech recognitionPau Pachès-Leal, Climent Nadeu. 1065-1068 [doi]

Estimation of statistical phoneme center considering phonemic environmentsShigeki Okawa, Katsuhiko Shirai. 1069-1072 [doi]

Integration of context-dependent durational knowledge into HMM-based speech recognitionXue Wang, Louis ten Bosch, Louis C. W. Pols. 1073-1076 [doi]

Speech recognition based on acoustically derived segment unitsToshiaki Fukada, Michiel Bacchiani, Kuldip K. Paliwal, Yoshinori Sagisaka. 1077-1080 [doi]

Robust gender-dependent acoustic-phonetic modelling in continuous speech recognition based on a new automatic male/female classificationRivarol Vergin, Azarshid Farhat, Douglas D. O Shaughnessy. 1081-1084 [doi]

A codebook adaptation algorithm for SCHMM using formant distributionTae Young Yang, Won-Ho Shin, Weon-Goo Kim, Dae Hee Youn. 1085-1088 [doi]

Parameter tying for flexible speech recognitionJacques Simonin, S. Bodin, Denis Jouvet, Katarina Bartkova. 1089-1092 [doi]

Word-spotting based on inter-word and intra-word diphone modelsTsuneo Nitta, Shin-ichi Tanaka, Yasuyuki Masai, Hiroshi Matsuura. 1093-1096 [doi]

Duration modeling with expanded HMM applied to speech recognitionAntonio Bonafonte, Josep Vidal, Albino Nogueiras. 1097-1100 [doi]

Different strategies for distribution clustering using discrete, semicontinuous and continuous HMMs in CSRRicardo de Córdoba, José Manuel Pardo. 1101-1104 [doi]

Improved HMM phone and triphone models for realtime ASR telephony applicationsIlija Zeljkovic, Shrikanth Narayanan. 1105-1108 [doi]

Improved extended HMM composition by incorporating power varianceYasuhiro Minami, Sadaoki Furui. 1109-1112 [doi]

Optimal filtering and smoothing for speech recognition using a stochastic target modelGordon Ramsay, Li Deng. 1113-1116 [doi]

Speech recognition using syllable-like unitsZhihong Hu, Johan Schalkwyk, Etienne Barnard, Ronald A. Cole. 1117-1120 [doi]

Search for unexplored effects in speech productionCecil H. Coker, M. H. Krane, B. Y. Reis, R. A. Kubli. 1121-1124 [doi]

Articulatory synthesis from x-rays and inversion for an adaptive speech robotPierre Badin, Christian Abry. 1125-1128 [doi]

Adaptive recognition method based on posterior use of distribution pattern of output probabilitiesJin-Song Zhang, Beiqian Dai, Changfu Wang, HingKeung Kwan, Keikichi Hirose. 1129-1132 [doi]

Iterative unsupervised adaptation using maximum likelihood linear regressionPhilip C. Woodland, D. Pye, M. J. F. Gales. 1133-1136 [doi]

A compact model for speaker-adaptive trainingTasos Anastasakos, John W. McDonough, Richard M. Schwartz, John Makhoul. 1137-1140 [doi]

Iterative unsupervised speaker adaptation for batch dictationShigeru Homma, Jun-ichi Takahashi, Shigeki Sagayama. 1141-1144 [doi]

Rapid unsupervised adaptation to children s speech on a connected-digit taskDaniel C. Burnett, Mark A. Fanty. 1145-1148 [doi]

Speaker adaptation using tree structured shared-state HMMsJun Ishii, Masahiro Tonomura, Shoichi Matsunaga. 1149-1152 [doi]

Learning to parse spontaneous speechFinn Dag Buø, Alex Waibel. 1153-1156 [doi]

Spontaneous speech and natural language processing ALPES: a robust semantic-led parserJean-Yves Antoine. 1157-1160 [doi]

The natural language processing module for a voice assisted operator at telef nica i+DJ. Alvarez-Cercadillo, F. Javier Caminero-Gil, C. Crespo-Casas, Daniel Tapias Merino. 1161-1164 [doi]

Compound words in large-vocabulary German speech recognition systemsAndré Berton, Pablo Fetter, Peter Regel-Brietzmann. 1165-1168 [doi]

Prosody, empty categories and parsing - a success storyAnton Batliner, Anke Feldhaus, Stefan Geißler, Tibor Kiss, Ralf Kompe, Elmar Nöth. 1169-1172 [doi]

"almost parsing" technique for language modelingB. Srinivas. 1173-1176 [doi]

From segmental duration properties to rhythmic structure: a study of interactions between high and low levelMarise Ouellet, Benoît Tardif. 1177-1180 [doi]

Analysis of context-dependent segmental duration for automatic speech recognitionXue Wang, Louis C. W. Pols, Louis ten Bosch. 1181-1184 [doi]

The role of the rhythmic groups in the segmentation of continuous French speechDelphine Dahan. 1185-1188 [doi]

Experimental phonetic study of the syllable duration of Korean with respect to the positional effectHyunbok Lee, Cheol-jae Seong. 1193-1196 [doi]

Timing of pitch movements and accentuation of syllablesDik J. Hermes. 1197-1200 [doi]

A probabilistic approach to AMDF pitch detectionGoangshiuan S. Ying, Leah H. Jamieson, Carl D. Mitchell. 1201-1204 [doi]

From sagittal cut to area function: an RMI investigationAlain Soquet, Véronique Lecuit, Thierry Metens, Didier Demolin. 1205-1208 [doi]

Pitch detection and voiced/unvoiced decision algorithm based on wavelet transformsLéonard Janer, Juan José Bonet, Eduardo Lleida-Solano. 1209-1212 [doi]

Decomposition of speech signals into a deterministic and a stochastic partYannis Stylianou. 1213-1216 [doi]

Improved glottal closure instant detector based on linear prediction and standard pitch conceptCheol-Woo Jo, Ho-Gyun Bang, William A. Ainsworth. 1217-1220 [doi]

Analysis of speech segments using variable spectral/temporal resolutionXihong Wang, Stephen A. Zahorian, Stefan Auberg. 1221-1224 [doi]

Time-based clustering for phonetic segmentationBrian Eberman, William Goldenthal. 1225-1228 [doi]

Formant analysis using mixtures of GaussiansParham Zolfaghari, Tony Robinson. 1229-1232 [doi]

Deriving articulatory representations from speech with various excitation modesHywel B. Richards, John S. Mason, Melvyn J. Hunt, John S. Bridle. 1233-1236 [doi]

blind speech segmentation: automatic segmentation of speech without linguistic knowledgeManish Sharma, Richard J. Mammone. 1237-1240 [doi]

Speech synthesis using a nonlinear energy damping model for the vocal folds vibration effectHiroshi Ohmura, Kazuyo Tanaka. 1241-1244 [doi]

Neural networks learning with L1 criteria and its efficiency in linear prediction of speech signalsMunehiro Namba, Hiroyuki Kamata, Yoshihisa Ishida. 1245-1248 [doi]

Preprocessing and neural classification of English stop consonants [b, d, g, p, t, k]Anna Esposito, Eugène C. Ezin, M. Ceccarelli. 1249-1252 [doi]

A novel approach to the estimation of voice source and vocal tract parameters from speech signalsWen Ding, Hideki Kasuya. 1257-1260 [doi]

Syllable detection in read and spontaneous speechHartmut R. Pfitzinger, Susanne Burger, Sebastian Heid. 1261-1264 [doi]

Maximum likelihood learning of auditory feature maps for stationary vowelsKuansan Wang, Chin-Hui Lee, Biing-Hwang Juang. 1265-1268 [doi]

Explicit segmentation of speech using Gaussian modelsAntonio Bonafonte, Albino Nogueiras, Antonio Rodriguez-Garrido. 1269-1272 [doi]

A comparison of several recent methods of fundamental frequency and voicing decision estimationE. Mousset, William A. Ainsworth, José A. R. Fonollosa. 1273-1276 [doi]

Robust pitch estimation with harmonics enhancement in noisy environments based on instantaneous frequencyToshihiko Abe, Takao Kobayashi, Satoshi Imai. 1277-1280 [doi]

Integrated polispectrum on speech recognitionAsunción Moreno, Miquel Rutllán. 1281-1284 [doi]

Analysis of acoustic properties of the nasal tract using 3-d FEMHisayoshi Suzuki, Takayoshi Nakai, Hiroshi Sakakibara. 1285-1288 [doi]

Experiments with analysis by synthesis of glottal airflowJohan Liljencrants. 1289-1292 [doi]

An incremental speaker-adaptation technique for hybrid HMM-MLP recognizerJoao P. Neto, Ciro Martins, Luís B. Almeida. 1293-1296 [doi]

Phoneme segmentation of continuous speech using multi-layer perceptronYoungjoo Suh, YoungJik Lee. 1297-1300 [doi]

Stochastic perceptual speech models with durational dependenceJeff Bilmes, Nelson Morgan, Su-Lin Wu, Hervé Bourlard. 1301-1304 [doi]

Boosting the performance of connectionist large vocabulary speech recognitionG. D. Cook, Anthony J. Robinson. 1305-1308 [doi]

HMMs and OWE neural network for continuous speech recognitionNicolas Pican, Dominique Fohr, Jean-François Mari. 1309-1312 [doi]

Smoothed local adaptation of connectionist systemsSteve Waterhouse, Dan J. Kershaw, Tony Robinson. 1313-1316 [doi]

Robust speech recognition with speaker localization by a microphone arrayTakeshi Yamada, Satoshi Nakamura, Kiyohiro Shikano. 1317-1320 [doi]

Sound source localization in reverberant environments using an outlier elimination algorithmEa-Ee Jan, James L. Flanagan. 1321-1324 [doi]

The 1995 abbot LVCSR system for multiple unknown microphonesDan J. Kershaw, Tony Robinson, Steve Renals. 1325-1328 [doi]

Experiments of speech recognition in a noisy and reverberant environment using a microphone array and HMMDiego Giuliani, Maurizio Omologo, Piergiorgio Svaizer. 1329-1332 [doi]

Increasing robustness in GMM speaker recognition systems for noisy and reverberant speech with low complexity microphone arraysJoaquin Gonzalez-Rodriguez, Javier Ortega-Garcia, César Martin, Luis Hernández. 1333-1336 [doi]

Robust automatic speech recognition using a multi-channel signal separation front-endKuan-Chieh Yen, Yunxin Zhao. 1337-1340 [doi]

Prosody generation in text-to-speech conversion using dependency graphsAnders Lindström, Ivan Bretan, Mats Ljungqvist. 1341-1344 [doi]

Extraction method of non-restrictive modification in Japanese as a marked factor of prosodyHisako Asano, Hisashi Ohara, Yoshifumi Ooyama. 1345-1348 [doi]

Modeling contrast in the generation and synthesis of spoken languageScott Prevost. 1349-1352 [doi]

A left-to-right processing model of pausing in Japanese based on limited syntactic informationHajime Tsukada. 1353-1356 [doi]

Modeling of intonation bearing emphasis for TTS-synthesis of greek dialoguesDimitrios Galanis, Vassilios Darsinos, George Kokkinakis. 1357-1360 [doi]

Synthesizing prosody: a prominence-based approachBarbara Heuft, Thomas Portele. 1361-1364 [doi]

Multilingual text analysis for text-to-speech synthesisRichard Sproat. 1365-1368 [doi]

Spoken-style explanation generator for Japanese kanji using a text-to-speech systemYoshifumi Ooyama, Hisako Asano, Koji Matsuoka. 1369-1372 [doi]

A method for estimating prosodic symbol from text for Japanese text-to-speech synthesisKen-ichi Magata, Tomoki Hamagami, Mitsuo Komura. 1373-1376 [doi]

Statistical methods in data-driven modeling of Spanish prosody for text to speechEduardo López Gonzalo, Jose M. Rodriguez-Garcia. 1377-1380 [doi]

Intonation processing for TTS using stylization and neural network learning methodJung-Chul Lee, YoungJik Lee, Sanghun Kim, Minsoo Hahn. 1381-1384 [doi]

Generating F0 contours from toBI labels using linear regressionAlan W. Black, Andrew Hunt. 1385-1388 [doi]

The broad study of homograph disambiguity for Mandarin speech synthesisWern-Jun Wang, Shaw-Hwa Hwang, Sin-Horng Chen. 1389-1392 [doi]

The MBROLA project: towards a set of high quality speech synthesizers free of use for non commercial purposesThierry Dutoit, Vincent Pagel, Nicolas Pierret, F. Bataille, Olivier van der Vrecken. 1393-1396 [doi]

Training data selection for voice conversion using speaker selection and vector field smoothingMakoto Hashimoto, Norio Higuchi. 1397-1400 [doi]

A new voice transformation method based on both linear and nonlinear prediction analysisKi-Seung Lee, Dae Hee Youn, Il Whan Cha. 1401-1404 [doi]

On the transformation of the speech spectrum for voice conversionGeneviève Baudoin, Yannis Stylianou. 1405-1408 [doi]

Spectral analysis of synthetic speech and natural speech with noise over the telephone lineCristina Delogu, Andrea Paoloni, Susanna Ragazzini, Paola Ridolfi. 1409-1412 [doi]

A new speech synthesis system based on the ARX speech production modelWeizhong Zhu, Hideki Kasuya. 1413-1416 [doi]

Speech synthesis using the CELP algorithmGeraldo Lino de Campos, Evandro B. Gouvêa. 1417-1420 [doi]

A Mandarin text-to-speech systemShaw-Hwa Hwang, Sin-Horng Chen, Yih-Ru Wang. 1421-1424 [doi]

Residual-based speech modification algorithms for text-to-speech synthesisMike D. Edgington, A. Lowry. 1425-1428 [doi]

A generalized LR parser for text-to-speech synthesisPer Olav Heggtveit. 1429-1432 [doi]

Enhanced shape-invariant pitch and time-scale modification for concatenative speech synthesisM. P. Pollard, Barry M. G. Cheetham, C. C. Goodyear, Mike D. Edgington, A. Lowry. 1433-1436 [doi]

An excitation synchronous pitch waveform extraction method and its application to the VCV-concatenation synthesis of Japanese spoken wordsYasuhiko Arai, Ryo Mochizuki, Hirofumi Nishimura, Takashi Honda. 1437-1440 [doi]

A new Chinese text-to-speech system with high naturalnessRen-Hua Wang, Qingfeng Liu, Difei Tang. 1441-1444 [doi]

Language training system utilizing speech modificationYoram Meron, Keikichi Hirose. 1449-1452 [doi]

Perception of English /r/ and /l/ speech contrasts by native Korean listeners with extensive English-language experienceDonald G. Jamieson, K. Yu. 1453-1456 [doi]

Automatic text-independent pronunciation scoring of foreign language student speechLeonardo Neumeyer, Horacio Franco, Mitchel Weintraub, Patti Price. 1457-1460 [doi]

Assessing the contribution of instructional technology in the teaching of pronunciationAntônio Simoes. 1461-1464 [doi]

Detection of foreign speakers pronunciation errors for second language training - preliminary resultsMaxine Eskenazi. 1465-1468 [doi]

Foreign accent in intonation patterns - a contrastive study applying a quantitative model of the F0 contourHansjörg Mixdorff. 1469-1472 [doi]

Input modality effects in foreign accentDuncan J. Markham, Yasuko Nagano-Madsen. 1473-1476 [doi]

For speech perception by humans or machines, three senses are better than oneLynne E. Bernstein, Christian Benoît. 1477-1480 [doi]

A few factors which affect the degree of incorporating lip-read information into speech perceptionKaoru Sekiyama, Yoh ichi Tohkura, Michio Umeda. 1481-1484 [doi]

Characterizing audiovisual information during speechEric Vatikiotis-Bateson, Kevin G. Munhall, Y. Kasahara, Frederique Garcia, Hani Yehia. 1485-1488 [doi]

The implications of the tadoma method of speechreading for spoken language processingCharlotte M. Reed. 1489-1492 [doi]

Seeing speech in space and time: psychological and neurological findingsRuth Campbell. 1493-1496 [doi]

What s in the pure prosody?Volker Strom, Christina Widera. 1497-1500 [doi]

F0 declination in read-aloud and spontaneous speechMarc Swerts, Eva Strangert, Mattias Heldner. 1501-1504 [doi]

Prediction of prosodic phrase boundaries considering variable speaking rateYeon-Jun Kim, Yung-Hwan Oh. 1505-1508 [doi]

Prediction of F0 parameter of contextualized utterances in dialogueYoichi Yamashita, Riichiro Mizoguchi. 1509-1512 [doi]

The production and perception of potentially ambiguous intonation contours by speakers of Russian and JapaneseVeronika Makarova, J. Matsui. 1513-1516 [doi]

What is invariant and what is optional in the realization of a FOCUSED word? a cross-dialectal study of Swedish sentences with moving focusRobert Eklund. 1517-1520 [doi]

Quantifying spectral characteristics of fricativesChristine H. Shadle, Sheila J. Mair. 1521-1524 [doi]

Acoustic characteristics of ejectives in ingushNatasha Warner. 1525-1528 [doi]

An acoustic profile of consonant reductionR. J. J. H. van Son, Louis C. W. Pols. 1529-1532 [doi]

Devoicing in post-vocalic canadian-French obstruantsDanièle Archambault, Blagovesta Maneva. 1533-1536 [doi]

Paying attention to speaking rateAlexander L. Francis, Howard C. Nusbaum. 1537-1540 [doi]

The acoustic structure of vowels in mothers speech to infants and adultsJean E. Andruski, Patricia K. Kuhl. 1545-1548 [doi]

Acoustical characteristics of sound production of deaf and normally hearing infantsChris J. Clement, Florien J. Koopmans-van Beinum, Louis C. W. Pols. 1549-1552 [doi]

Word recognition by Japanese infantsPierre A. Hallé, Toshisada Deguchi, Yuji Tamekawa, Benedicte de Boysson-Bardies, Shigeru Kiritani. 1557-1560 [doi]

Developmental change in perception of clause boundaries by 6- and 10-month-old Japanese infantsAkiko Hayashi, Yuji Tamekawa, Toshisada Deguchi, Shigeru Kiritani. 1565-1568 [doi]

A frequency domain method for parametrization of the voice sourcePaavo Alku, Erkki Vilkman. 1569-1572 [doi]

Glottal correlates of the word stress and the tense/lax opposition in GermanKrzysztof Marasek. 1573-1576 [doi]

Coarticulatory stability in american English /r/Suzanne Boyce, Carol Y. Espy-Wilson. 1577-1580 [doi]

An MRI-based analysis of the English /r/ and /l/ articulationsShinobu Masaki, Reiko Akahane-Yamada, Mark K. Tiede, Yasuhiro Shimada, Ichiro Fujimoto. 1581-1584 [doi]

Does lexical stress or metrical stress better predict word boundaries in Dutch?David van Kuijk. 1585-1588 [doi]

Optopalatograph (OPG): a new apparatus for speech production analysisAlan Wrench, Alan D. McIntosh, William J. Hardcastle. 1589-1592 [doi]

Prediction of vowel systems using a deductive approachRené Carré. 1593-1596 [doi]

Distinctions between [t] and [tch] using electropalatography dataSheila J. Mair, Celia Scully, Christine H. Shadle. 1597-1600 [doi]

Relating formants and articulation in intelligibility test wordsMichiko Hashi, Raymond D. Kent, John R. Westbury, Mary J. Lindstrom. 1601-1604 [doi]

The role of coarticulation in the perception of vowel quality in modern standard ArabicImad Znagui, Mohamed Yeou. 1605-1608 [doi]

Updating the reading EPGSimon Arnfield, Wilf Jones. 1609-1611 [doi]

Lexical stress detection on stress-minimal word pairsGoangshiuan S. Ying, Leah H. Jamieson, Ruxin Chen, Carl D. Mitchell. 1612-1615 [doi]

An acoustic study of the interaction between stressed and unstressed syllables in spoken MandarinJing Wang. 1616-1619 [doi]

Automatic detection of accent nuclei at the head of words for speech recognitionNobuaki Minematsu, Seiichi Nakagawa. 1620-1623 [doi]

Automatic generation of prosodic structure for high quality Mandarin speech synthesisFu-Chiang Chou, Chiu-yu Tseng, Lin-Shan Lee. 1624-1627 [doi]

A study on Japanese prosodic pattern and its modeling in restricted speechTomoki Hamagami, Ken-ichi Magata, Mitsuo Komura. 1628-1631 [doi]

A phonetic study of focus in intransitive verb sentencesSteve Hoskins. 1632-1635 [doi]

Goethe for prosodyStefan Rapp. 1636-1639 [doi]

A functional model for generation of the local components of F0 contours in ChineseJinfu Ni, Ren-Hua Wang, Deyu Xia. 1644-1647 [doi]

Studies of the mcgurk effect: implications for theories of speech perceptionKerry P. Green. 1652-1655 [doi]

Using the visual component in automatic speech recognitionN. M. Brooke. 1656-1659 [doi]

Perceptual organization of speech in one and several modalities: common functions, common resourcesRobert E. Remez. 1660-1663 [doi]

Multi-modal encoding of speech in memory: a first reportDavid B. Pisoni, Helena M. Saldaña, Sonya M. Sheffert. 1664-1667 [doi]

Evaluating automatic speech recognition as a component of a multi-input device human-computer interfaceBrian Mellor, Chris Baber, C. Tunley. 1668-1671 [doi]

Data collection for the MASK kiosk: WOz vs prototype systemA. Life, I. Salter, Jean-Noel Temem, F. Bernard, Sophie Rosset, Samir Bennacef, Lori Lamel. 1672-1675 [doi]

An experimental Japanese/English interpreting video phone systemM. Karaorman, Ted H. Applebaum, T. Itoh, M. Endo, Y. Ohno, M. Hoshimi, Takahiro Kamai, Kenji Matsui, Kazue Hata, Steve Pearson, Jean-Claude Junqua. 1676-1679 [doi]

User participation and compliance in speech automated telecommunications applicationsSara Basson, Stephen Springer, Cynthia Fong, Hong C. Leung, Edward Man, Michele Olson, John F. Pitrelli, Ranvir Singh, Suk Wong. 1680-1683 [doi]

Embedding speech in web interfacesSamuel Bayer. 1684-1687 [doi]

Voice-activated home banking system and its field trialToshihiro Isobe, Masatoshi Morishima, Fuminori Yoshitani, Nobuo Koizumi, Ken ya Murakami. 1688-1691 [doi]

A text analyzer for Korean text-to-speech systemsSangho Lee, Yung-Hwan Oh. 1692-1695 [doi]

Comparison of two tree-structured approaches for grapheme-to-phoneme conversionOve Andersen, Roland Kuhn, Ariane Lazaridès, Paul Dalsgaard, Jürgen Haas, Elmar Nöth. 1700-1703 [doi]

A recurrent network that learns to pronounce English textM. J. Adamson, Robert I. Damper. 1704-1707 [doi]

Archisegment-based letter-to-phone conversion for concatenative speech synthesis in PortugueseEleonora Cavalcante Albano, Agnaldo Antonio Moreira. 1708-1711 [doi]

A new method of generating speech synthesis units based on phonological knowledge and clustering techniqueYuki Yoshida, Shin ya Nakajima, Kazuo Hakoda, Tomohisa Hirokawa. 1712-1715 [doi]

Consistency in transcription and labelling of German intonation with GToBIMartine Grice, Matthias Reyelt, Ralf Benzmüller, Jörg Mayer, Anton Batliner. 1716-1719 [doi]

Syntactic-prosodic labeling of large spontaneous speech data-basesAnton Batliner, Ralf Kompe, Andreas Kießling, Heinrich Niemann, Elmar Nöth. 1720-1723 [doi]

Relationship between discourse structure and dynamic speech rateFlorien J. Koopmans-van Beinum, Monique E. van Donzel. 1724-1727 [doi]

Using prosodic clues to decide when to produce back-channel utterancesNigel Ward. 1728-1731 [doi]

Dialog act classification with the help of prosodyMarion Mast, Ralf Kompe, Stefan Harbeck, Andreas Kießling, Heinrich Niemann, Elmar Nöth, Ernst Günter Schukat-Talamazzini, Volker Warnke. 1732-1735 [doi]

Using lexical stress in continuous speech recognition for dutchDavid van Kuijk, Henk van den Heuvel, Lou Boves. 1736-1739 [doi]

Automatic accent classification of foreign accented australian English speechKarsten Kumpf, Robin W. King. 1740-1743 [doi]

Discriminative adaptation for speaker verificationFilipp Korkmazskiy, Biing-Hwang Juang. 1744-1747 [doi]

Perceptual features of unknown foreign languages as revealed by multi-dimensional scalingVerna Stockmal, D. Muljani, Zinny S. Bond. 1748-1751 [doi]

On-line incremental adaptation for speaker verification using maximum likelihood estimates of CDHMM parametersKin Yu, John S. Mason. 1752-1755 [doi]

Combining methods to improve speaker verification decisionDominique Genoud, Frédéric Bimbot, Guillaume Gravier, Gérard Chollet. 1756-1759 [doi]

Incremental speaker adaptation with minimum error discriminative training for speaker identificationCesar Martín del Alamo, J. Alvarez, C. de la Torre, F. J. Poyatos, Lis Hernández. 1760-1763 [doi]

Frame level likelihood normalization for text-independent speaker identification using Gaussian mixture modelsKonstantin P. Markov, Seiichi Nakagawa. 1764-1767 [doi]

On using prosodic cues in automatic language identificationAnn E. Thymé-Gobbel, Sandra E. Hutchins. 1768-1771 [doi]

Speaker recognition model using two-dimensional mel-cepstrum and predictive neural networkTadashi Kitamura, Shinsai Takei. 1772-1775 [doi]

Unknown language rejection in language identification systemHingKeung Kwan, Keikichi Hirose. 1776-1779 [doi]

Spoken language identification using large vocabulary speech recognitionJames Hieronymus, Shubha Kadambe. 1780-1783 [doi]

Accent identificationCarlos Teixeira, Isabel Trancoso, António Joaquim Serralheiro. 1784-1787 [doi]

On the sources of inter- and intra-speaker variability in the acoustic dynamics of speechXue Yang, J. Bruce Millar, Iain MacLeod. 1792-1795 [doi]

Language identification with inaccurate string matchingKay M. Berkling, Etienne Barnard. 1796-1799 [doi]

Robust prosodic features for speaker identificationMichael J. Carey 0002, Eluned S. Parris, Harvey Lloyd-Thomas, S. J. Bennett. 1800-1803 [doi]

Text independent speaker identification on noisy environments by means of self organizing mapsEnric Monte, Javier Hernando Pericas, Xavier Miró, A. Adolf. 1804-1807 [doi]

Language identification using language-dependent phonemes and language-independent speech unitsPaul Dalsgaard, Ove Andersen, Hanne Hesselager, Bojan Petek. 1808-1811 [doi]

Adding the affective dimension: a new look in speech analysis and synthesisKlaus R. Scherer. 1811 [doi]

Ethological theory and the expression of emotion in the voiceJohn J. Ohala. 1812-1815 [doi]

Synthesizing emotions in speech: is it time to get excited?Iain R. Murray, John L. Arnott. 1816-1819 [doi]

A study on task-independent subword selection and modeling for speech recognitionChin-Hui Lee, Biing-Hwang Juang, Wu Chou, J. J. Molina-Perez. 1820-1823 [doi]

Simultaneous ANN feature and HMM recognizer design using string-based minimum classification error (MCE) trainingMazin G. Rahim, Chin-Hui Lee. 1824-1827 [doi]

Quantizing mixture-weights in a tied-mixture HMMSunil K. Gupta, Frank K. Soong, Raziel Haimi-Cohen. 1828-1831 [doi]

Variance compensation within the MLLR framework for robust speech recognition and speaker adaptationM. J. F. Gales, D. Pye, Philip C. Woodland. 1832-1835 [doi]

Maximum-likelihood stochastic matching approach to non-linear equalization for robust speech recognitionArun C. Surendran, Chin-Hui Lee, Mazin G. Rahim. 1836-1839 [doi]

Estimation of channel bias for telephone speech recognitionJen-Tzung Chien, Hsiao-Chuan Wang, Lee-Min Lee. 1840-1843 [doi]

Synthesis of English intonation using explicit models of reading and spontaneous speechM. E. Johnson. 1844-1847 [doi]

Implementation and evaluation of a model for synthesis of Swedish intonationMerle Horne, Marcus Filipsson. 1848-1851 [doi]

Natural prosody generation for domain specific text-to-speech systemsNobuyuki Katae, Shinta Kimura. 1852-1855 [doi]

Improving text-to-speech synthesisMark Tatham, Eric Lewis. 1856-1859 [doi]

Synthesis of stressed speech from isolated neutral speech using HMM-based modelsSahar E. Bou-Ghazale, John H. L. Hansen. 1860-1863 [doi]

Modeling segment intonation for Slovene TTS systemAles Dobnikar. 1864-1867 [doi]

Word predictability after hesitations: a corpus-based studyElizabeth Shriberg, Andreas Stolcke. 1868-1871 [doi]

Interruptions and intonationLi-chiung Yang. 1872-1875 [doi]

On not recognizing disfluencies in dialogueRobin J. Lickley, Ellen Gurman Bard. 1876-1879 [doi]

A theory of word frequencies and its application to dialogue move recognitionPhil Garner, Sue Browning, Roger Moore, Martin Russell. 1880-1883 [doi]

Utterance units and grounding in spoken dialogueDavid R. Traum, Peter A. Heeman. 1884-1887 [doi]

Coordinating turn-taking with gazeDavid G. Novick, Brian Hansen, Karen Ward. 1888-1891 [doi]

BABEL: an eastern european multi-language databasePeter Roach, Simon Arnfield, William J. Barry, J. Baltova, Marian Boldea, Adrian Fourcin, W. Gonet, Ryszard Gubrynowicz, E. Hallum, Lori Lamel, Krzysztof Marasek, Alain Marchal, Einar Meister, Klára Vicsi. 1892-1893 [doi]

USTC95 - a putonghua corpusRen-Hua Wang, Deyu Xia, Jinfu Ni, Bicheng Liu. 1894-1897 [doi]

Telephone data collection using the world wide webEdward Hurley, Joseph Polifroni, James R. Glass. 1898-1901 [doi]

The SIVA speech database for speaker verification: description and evaluationM. Falcone, A. Gallo. 1902-1905 [doi]

A multi-level description of date expressions in German telephone speechChristoph Draxler. 1906-1909 [doi]

Viterbi search visualization using vista: a generic performance visualization toolRobert H. Halstead Jr., Ben Serridge, Jean-Manuel Van Thong, William Goldenthal. 1910-1913 [doi]

A multilingual phonetic representation and analysis system for different speech databasesToomas Altosaar, Matti Karjalainen, Martti Vainio. 1914-1917 [doi]

FRESCO: the French telephone speech data collection - part of the european Speechdat(m) projectDetlev Langmann, Reinhold Haeb-Umbach, Lou Boves, Els den Os. 1918-1921 [doi]

Predicting the out-of-vocabulary rate and the required vocabulary size for speech processing applicationsJohannes Müller, Holger Stahl, Manfred Lang. 1922-1925 [doi]

AMULET: automatic MUltisensor speech labelling and event tracking: study of the spatio-temporal correlations in voiceless plosive productionNathalie Parlangeau, Alain Marchal. 1926-1929 [doi]

Constructing multi-level speech database for spontaneous speech processingMinsoo Hahn, Sanghun Kim, Jung-Chul Lee, Yong-Ju Lee. 1930-1933 [doi]

Preliminaries to a romanian speech databaseMarian Boldea, Alin Doroga, Tiberiu Dumitrescu, Maria Pescaru. 1934-1937 [doi]

Labelled data bank of spoken standard German the Kiel corpus of read/spontaneous speechKlaus J. Kohler. 1938-1941 [doi]

SAPPHIRE: an extensible speech analysis and recognition tool based on tcl/tkI. Lee Hetherington, Michael K. McCandless. 1942-1945 [doi]

Automatic detection of topic boundaries and keywords in arbitrary speech using incremental reference interval-free continuous DPJiro Kiyama, Yoshiaki Itoh, Ryuichi Oka. 1946-1949 [doi]

Very-large-vocabulary Mandarin voice message file retrieval using speech queriesBo-Ren Bai, Lee-Feng Chien, Lin-Shan Lee. 1950-1953 [doi]

Gandalf - a Swedish telephone speaker verification databaseHåkan Melin. 1954-1957 [doi]

The DCIEM map task corpus: spontaneous dialogue under sleep deprivation and drug treatmentEllen Gurman Bard, Catherine Sotillo, Anne H. Anderson, M. M. Taylor. 1958-1961 [doi]

The nemours database of dysarthric speechXavier Menéndez-Pidal, James B. Polikoff, Shirley M. Peters, Jennie E. Leonzio, H. Timothy Bunnell. 1962-1965 [doi]

POST: parallel object-oriented speech toolkitJean Hennebert, Dijana Petrovska-Delacrétaz. 1966-1969 [doi]

Recognizing emotion in speechFrank Dellaert, Thomas Polzin, Alex Waibel. 1970-1973 [doi]

Emotions in time domain synthesisBarbara Heuft, Thomas Portele, Monika Rauth. 1974-1977 [doi]

Word class driven synthesis of prosodic annotationsSimon Arnfield. 1978-1980 [doi]

Dynamical modelling of vowel sounds as a synthesis toolM. Banbrook, S. McLaughlin. 1981-1984 [doi]

Emotional speech elicited using computer gamesTom Johnstone. 1985-1988 [doi]

Automatic statistical analysis of the signal and prosodic signs of emotion in speechRoddy Cowie, Ellen Douglas-Cowie. 1989-1992 [doi]

Channel and noise normalization using affine transformed cepstrumXiaoyu Zhang, Richard J. Mammone. 1993-1996 [doi]

Spectral estimation and normalisation for robust speech recognitionTom Claes, Fei Xie, Dirk Van Compernolle. 1997-2000 [doi]

Trellis encoded vector quantization for robust speech recognitionWu Chou, Nambi Seshadri, Mazin G. Rahim. 2001-2004 [doi]

Phone clustering using the bhattacharyya distanceBrian Mak, Etienne Barnard. 2005-2008 [doi]

Variability of lombard effects under different noise conditionsAtsushi Wakao, Kazuya Takeda, Fumitada Itakura. 2009-2012 [doi]

Lombard effect compensation and noise suppression for noisy Lombard speech recognitionSang-Mun Chi, Yung-Hwan Oh. 2013-2016 [doi]

The use of shibboleth words for automatically classifying speakers by dialectA. W. F. Huggins, Yogen Patel. 2017-2020 [doi]

Data collection of Japanese dialects and its influence into speech recognitionIkuo Kudo, Takao Nakama, Tomoko Watanabe, Reiko Kameyama. 2021-2024 [doi]

Statistical dialect classification based on mean phonetic featuresDavid R. Miller, James Trischitta. 2025-2027 [doi]

Norwegian numerals: a challenge to automatic speech recognitionKnut Kvale. 2028-2031 [doi]

Evaluation of the telef nica i+d natural numbers recognizer over different dialects of Spanish from Spain and AmericaC. de la Torre, F. Javier Caminero-Gil, J. Alvarez, Cesar Martín del Alamo, Luis A. Hernández Gómez. 2032-2035 [doi]

Rhythmic constraints on English stress timingFred Cummins, Robert F. Port. 2036-2039 [doi]

On the interaction of clash, focus and phonological phrasingIrene Vogel, Steve Hoskins. 2040-2043 [doi]

On the quantal nature of speech timingGunnar Fant, Anita Kruckenberg. 2044-2047 [doi]

Differential perception of tonal contours through the syllableDavid House. 2048-2051 [doi]

Pitch, loudness, and segmental duration correlates: towards a model for the phonetic aspects of finnish prosodyMartti Vainio, Toomas Altosaar. 2052-2055 [doi]

Prosodic manipulation system of speech material for perceptual experimentsNobuaki Minematsu, Seiichi Nakagawa, Keikichi Hirose. 2056-2059 [doi]

Clustered language models with context-equivalent statesJoerg P. Ueberla, I. R. Gransden. 2060-2062 [doi]

Modeling of contextual effects and its application to word spottingYuji Yonezawa, Masato Akagi. 2063-2066 [doi]

A new keyword spotting algorithm with pre-calculated optimal thresholdsJochen Junkawitsch, L. Neubauer, Harald Höge, Günther Ruske. 2067-2070 [doi]

Detection of ambiguous portions of signal corresponding to OOV words or misrecognized portions of inputRoxane Lacouture, Yves Normandin. 2071-2074 [doi]

Techniques for approximating a trigram language modelFabio Brugnara, Marcello Federico. 2075-2078 [doi]

Unsupervised and incremental speaker adaptation under adverse environmental conditionsKeizaburo Takagi, Koichi Shinoda, Hiroaki Hattori, Takao Watanabe. 2079-2082 [doi]

An adaptive-beam pruning technique for continuous speech recognitionHugo Van Hamme, Filip Van Aelten. 2083-2086 [doi]

Data based filter design for RASTA-like channel normalization in ASRCarlos Avendaño, Sarel Van Vuuren, Hynek Hermansky. 2087-2090 [doi]

A comparison of time conditioned and word conditioned search techniques for large vocabulary speech recognitionStefan Ortmanns, Hermann Ney, Frank Seide, I. Lindam. 2091-2094 [doi]

Language-model look-ahead for large vocabulary speech recognitionStefan Ortmanns, Hermann Ney, A. Eiden. 2095-2098 [doi]

A new search algorithm in segmentation lattices of speech signalsJean-Luc Husson, Yves Laprie. 2099-2102 [doi]

LR-parser-driven viterbi search with hypotheses merging mechanism using context-dependent phone modelsTomokazu Yamada, Shigeki Sagayama. 2103-2106 [doi]

Discrete-utterance recognition with a fast match based on total data reductionJan Nouza. 2107-2110 [doi]

On-line garbage modeling with discriminant analysis for utterance verificationF. Javier Caminero-Gil, C. de la Torre, Luis Villarrubia, Cesar Martín del Alamo, Lis Hernández. 2111-2114 [doi]

Cheating with imperfect transcriptsPaul Placeway, John D. Lafferty. 2115-2118 [doi]

Novel training method for classifiers used in speaker adaptationNaoto Iwahashi. 2119-2122 [doi]

Large vocabulary word recognition based on a graph-structured dictionaryKatsuki Minamino. 2123-2126 [doi]

A word graph based n-best search in continuous speech recognitionBach-Hiep Tran, Frank Seide, Volker Steinbiss. 2127-2130 [doi]

Viterbi beam search with layered bigramsDavid M. Goblirsch. 2131-2134 [doi]

A wave decoder for continuous speech recognitionEric Buhrke, Wu Chou, Qiru Zhou. 2135-2138 [doi]

Long term on-line speaker adaptation for large vocabulary dictationEric Thelen. 2139-2142 [doi]

Incremental generation of word graphsGerhard Sagerer, Heike Rautenstrauch, Gernot A. Fink, Bernd Hildebrandt, A. Jusek, Franz Kummert. 2143-2146 [doi]

Improvement in n-best search for continuous speech recognitionIrina Illina, Yifan Gong. 2147-2150 [doi]

Sethos: the UPC speech understanding systemAntonio Bonafonte, José B. Mariño, Albino Nogueiras. 2151-2154 [doi]

Segmental search for continuous speech recognitionPietro Laface, Luciano Fissore, A. Maro, Franco Ravera. 2155-2158 [doi]

An investigation into the generation of mouth shapes for a talking headAndrew P. Breen, E. Bowers, W. Welsh. 2159-2162 [doi]

A text-to-audiovisual-speech synthesizer for FrenchBertrand Le Goff, Christian Benoît. 2163-2166 [doi]

Analysis of head movements and its role in spoken dialogueYuri Iwano, Shioya Kageyama, Emi Morikawa, Shu Nakazato, Katsuhiko Shirai. 2167-2170 [doi]

RWC multimodal database for interactions by integration of spoken language and visual informationSatoru Hayamizu, Osamu Hasegawa, Katunobu Itou, Katsuhiko Sakaue, Kazuyo Tanaka, Shigeki Nagaya, Masayuki Nakazawa, T. Endoh, Fumio Togawa, Kenji Sakamoto, Kazuhiko Yamamoto. 2171-2174 [doi]

About the relationship between eyebrow movements and F0 variationsChristian Cavé, Isabelle Guaïtella, Roxane Bertrand, Serge Santi, Françoise Harlay, Robert Espesser. 2175-2178 [doi]

How many words is a picture really worth?Laurel Fais, Kyung-ho Loken-Kim, Tsuyoshi Morimoto. 2179-2182 [doi]

Audio-visual speech perception without speech cuesHelena M. Saldaña, David B. Pisoni, Jennifer M. Fellowes, Robert E. Remez. 2187-2190 [doi]

Multilingual speech recognition at dragon systemsJim Barnett, Andrés Corrada, G. Gao, Larry Gillick, Yoshiko Ito, Steve Lowe, Linda Manganaro, Barbara Peskin. 2191-2194 [doi]

Multi-lingual phoneme recognition exploiting acoustic-phonetic similarities of soundsJoachim Köhler. 2195-2198 [doi]

Japanese speech databases for robust speech recognitionAtsushi Nakamura, Shoichi Matsunaga, Tohru Shimizu, Masahiro Tonomura, Yoshinori Sagisaka. 2199-2202 [doi]

Spoken language processing in a multilingual contextLori Lamel, Martine Adda-Decker, Jean-Luc Gauvain, Gilles Adda. 2203-2206 [doi]

Multilingual human-computer interactions: from information access to language learningVictor Zue, Stephanie Seneff, Joseph Polifroni, Helen M. Meng, James R. Glass. 2207-2210 [doi]

Speedata: multilingual spoken data entryUlla Ackermann, Bianca Angelini, Fabio Brugnara, Marcello Federico, Diego Giuliani, Roberto Gretter, Gianni Lazzari, Heinrich Niemann. 2211-2214 [doi]

Pseudo-articulatory representations in speech synthesis and recognitionWilliam H. Edmondson, Jon P. Iles, Dorota J. Iskra. 2215-2218 [doi]

Synthesis of initial (/s/-) stop-liquid clusters using HLsynDavid R. Williams. 2219-2222 [doi]

Synthesis of trillChilin Shih. 2223-2226 [doi]

Phone-based speech synthesis with neural network and articulatory controlW. K. Lo, P. C. Ching. 2227-2230 [doi]

Analysis of ten vowel sounds across gender and regional/cultural accentP. Martland, Sandra P. Whiteside, Steve W. Beet, Ladan Baghai-Ravary. 2231-2234 [doi]

Speech morphing by gradually changing spectrum parameter and fundamental frequencyMasanobu Abe. 2235-2238 [doi]

The multi-lag-window method for robust extended-range F0 determinationEdouard Geoffrois. 2239-2242 [doi]

Nonlinear estimation of DEGG signals with applications to speech pitch detectionKenneth E. Barner. 2243-2246 [doi]

Pitch analysis methods for cross-speaker comparisonJohn A. Maidment, Maria Luisa Garcia Lecumberri. 2247-2249 [doi]

Continuous adaptation of linear models with impulsive excitationSteve W. Beet, Ladan Baghai-Ravary. 2250-2253 [doi]

Quantitative analysis of the local speech rate and its application to speech synthesisSumio Ohno, Masamichi Fukumiya, Hiroya Fujisaki. 2254-2257 [doi]

A fast and reliable rate of speech detectorJan P. Verhasselt, Jean-Pierre Martens. 2258-2261 [doi]

Context modeling and clustering in continuous speech recognitionJean-Claude Junqua, Lorenzo Vassallo. 2262-2265 [doi]

Hierarchical partition of the articulatory state space for overlapping-feature based speech recognitionLi Deng, Jim Jian-Xiong Wu. 2266-2269 [doi]

A fuzzy acoustic-phonetic decoder for speech recognitionOlivier Oppizzi, David Fournier, Philippe Gilles, Henri Meloni. 2270-2273 [doi]

Syllable-level desynchronisation of phonetic features for speech recognitionKatrin Kirchhoff. 2274-2276 [doi]

A probabilistic framework for feature-based speech recognitionJames R. Glass, Jane W. Chang, Michael K. McCandless. 2277-2280 [doi]

Modeling context-dependent phonetic units in a continuous speech recognition system for Mandarin ChineseJim Jian-Xiong Wu, Li Deng, Jacky Chan. 2281-2284 [doi]

JANUS-II: towards spontaneous Spanish speech recognitionPuming Zhan, Klaus Ries, Marsal Gavaldà, Donna Gates, Alon Lavie, Alex Waibel. 2285-2288 [doi]

Reduced semi-continuous models for large vocabulary continuous speech recognition in DutchKris Demuynck, Jacques Duchateau, Dirk Van Compernolle. 2289-2292 [doi]

Validating different flexible vocabulary approaches on the Swiss French Polyphone and Polyvar databasesAndrei Constantinescu, Olivier Bornet, Gilles Caloz, Gérard Chollet. 2293-2296 [doi]

Use of a reliability coefficient in noise cancelling by neural net and weighted matching algorithmsNéstor Becerra Yoma, Fergus R. McInnes, Mervyn A. Jack. 2297-2300 [doi]

Likelihood normalization using an ergodic HMM for continuous speech recognitionKazuhiko Ozeki. 2301-2304 [doi]

Dynamic control of a production modelLaurence Candille, Henri Meloni. 2305-2308 [doi]

Speech recognition using sub-word units dependent on phonetic contexts of both training and recognition vocabulariesHiroaki Hattori, Eiko Yamada. 2309-2312 [doi]

Hidden Markov models merging acoustic and articulatory information to automatic speech recognitionBruno Jacob, Christine Sénac. 2313-2315 [doi]

Creation of unseen triphones from diphones and monophones using a speech production approachMats Blomberg, Kjell Elenius. 2316-2319 [doi]

Speaker-independent dictation of Chinese speech with 32k vocabularyBo Xu, Bing Ma, Shuwu Zhang, Fei Qu, Taiyi Huang. 2320-2323 [doi]

Using accent-specific pronunciation modelling for robust speech recognitionJ. J. Humphries, Philip C. Woodland, D. Pearce. 2324-2327 [doi]

Dictionary learning for spontaneous speech recognitionTilo Sloboda, Alex Waibel. 2328-2331 [doi]

Comparison of channel normalisation techniques for automatic speech recognition over the phoneJohan de Veth, Lou Boves. 2332-2335 [doi]

Anchor point detection for continuous speech recognition in Spanish: the spotting of phonetic eventsManuel A. Leandro, José Manuel Pardo. 2336-2339 [doi]

Cepstral compensation by polynomial approximation for environment-independent speech recognitionBhiksha Raj, Evandro Bacci Gouvêa, Pedro J. Moreno, Richard M. Stern. 2340-2343 [doi]

Effect of speech coders on speech recognition performanceB. T. Lilly, Kuldip K. Paliwal. 2344-2347 [doi]

Wavelet transforms for non-uniform speech recogntion systemsLéonard Janer, Josep Martí, Climent Nadeu, Eduardo Lleida-Solano. 2348-2351 [doi]

A binaural model as a front-end for isolated word recognitionTsuyoshi Usagawa, Markus Bodden, Klaus Rateitschek. 2352-2355 [doi]

A new speech enhancement: speech stream segregationHiroshi G. Okuno, Tomohiro Nakatani, Takeshi Kawabata. 2356-2359 [doi]

Head automata for speech translationHiyan Alshawi. 2360-2363 [doi]

Word clustering with parallel spoken language corporaYe-Yi Wang, John Lafferty, Alex Waibel. 2364-2367 [doi]

Toward translating Korean speech into other languagesJae-Woo Yang, YoungJik Lee. 2368-2370 [doi]

VERBMOBIL: the evolution of a complex large speech-to-speech translation systemThomas Bub, Johannes Schwinn. 2371-2374 [doi]

Translation of conversational speech with JANUS-IIAlon Lavie, Alex Waibel, Lori S. Levin, Donna Gates, Marsal Gavaldà, Torsten Zeppenfeld, Puming Zhan, Oren Glickman. 2375-2378 [doi]

Non-segmental analysis and synthesis based on a speech databaseAndrew Slater, John Coleman. 2379-2382 [doi]

Microsegment synthesis - economic principles in a low-cost solutionRalf Benzmüller, William J. Barry. 2383-2386 [doi]

Whistler: a trainable text-to-speech systemX. D. Huang, Alex Acero, J. Adcock, Hsiao-Wuen Hon, J. Goldsmith, J. Liu, Mike Plumpe. 2387-2390 [doi]

Generation of multiple synthesis inventories by a bootstrapping procedureThomas Portele, Karlheinz Stöber, Horst Meyer, Wolfgang Hess. 2391-2394 [doi]

Modeling segmental duration in German text-to-speech synthesisBernd Möbius, Jan P. H. van Santen. 2395-2398 [doi]

Autolabelling Japanese ToBINick Campbell. 2399-2402 [doi]

General phrase speaker verification using sub-word background models and likelihood-ratio scoringS. Parthasarathy, Aaron E. Rosenberg. 2403-2406 [doi]

Unknown-multiple signal source clustering problem using ergodic HMM and applied to speaker classificationJ. Murakami, M. Sugiyama, H. Watanabe. 2407-2410 [doi]

GMM and ARVM cooperation and competition for text-independent speaker recognition on telephone speechJean-Luc Le Floch, Claude Montacié, Marie-José Caraty. 2411-2414 [doi]

Selective use of the speech spectrum and a VQGMM method for speaker identificationQiguang Lin, Ea-Ee Jan, Chiwei Che, Dong-Suk Yuk, James L. Flanagan. 2415-2418 [doi]

Speaker verification through large vocabulary continuous speech recognitionMichael Newman, Larry Gillick, Yoshiko Ito, Don McAllaster, Barbara Peskin. 2419-2422 [doi]

Predictive neural networks in text independent speaker verification: an evaluation on the SIVA databaseAndrea Paoloni, Susanna Ragazzini, Giacomo Ravaioli. 2423-2426 [doi]

Durational characterstics of hindi consonant clustersNisheeth Shrotriya, Rajesh Verma, Sunil K. Gupta, S. S. Agrawal. 2427-2430 [doi]

The use of wavelet transforms in phoneme recognitionBeng T. Tan, Minyue Fu, Andrew Spray, Phillip Dermody. 2431-2434 [doi]

Acoustic properties of phonemes in continuous speech for different speaking rateHisao Kuwabara. 2435-2438 [doi]

Prosodic parameterization of spoken Japanese based on a model of the generation process of F0 contoursHiroya Fujisaki, Sumio Ohno. 2439-2442 [doi]

A logistic regression model for detecting prominencesArman Maghbouleh. 2443-2445 [doi]

High-quality prosodic modification of speech signalsBeat Pfister. 2446-2449 [doi]

On the syllable structures of Chinese relating to speech recognitionJialu Zhang. 2450-2453 [doi]

Can a moraic nasal occur word-initially in Japanese?Takashi Otake, Kiyoko Yoneyama. 2454-2457 [doi]

Perceptual assimilation of american English vowels by Japanese listenersWinifred Strange, Reiko Akahane-Yamada, B. H. Fitzgerald, R. Kubo. 2458-2461 [doi]

Context and speaker effects in the perceptual assimilation of German vowels by american listenersWinifred Strange, Ocke-Schwen Bohn, S. A. Trent, M. C. McNair, K. C. Bielec. 2462-2465 [doi]

Context-dependent relevance of burst and transitions for perceived place in stops: it s in production, not perceptionRoel Smits. 2470-2473 [doi]

The perception of morae in long vowels comparison among Japanese, Korean and English speakersRyoji Baba, Kaori Omuro, Hiromitsu Miyazono, Tsuyoshi Usagawa, Masahiko Higuchi. 2474-2477 [doi]

Juncture cues to disfluencyRobin J. Lickley. 2478-2481 [doi]

Effects of duration and formant movement on vowel perceptionJames R. Sawusch. 2482-2485 [doi]

Benchmarking human performance for continuous speech recognitionNeeraj Deshmukh, Richard Duncan, Aravind Ganapathiraju, Joseph Picone. 2486-2489 [doi]

Intelligibility of speech with filtered time trajectories of spectral envelopesTakayuki Arai, Misha Pavel, Hynek Hermansky, Carlos Avendaño. 2490-2493 [doi]

Perceptual use of vowel and speaker information in breath soundsDouglas H. Whalen, Sonya M. Sheffert. 2494-2497 [doi]

The role of neighborhood relative frequency in spoken word recognitionPhilippe Mousty, Monique Radeau, Ronald Peereman, Paul Bertelson. 2498-2501 [doi]

Transitional probability and phoneme monitoringJames M. McQueen, Mark A. Pitt. 2502-2505 [doi]

Identification of vowel features from French stop burstsAnne Bonneau. 2506-2509 [doi]

Listening in a second languageZinny S. Bond, Thomas J. Moore, Beverley Gable. 2510-2513 [doi]

Perception of lexical tone across languages: evidence for a linguistic mode of processingDenis Burnham, Elizabeth Francis, Di Webster, Sudaporn Luksaneeyanawin, Chayada Attapaiboon, Francisco Lacerda, Peter Keller. 2514-2517 [doi]

Acoustic correlates to the effects of talker variability on the perception of English /r/ and /l/ by Japanese listenersJames S. Magnuson, Reiko Akahane-Yamada. 2518-2521 [doi]

Natural communication with machines - progress and challengeJames L. Flanagan. 2522 [doi]

External Links

Cite Key

Statistics

PDF

Researchr

The 4th International Conference on Spoken Language Processing, Philadelphia, PA, USA, October 3-6, 1996

Abstract

Table of Contents