2012 IEEE Spoken Language Technology Workshop (SLT), Miami, FL, USA, December 2-5, 2012

2012 IEEE Spoken Language Technology Workshop (SLT), Miami, FL, USA, December 2-5, 2012. IEEE, 2012. [doi]

Conference: slt2012

A nonparametric Bayesian approach to learning multimodal interaction management. Zhuoran Wang, Oliver Lemon. 1-6 [doi]

Simultaneous feature selection and parameter optimization for training of dialog policy by reinforcement learning. Teruhisa Misu, Hideki Kashioka. 1-6 [doi]

Reinforcement learning for spoken dialogue systems using off-policy natural gradient method. Filip Jurcícek. 7-12 [doi]

Realistic answer verification: An analysis of user errors in a sentence-repetition task. Sajad Shirali-Shahreza, Gerald Penn. 19-24 [doi]

Localized detection of speech recognition errors. Svetlana Stoyanchev, Philipp Salletmayr, Jingbo Yang, Julia Hirschberg. 25-30 [doi]

Policy optimisation of POMDP-based dialogue systems without state space compression. Milica Gasic, Matthew Henderson, Blaise Thomson, Pirros Tsiakoulis, Steve Young. 31-36 [doi]

N-best error simulation for training spoken dialogue systems. Blaise Thomson, Milica Gasic, Matthew Henderson, Pirros Tsiakoulis, Steve Young. 37-42 [doi]

Affective evaluation of a mobile multimodal dialogue system using brain signals. Manolis Perakakis, Alexandros Potamianos. 43-48 [doi]

A reranking approach for recognition and classification of speech input in conversational dialogue systems. Fabrizio Morbini, Kartik Audhkhasi, Ron Artstein, Maarten Van Segbroeck, Kenji Sagae, Panayiotis G. Georgiou, David R. Traum, Shrikanth S. Narayanan. 49-54 [doi]

A critical analysis of two statistical spoken dialog systems in public use. Jason D. Williams. 55-60 [doi]

POMDP-based Let's Go system for spoken dialog challenge. Sungjin Lee, Maxine Eskenazi. 61-66 [doi]

Employing boosting to compare cues to verbal feedback in multi-lingual dialog. Gina-Anne Levow, Siwei Wang. 67-72 [doi]

Crowdsourcing the acquisition of natural language corpora: Methods and observations. William Yang Wang, Dan Bohus, Ece Kamar, Eric Horvitz. 73-78 [doi]

Exploiting loudness dynamics in stochastic models of turn-taking. Kornel Laskowski. 79-84 [doi]

Word segmentation through cross-lingual word-to-phoneme alignment. Felix Stahlberg, Tim Schlippe, Stephan Vogel, Tanja Schultz. 85-90 [doi]

Class-based speech recognition using a maximum dissimilarity criterion and a tolerance classification margin. Arseniy Gorin, Denis Jouvet. 91-96 [doi]

On the generalization of Shannon entropy for speech recognition. Nicolas Obin, Marco Liuni. 97-102 [doi]

A noise-robust speech recognition method composed of weak noise suppression and weak Vector Taylor Series Adaptation. Shuji Komeiji, Takayuki Arakawa, Takafumi Koshinaka. 103-106 [doi]

Improving large vocabulary continuous speech recognition by combining GMM-based and reservoir-based acoustic modeling. Fabian Triefenbach, Kris Demuynck, Jean-Pierre Martens. 107-112 [doi]

Recognition rate estimation based on word alignment network and discriminative error type classification. Atsunori Ogawa, Takaaki Hori, Atsushi Nakamura. 113-118 [doi]

American sign language fingerspelling recognition with phonological feature-based tandem models. Taehwan Kim, Karen Livescu, Gregory Shakhnarovich. 119-124 [doi]

Efficient prior and incremental beam width control to suppress excessive speech recognition time based on score range estimation. Satoshi Kobashikawa, Takaaki Hori, Yoshikazu Yamaguchi, Taichi Asami, Hirokazu Masataki, Satoshi Takahashi. 125-130 [doi]

Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM. Jinyu Li, Dong Yu, Jui-Ting Huang, Yifan Gong. 131-136 [doi]

Combining cepstral normalization and cochlear implant-like speech processing for microphone array-based speech recognition. Cong-Thanh Do, Mohammad J. Taghizadeh, Philip N. Garner. 137-142 [doi]

Context-dependent Deep Neural Networks for audio indexing of real-life data. Gang Li, Huifeng Zhu, Gong Cheng, Kit Thambiratnam, Behrooz Chitsaz, Dong Yu, Frank Seide. 143-148 [doi]

Audio-visual feature integration based on piecewise linear transformation for noise robust automatic speech recognition. Yosuke Kashiwagi, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose. 149-152 [doi]

Intent transfer in speech-to-speech machine translation. Gopala Krishna Anumanchipalli, Luís C. Oliveira, Alan W. Black. 153-158 [doi]

Using syntactic and confusion network structure for out-of-vocabulary word detection. Alex Marin, Tom Kwiatkowski, Mari Ostendorf, Luke S. Zettlemoyer. 159-164 [doi]

Topic n-gram count language model adaptation for speech recognition. Md. Akmal Haidar, Douglas D. O'Shaughnessy. 165-169 [doi]

Using rhythmic features for Japanese spoken term detection. Naoyuki Kanda, Ryu Takeda, Yasunari Obuchi. 170-175 [doi]

Discriminative spoken language understanding using word confusion networks. Matthew Henderson, Milica Gasic, Blaise Thomson, Pirros Tsiakoulis, Kai Yu, Steve Young. 176-181 [doi]

Improved semantic retrieval of spoken content by language models enhanced with acoustic similarity graph. Hung-yi Lee, Tsung-Hsien Wen, Lin-Shan Lee. 182-187 [doi]

Personalized language modeling by crowd sourcing with social network data for voice access of cloud applications. Tsung-Hsien Wen, Hung-yi Lee, Tai-Yuan Chen, Lin-Shan Lee. 188-193 [doi]

Combining multiple translation systems for Spoken Language Understanding portability. Fernando García, Lluís F. Hurtado, Encarna Segarra, Emilio Sanchis, Giuseppe Riccardi. 194-198 [doi]

Joint language models for automatic speech recognition and understanding. Ali Orkan Bayer, Giuseppe Riccardi. 199-203 [doi]

Incorporating syllable duration into line-detection-based spoken term detection. Teppei Ohno, Tomoyosi Akiba. 204-209 [doi]

Use of kernel deep convex networks and end-to-end learning for spoken language understanding. Li Deng, Gökhan Tür, Xiaodong He, Dilek Z. Hakkani-Tür. 210-215 [doi]

Statistical semantic interpretation modeling for spoken language understanding with enriched semantic features. Asli Çelikyilmaz, Dilek Z. Hakkani-Tür, Gökhan Tür. 216-221 [doi]

Modeling multiword phrases with constrained phrase trees for improved topic modeling of conversational speech. Timothy J. Hazen, Fred Richardson. 222-227 [doi]

Exploiting the Semantic Web for unsupervised spoken language understanding. Larry P. Heck, Dilek Hakkani-Tür. 228-233 [doi]

Context dependent recurrent neural network language model. Tomas Mikolov, Geoffrey Zweig. 234-239 [doi]

What makes this voice sound so bad? A multidimensional analysis of state-of-the-art text-to-speech systems. Florian Hinterleitner, Christoph Norrenbrock, Sebastian Möller, Ulrich Heute. 240-245 [doi]

Unsupervised cross-lingual knowledge transfer in DNN-based LVCSR. Pawel Swietojanski, Arnab Ghoshal, Steve Renals. 246-251 [doi]

Syllable-based prosodic analysis of Amharic read speech. Oliver Jokisch, Yitagessu Birhanu, Rüdiger Hoffmann. 252-257 [doi]

Reactive and continuous control of HMM-based speech synthesis. Maria Astrinaki, Nicolas D'Alessandro, Benjamin Picart, Thomas Drugman, Thierry Dutoit. 252-257 [doi]

MediaParl: Bilingual mixed language accented speech database. David Imseng, Hervé Bourlard, Holger Caesar, Philip N. Garner, Gwénolé Lecorve, Alexandre Nanchen. 263-268 [doi]

Comparison of adaptation methods for GMM-SVM based speech emotion recognition. Jianbo Jiang, Zhiyong Wu, Mingxing Xu, Jia Jia, Lianhong Cai. 269-273 [doi]

On the use of phone log-likelihood ratios as features in spoken language recognition. Mireia Díez, Amparo Varona, Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Germán Bordel. 274-279 [doi]

Speaker diarization and linking of large corpora. Marc Ferras, Hervé Bourlard. 280-285 [doi]

A grapheme-based method for automatic alignment of speech and text data. Adriana Stan, Peter Bell, Simon King. 286-290 [doi]

Statistical methods for varying the degree of articulation in new HMM-based voices. Benjamin Picart, Thomas Drugman, Thierry Dutoit. 291-296 [doi]

Synthesizing expressive speech from amateur audiobook recordings. Éva Székely, Tamás Gábor Csapó, Bálint Tóth, Péter Mihajlik, Julie Carson-Berndsen. 297-302 [doi]

Frame-based phonotactic Language Identification. Kyu J. Han, Jason W. Pelecanos. 303-306 [doi]

Noisy channel adaptation in language identification. Sriram Ganapathy, Mohamed Kamal Omar, Jason W. Pelecanos. 307-312 [doi]

Exemplar-based voice conversion in noisy environment. Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki. 313-317 [doi]

Optimization of the DET curve in speaker verification. L. Paola García-Perera, Juan Arturo Nolazco-Flores, Bhiksha Raj, Richard M. Stern. 318-323 [doi]

Transcription of multi-genre media archives using out-of-domain data. P. J. Bell, M. J. F. Gales, P. Lanchantin, Xunying Liu, Y. Long, S. Renals, P. Swietojanski, Philip C. Woodland. 324-329 [doi]

Acoustic modeling for under-resourced languages based on vectorial HMM-states representation using Subspace Gaussian Mixture Models. Mohamed Bouallegue, Emmanuel Ferreira, Driss Matrouf, Georges Linares, Maria Goudi, Pascal Nocera. 330-335 [doi]

The language-independent bottleneck features. Karel Veselý, Martin Karafiát, Frantisek Grézl, Milos Janda, Ekaterina Egorova. 336-341 [doi]

Towards a new speech event detection approach for landmark-based speech recognition. Stefan Ziegler, Bogdan Ludusan, Guillaume Gravier. 342-347 [doi]

Recovery of acronyms, out-of-lattice words and pronunciations from parallel multilingual speech. João Miranda, João Paulo Neto, Alan W. Black. 348-353 [doi]

The Bavieca open-source speech recognition toolkit. Daniel Bolanos. 354-359 [doi]

Active learning for accent adaptation in Automatic Speech Recognition. Udhyakumar Nallasamy, Florian Metze, Tanja Schultz. 360-365 [doi]

Adaptation of context-dependent deep neural networks for automatic speech recognition. Kaisheng Yao, Dong Yu, Frank Seide, Hang Su, Li Deng, Yifan Gong. 366-369 [doi]

Deep-level acoustic-to-articulatory mapping for DBN-HMM based phone recognition. Leonardo Badino, Claudia Canevari, Luciano Fadiga, Giorgio Metta. 370-375 [doi]

Modeling intensity contours and the interaction of pitch and intensity to improve automatic prosodic event detection and classification. Andrew Rosenberg. 376-381 [doi]

A comparison-based approach to mispronunciation detection. Ann Lee, James R. Glass. 382-387 [doi]

Automatic classification of unequal lexical stress patterns using machine learning algorithms. Mostafa Ali Shahin, Beena Ahmed, Kirrie J. Ballard. 388-391 [doi]

The FAU Video Lecture Browser system. Korbinian Riedhammer, Martin Gropp, Elmar Nöth. 392-397 [doi]

Automatic transcription of academic lectures from diverse disciplines. Ghada AlHarbi, Thomas Hain. 398-403 [doi]

Lexical entrainment and success in student engineering groups. Heather Friedberg, Diane J. Litman, Susannah B. F. Paletz. 404-409 [doi]

Automatic detection and correction of syntax-based prosody annotation errors. Sandrine Brognaux, Thomas Drugman, Richard Beaufort. 410-415 [doi]

Train&align: A new online tool for automatic phonetic alignment. Sandrine Brognaux, Sophie Roekhaut, Thomas Drugman, Richard Beaufort. 416-421 [doi]

Combining criteria for the detection of incorrect entries of non-native speech in the context of foreign language learning. Luiza Orosanu, Denis Jouvet, Dominique Fohr, Irina Illina, Anne Bonneau. 422-427 [doi]

Performance improvement of automatic pronunciation assessment in a noisy classroom. Yi Luan, Masayuki Suzuki, Yutaka Yamauchi, Nobuaki Minematsu, Shuhei Kato, Keikichi Hirose. 428-431 [doi]

An automatic pitch accent feedback system for English learners with adaptation of an English corpus spoken by Koreans. Sechun Kang, Gary Geunbae Lee, Ho-Young Lee, Byeongchang Kim. 432-437 [doi]

Robust detection of voiced segments in samples of everyday conversations using unsupervised HMMs. Meysam Asgari, Izhak Shafran, Alireza Bayestehtashk. 438-442 [doi]

Generating grammar questions using corpus data in L2 learning. Kyusong Lee, Soo-Ok Kweon, Hongsuck Seo, Gary Geunbae Lee. 443-448 [doi]

Analysis of speech transcripts to predict winners of U.S. Presidential and Vice-Presidential debates. Ian Kaplan, Andrew Rosenberg. 449-454 [doi]

Speech-based emotion classification using multiclass SVM with hybrid kernel and thresholding fusion. Na Yang, R. Muraleedharan, J. Kohl, Ilker Demirkol, Wendi Rabiner Heinzelman, Melissa Sturge-Apple. 455-460 [doi]

Two-layer mutually reinforced random walk for improved multi-party meeting summarization. Yun-Nung Chen, Florian Metze. 461-466 [doi]

Ecological validity and the evaluation of speech summarization quality. Anthony McCallum, Gerald Penn, Cosmin Munteanu, Xiaodan Zhu. 467-472 [doi]

Automatic Chinese pronunciation error detection using SVM trained with structural features. Tongmu Zhao, Akemi Hoshino, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose. 473-478 [doi]

Evaluating the effect of normalizing informal text on TTS output. Deana Pennell, Yang Liu. 479-483 [doi]
