1094 | -- | 1106 | Hamed Ketabdar, Hervé Bourlard. Enhanced Phone Posteriors for Improving Speech Recognition Systems |
1107 | -- | 1115 | Sergio Canazza, Giovanni De Poli, Gian Antonio Mian. Restoration of Audio Documents by Means of Extended Kalman Filter |
1116 | -- | 1126 | Chunghsin Yeh, Axel Röbel, Xavier Rodet. Multiple Fundamental Frequency Estimation and Polyphony Inference of Polyphonic Music Signals |
1127 | -- | 1136 | Jiucang Hao, Te-Won Lee, Terrence J. Sejnowski. Speech Enhancement Using Gaussian Scale Mixture Models |
1137 | -- | 1146 | R. Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen. Integrated Active Noise Control and Noise Reduction in Hearing Aids |
1147 | -- | 1157 | Justin Jian Zhang, Ricky Ho Yin Chan, Pascale Fung. Extractive Speech Summarization Using Shallow Rhetorical Structure Modeling |
1158 | -- | 1169 | Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li, Chin-Hui Lee. A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition |
1170 | -- | 1181 | Chung-Hsien Wu, Chao-Hong Liu, M. Harris, Liang-Chih Yu. Sentence Correction Incorporating Relative Position and Parse Template Language Models |
1182 | -- | 1192 | Robbie Vogt, Sridha Sridharan, Michael Mason. Making Confident Speaker Verification Decisions With Minimal Speech |
1193 | -- | 1207 | Dimitri Nion, Kleanthis N. Mokios, Nicholas D. Sidiropoulos, Alexandros Potamianos. Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures |
1208 | -- | 1217 | Vaclav Eksler, Milan Jelinek. Glottal-Shape Codebook to Improve Robustness of CELP Codecs |
1218 | -- | 1227 | Moo-young Kim, W. Bastiaan Kleijn. Reduction of the Impact of Distortion Outliers and Source Mismatch in Resolution-Constrained Quantization |
1228 | -- | 1242 | Maurice F. Fallon, Simon J. Godsill. Acoustic Source Localization and Tracking Using Track Before Detect |
1243 | -- | 1257 | Xiaoqiang Xiao, Robert M. Nickel. Speech Enhancement With Inventory Style Speech Resynthesis |
1258 | -- | 1268 | Angel M. Gomez, Jose L. Carmona, Antonio M. Peinado, Victoria E. Sánchez. A Multipulse-Based Forward Error Correction Technique for Robust CELP-Coded Speech Transmission Over Erasure Channels |
1269 | -- | 1279 | Matt Gibson 0002, Thomas Hain. Error Approximation and Minimum Phone Error Acoustic Model Estimation |
1280 | -- | 1289 | Matthias Mauch, Simon Dixon. Simultaneous Estimation of Chords and Musical Context From Audio |
1290 | -- | 1299 | Jingen Ni, Feng Li. A Variable Step-Size Matrix Normalized Subband Adaptive Filter |
1300 | -- | 1312 | Chang Huai You, Kong-Aik Lee, Haizhou Li. GMM-SVM Kernel With a Bhattacharyya-Based Distance for Speaker Recognition |
1313 | -- | 1322 | Ilknur Durgar El-Kahlout, Kemal Oflazer. Exploiting Morphology and Local Word Reordering in English-to-Turkish Phrase-Based Statistical Machine Translation |
1323 | -- | 1331 | Jingbo Zhu, Huizhen Wang, Benjamin K. Tsou, Matthew Y. Ma. Active Learning With Sampling by Uncertainty and Density for Data Annotations |
1332 | -- | 1340 | Chi-Sang Jung, Moo-young Kim, Hong-Goo Kang. Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information |
1341 | -- | 1353 | Jose L. Carmona, Antonio M. Peinado, José L. Pérez-Córdoba, Angel M. Gomez. MMSE-Based Packet Loss Concealment for CELP-Coded Speech Recognition |
1354 | -- | 1365 | Kentaro Ishizuka, Shoko Araki, T. Kawahara. Speech Activity Detection for Multi-Party Conversation Analyses Based on Likelihood Ratio Test on Spatial Magnitude |
1366 | -- | 1378 | Marc Ferras, Cheung Chi Leung, Claude Barras, Jean-Luc Gauvain. Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition |
1379 | -- | 1393 | Hynek Boril, John H. L. Hansen. Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments |
1394 | -- | 1405 | Chung-Hsien Wu, Chi-Chun Hsia, Chung-Han Lee, Mai-Chun Lin. Hierarchical Prosody Conversion Using Regression-Based Clustering for Emotional Speech Synthesis |
1406 | -- | 1416 | Keansub Lee, Daniel P. W. Ellis. Audio-Based Semantic Concept Classification for Consumer Video |
1417 | -- | 1428 | C. Tantibundhit, Franz Pernkopf, Gernot Kubin. Joint Time-Frequency Segmentation Algorithm for Transient Speech Decomposition and Speech Enhancement |
1429 | -- | 1439 | Eric A. Lehmann, Anders M. Johansson. Diffuse Reverberation Model for Efficient Image-Source Simulation of Room Impulse Responses |
1440 | -- | 1454 | Ø. Birkenes, T. Matsui, K. Tanabe, Sabato Marco Siniscalchi, Tor André Myrvoll, M. H. Johnsen. Penalized Logistic Regression With HMM Log-Likelihood Regressors for Speech Recognition |
1455 | -- | 1463 | Jerome R. Bellegarda. A Dynamic Cost Weighting Framework for Unit Selection Text-to-Speech Synthesis |
1464 | -- | 1475 | Mathieu Parvaix, Laurent Girin, Jean-Marc Brossier. A Watermarking-Based Method for Informed Source Separation of Audio Signals With a Single Sensor |
1476 | -- | 1485 | Hirofumi Nakajima, Kazuhiro Nakadai, Yuji Hasegawa, Hiroshi Tsujino. Blind Source Separation With Parameter-Free Adaptive Step-Size Method for Robot Audition |
1486 | -- | 1495 | M. Akbacak, John H. L. Hansen. Spoken Proper Name Retrieval for Limited Resource Languages Using Multilingual Hybrid Representations |
1496 | -- | 1506 | Mitchell McLaren, Robbie Vogt, Brendan Baker, Sridha Sridharan. Data-Driven Background Dataset Selection for SVM-Based Speaker Verification |
1507 | -- | 1516 | Hirokazu Kameoka, Nobutaka Ono, Shigeki Sagayama. Speech Spectrum Modeling for Joint Estimation of Spectral Envelope and Fundamental Frequency |
1517 | -- | 1527 | Andre Holzapfel, Yannis Stylianou, Ali Cenk Gedik, Baris Bozkurt. Three Dimensions of Pitched Instrument Onset Detection |
1528 | -- | 1538 | Panikos Heracleous, V.-A. Tran, T. Nagai, Kiyohiro Shikano. Analysis and Recognition of NAM Speech Using HMM Distances and Visual Information |
1539 | -- | 1549 | Yuya Akita, Tatsuya Kawahara. Statistical Transformation of Language and Pronunciation Models for Spontaneous Speech Recognition |
1550 | -- | 1561 | Charles Verron, Mitsuko Aramaki, Richard Kronland-Martinet, Grégory Pallone. A 3-D Immersive Synthesizer for Environmental Sounds |
1562 | -- | 1574 | Yi-Cheng Pan, Lin-Shan Lee. Performance Analysis for Lattice-Based Speech Indexing Approaches Using Words and Subword Units |
1575 | -- | 1587 | Mehrez Souden, Jacob Benesty, Sofiène Affes. Broadband Source Localization From an Eigenanalysis Perspective |
1588 | -- | 1600 | Péter Mihajlik, Zoltán Tüske, B. Tarjan, Bottyán Németh, Tibor Fegyó. Improved Recognition of Spontaneous Hungarian Speech - Morphological and Acoustic Modeling Techniques for a Less Resourced Task |
1601 | -- | 1611 | Gökhan Tür, Andreas Stolcke, L. Voss, Stanley Peters, Dilek Hakkani-Tür, John Dowding, Benoît Favre, Raquel Fernández, Matthew Frampton, Michael W. Frandsen, C. Frederickson, Martin Graciarena, D. Kintzing, K. Leveque, S. Mason, John Niekrasz, Matthew Purver, Korbinian Riedhammer, Elizabeth Shriberg, Jing Tien, Dimitra Vergyri, Fan Yang. The CALO Meeting Assistant System |
1612 | -- | 1623 | Bengt J. Borgstrom, Abeer Alwan. HMM-Based Reconstruction of Unreliable Spectrographic Data for Noise Robust Speech Recognition |
1624 | -- | 1631 | Sriram Ganapathy, Petr Motlícek, Hynek Hermansky. Autoregressive Models of Amplitude Modulations in Audio Compression |
1632 | -- | 1642 | Hyeon-Jin Jeon, Tae-Gyu Chang, Sen M. Kuo. Analysis of Frequency Mismatch in Narrowband Active Noise Control |
1643 | -- | 1654 | Valentin Emiya, Roland Badeau, Bertrand David. Multipitch Estimation of Piano Sounds Using a New Probabilistic Spectral Smoothness Principle |
1655 | -- | 1666 | Thushara D. Abhayapala, Aastha Gupta. Spherical Harmonic Analysis of Wavefields Using Multiple Circular Sensor Arrays |