1993 | -- | 2005 | William Hartmann, Arun Narayanan, Eric Fosler-Lussier, DeLiang Wang. A Direct Masking Approach to Robust ASR |
2006 | -- | 2014 | Yow-Bang Wang, Shang-wen Li, Lin-Shan Lee. An Experimental Analysis on Integrating Multi-Stream Spectro-Temporal, Cepstral and Pitch Information for Mandarin Speech Recognition |
2015 | -- | 2028 | Stephen Shum, Najim Dehak, Réda Dehak, James R. Glass. Unsupervised Methods for Speaker Diarization: An Integrated and Iterative Approach |
2029 | -- | 2041 | Zbynek Koldovský, Jirí Málek, Petr Tichavský, Francesco Nesta. Semi-Blind Noise Extraction Using Partially Known Position of the Target Source |
2042 | -- | 2056 | Mads Graesboll Christensen. Accurate Estimation of Low Fundamental Frequencies From Real-Valued Measurements |
2057 | -- | 2072 | Philippe Esling, Carlos Agon. Multiobjective Time Series Matching for Audio Classification and Retrieval |
2073 | -- | 2084 | Chao Zhang, Yi Liu, Yunqing Xia, Xuan Wang, Chin-Hui Lee. Reliable Accent-Specific Unit Generation With Discriminative Dynamic Gaussian Mixture Selection for Multi-Accent Chinese Speech Recognition |
2085 | -- | 2095 | Gilles Degottex, Yannis Stylianou. Analysis and Synthesis of Speech Using an Adaptive Full-Band Harmonic Model |
2096 | -- | 2107 | Bilei Zhu, Wei Li, Ruijiang Li, Xiangyang Xue. Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation |
2108 | -- | 2117 | Sadao Hiroya. Non-Negative Temporal Decomposition of Speech Parameters by Multiplicative Update Rules |
2118 | -- | 2128 | Cyril Joder, Slim Essid, Gaël Richard. Learning Optimal Features for Polyphonic Audio-to-Score Alignment |
2129 | -- | 2139 | Zhen-Hua Ling, Li Deng, Dong Yu. Modeling Spectral Envelopes Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis |
2140 | -- | 2151 | Nasser Mohammadiha, Paris Smaragdis, Arne Leijon. Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization |
2152 | -- | 2161 | Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee. Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems |
2162 | -- | 2171 | Nikolay D. Gaubitch, Mike Brookes, Patrick A. Naylor. Blind Channel Magnitude Response Estimation in Speech Using Spectrum Classification |
2172 | -- | 2181 | Masayuki Suzuki, Takuya Yoshioka, Shinji Watanabe, Nobuaki Minematsu, Keikichi Hirose. Feature Enhancement With Joint Use of Consecutive Corrupted and Noise Feature Vectors With Discriminative Region Weighting |
2182 | -- | 2192 | Takuya Yoshioka, Tomohiro Nakatani. Noise Model Transfer: Novel Approach to Robustness Against Nonstationary Noise |
2193 | -- | 2206 | Despoina Pavlidi, Anthony Griffin, Matthieu Puigt, Athanasios Mouchtaris. Real-Time Multiple Sound Source Localization and Counting Using a Circular Microphone Array |
2207 | -- | 2220 | Sefki Kolozali, Mathieu Barthet, György Fazekas, Mark Sandler. Automatic Ontology Generation for Musical Instruments Based on Audio Analysis |