Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 21, Issue 10

1993 -- 2005William Hartmann, Arun Narayanan, Eric Fosler-Lussier, DeLiang Wang. A Direct Masking Approach to Robust ASR
2006 -- 2014Yow-Bang Wang, Shang-wen Li, Lin-Shan Lee. An Experimental Analysis on Integrating Multi-Stream Spectro-Temporal, Cepstral and Pitch Information for Mandarin Speech Recognition
2015 -- 2028Stephen Shum, Najim Dehak, Réda Dehak, James R. Glass. Unsupervised Methods for Speaker Diarization: An Integrated and Iterative Approach
2029 -- 2041Zbynek Koldovský, Jirí Málek, Petr Tichavský, Francesco Nesta. Semi-Blind Noise Extraction Using Partially Known Position of the Target Source
2042 -- 2056Mads Graesboll Christensen. Accurate Estimation of Low Fundamental Frequencies From Real-Valued Measurements
2057 -- 2072Philippe Esling, Carlos Agon. Multiobjective Time Series Matching for Audio Classification and Retrieval
2073 -- 2084Chao Zhang, Yi Liu, Yunqing Xia, Xuan Wang, Chin-Hui Lee. Reliable Accent-Specific Unit Generation With Discriminative Dynamic Gaussian Mixture Selection for Multi-Accent Chinese Speech Recognition
2085 -- 2095Gilles Degottex, Yannis Stylianou. Analysis and Synthesis of Speech Using an Adaptive Full-Band Harmonic Model
2096 -- 2107Bilei Zhu, Wei Li, Ruijiang Li, Xiangyang Xue. Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation
2108 -- 2117Sadao Hiroya. Non-Negative Temporal Decomposition of Speech Parameters by Multiplicative Update Rules
2118 -- 2128Cyril Joder, Slim Essid, Gaël Richard. Learning Optimal Features for Polyphonic Audio-to-Score Alignment
2129 -- 2139Zhen-Hua Ling, Li Deng, Dong Yu. Modeling Spectral Envelopes Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis
2140 -- 2151Nasser Mohammadiha, Paris Smaragdis, Arne Leijon. Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization
2152 -- 2161Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee. Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems
2162 -- 2171Nikolay D. Gaubitch, Mike Brookes, Patrick A. Naylor. Blind Channel Magnitude Response Estimation in Speech Using Spectrum Classification
2172 -- 2181Masayuki Suzuki, Takuya Yoshioka, Shinji Watanabe, Nobuaki Minematsu, Keikichi Hirose. Feature Enhancement With Joint Use of Consecutive Corrupted and Noise Feature Vectors With Discriminative Region Weighting
2182 -- 2192Takuya Yoshioka, Tomohiro Nakatani. Noise Model Transfer: Novel Approach to Robustness Against Nonstationary Noise
2193 -- 2206Despoina Pavlidi, Anthony Griffin, Matthieu Puigt, Athanasios Mouchtaris. Real-Time Multiple Sound Source Localization and Counting Using a Circular Microphone Array
2207 -- 2220Sefki Kolozali, Mathieu Barthet, György Fazekas, Mark Sandler. Automatic Ontology Generation for Musical Instruments Based on Audio Analysis