Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 15, Issue 3

749 -- 755Pradeepa Yahampath, Paul Rondeau. Multiple-Description Predictive-Vector Quantization With Applications to Low Bit-Rate Speech Coding Over Networks
756 -- 769Ethan R. Duni, Bhaskar D. Rao. High-Rate Optimized Recursive Vector Quantization Structures Using Hidden Markov Models
770 -- 783Ethan R. Duni, Bhaskar D. Rao. A High-Rate Optimal Transform Coder With Gaussian Mixture Companders
784 -- 795Brian Kan-Wing Mak, Roger Wend-Huu Hsiao. Kernel Eigenspace-Based MLLR Adaptation
796 -- 802Bertrand Rivet, Laurent Girin, Christian Jutten. Log-Rayleigh Distribution: A Simple and Efficient Statistical Representation of Log-Spectral Coefficients
803 -- 812Patricia Scanlon, Daniel P. W. Ellis, Richard B. Reilly. Using Broad Phonetic Group Experts for Improved Speech Recognition
813 -- 822Barbara Resch, Mattias Nilsson, Anders Ekman, W. Bastiaan Kleijn. Estimation of the Instantaneous Pitch of Speech
823 -- 837Francesco Gianfelici, Giorgio Biagetti, Paolo Crippa, Claudio Turchetti. Multicomponent AM-FM Representations: An Asymptotically Exact Approach
838 -- 850Dima Ruinskiy, Y. Lavner. An Effective Algorithm for Automatic Detection and Exact Demarcation of Breath Sounds in Speech and Song Signals
851 -- 861Laurent Girin, Mohammad Firouzmand, Sylvain Marchand. Perceptual Long-Term Variable-Rate Sinusoidal Modeling of Speech
862 -- 872Jesper Jensen, Richard Heusdens. Improved Subspace-Based Single-Channel Speech Enhancement Using Generalized Super-Gaussian Priors
873 -- 881Juho Kontio, Laura Laaksonen, Paavo Alku. Neural Network-Based Artificial Bandwidth Expansion of Speech
882 -- 892David Y. Zhao, W. Bastiaan Kleijn. HMM-Based Gain Modeling for Enhancement of Speech in Noise
893 -- 900M. Khademul Islam Molla, Keikichi Hirose. Single-Mixture Audio Source Separation by Subspace Decomposition of Hilbert Spectrum
901 -- 917Karsten Vandborg Sorensen, Sren Vang Andersen. Rayleigh Mixture Model-Based Hidden Markov Modeling and Estimation of Noise in Noisy Speech Signals
918 -- 927Richard C. Hendriks, Rainer Martin. MAP Estimators for Speech Enhancement Under Normal and Rayleigh Inverse Gaussian Distributions
928 -- 938Nikos Chatzichrisafis, Vassilios Diakoloukas, Vassilios Digalakis, Costas Harizakis. Gaussian Mixture Clustering and Language Adaptation for the Development of a New Language Speech Recognition System
939 -- 948Ghinwa F. Choueiter, James R. Glass. An Implementation of Rational Wavelets and Filter Design for Phonetic Classification
949 -- 956Esther Klabbers, Jan P. H. van Santen, Alexander Kain. The Contribution of Various Sources of Spectral Mismatch to Audible Discontinuities in a Diphone Database
957 -- 965Jerome R. Bellegarda. Globally Optimal Training of Unit Boundaries in Unit Selection Text-to-Speech Synthesis
966 -- 981Pim Korten, Jesper Jensen, Richard Heusdens. High-Resolution Spherical Quantization of Sinusoidal Parameters
982 -- 994Hirokazu Kameoka, Takuya Nishimoto, Shigeki Sagayama. A Multipitch Analyzer Based on Harmonic Temporal Structured Clustering
995 -- 1008Johannes Nix, Volker Hohmann. Combined Estimation of Spectral Envelopes and Sound Source Direction of Concurrent Voices by Multidimensional Statistical Filtering
1009 -- 1020Matthew E. P. Davies, Mark D. Plumbley. Context-Dependent Beat Tracking of Musical Audio
1021 -- 1029Leevi Peltola, Cumhur Erkut, Perry R. Cook, Vesa Välimäki. Synthesis of Hand Clapping Sounds
1030 -- 1034Jean-Marc Valin. On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk
1035 -- 1043James D. Gordy, Rafik A. Goubran. Statistical Analysis of Doubletalk Detection for Calibration and Performance Evaluation
1044 -- 1052Felix Albu, Martin Bouchard, Yuriy V. Zakharov. Pseudo-Affine Projection Algorithms for Multichannel Active Noise Control
1053 -- 1065Jacob Benesty, Jingdong Chen, Yiteng Huang, Jacek Dmochowski. On Microphone-Array Beamforming From a MIMO Acoustic Signal Processing Perspective
1066 -- 1074Tuomas Virtanen. Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria
1075 -- 1086Carlos Busso, Zhigang Deng, Michael Grimm, Ulrich Neumann, Shrikanth Narayanan. Rigid Head Motion in Expressive Speech Animation: Analysis and Synthesis
1087 -- 1097Chen Yang, Frank K. Soong, Tan Lee. Static and Dynamic Spectral Features: Their Noise Robustness and Optimal Weights for ASR
1098 -- 1113Luis Buera, Eduardo Lleida, A. Miguel, Alfonso Ortega, O. Saz. Cepstral Vector Normalization Based on Stereo Data for Robust Speech Recognition
1114 -- 1122Xianyu Zhao, Zhijian Ou. Closely Coupled Array Processing and Model-Based Compensation for Microphone Array Speech Recognition