Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 16, Issue 8

1361 -- 1372E. Ravelli, G. Richard, Laurent Daudet. Union of MDCT Bases for Audio Coding
1373 -- 1382Olivier Derrien, G. Richard. A New Model-Based Algorithm for Optimizing the MPEG-AAC in MS-Stereo
1383 -- 1395Alberto Carini, S. Malatini. Optimal Variable Step-Size NLMS Algorithms With Auxiliary Noise Power Scheduling for Feedforward Active Noise Control
1396 -- 1408Miguel Ferrer, Alberto Gonzalez, Maria de Diego, Gema Pinero. Fast Affine Projection Algorithms for Filtered-x Multichannel Active Noise Control
1409 -- 1419Ming Wu, Guoyue Chen, Xiaojun Qiu. An Improved Active Noise Control Algorithm Without Secondary Path Identification Based on the Frequency-Domain Subband Architecture
1420 -- 1432Jian-Wu Xu, José Carlos Príncipe. A Pitch Detector Based on a Generalized Correlation Function
1433 -- 1451Emanuel A. P. Habets, Sharon Gannot, Israel Cohen, P. Sommen. Joint Dereverberation and Residual Echo Suppression of Speech Signals in Noisy Environments
1452 -- 1465J. H. Gunther, G. Wilson. Mean-Squared Error Analysis of Adaptive Subband-Based System Identification
1466 -- 1478Constantin Paleologu, Jacob Benesty, Silviu Ciochina. A Variable Step-Size Affine Projection Algorithm Designed for Acoustic Echo Cancellation
1479 -- 1489J. Scheuing, Bin Yang. Disambiguation of TDOA Estimation for Multiple Sources in Reverberant Environments
1490 -- 1502Jacek Dmochowski, Jacob Benesty, Sofiène Affes. Linearly Constrained Minimum Variance Source Localization and Spectral Estimation
1503 -- 1511Jeroen Breebaart, Erik Schuijers. Phantom Materialization: A Novel Method to Enhance Stereo Audio Reproduction on Headphones
1512 -- 1527Tomohiro Nakatani, Biing-Hwang Juang, Takuya Yoshioka, Keisuke Kinoshita, Marc Delcroix, Masato Miyoshi. Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model
1528 -- 1540A. Abramson, I. Cohen. Single-Sensor Audio Source Separation Using Classification and Estimation Approach and GARCH Modeling
1541 -- 1550Chang-Hsing Lee, Chin-Chuan Han, Ching-Chien Chuang. Automatic Classification of Bird Species From Their Sounds Using Two-Dimensional Cepstral Coefficients
1551 -- 1564Shahram Khadivi, Hermann Ney. Integration of Speech Recognition and Machine Translation in Computer-Assisted Translation
1565 -- 1578Juan Manuel Górriz, Javier Ramírez, Elmar Wolfgang Lang, Carlos García Puntonet. Jointly Gaussian PDF-Based Likelihood Ratio Test for Voice Activity Detection
1579 -- 1589Tiago H. Falk, Wai-Yip Chan. Hybrid Signal-and-Link-Parametric Speech Quality Measurement for VoIP Communications
1590 -- 1601K. J. Han, S. Kim, S. S. Narayanan. Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarization
1602 -- 1613K. S. R. Murty, B. Yegnanarayana. Epoch Extraction From Speech Signals
1614 -- 1623E. Plourde, B. Champagne. Auditory-Based Spectral Amplitude Estimators for Speech Enhancement
1624 -- 1632Benny Sallberg, Nedelko Grbic, Ingvar Claesson. Complex-Valued Independent Component Analysis for Online Blind Speech Extraction
1633 -- 1641Hai Huyen Dam, Hai Quang Dam, Sven Nordholm. Noise Statistics Update Adaptive Beamformer With PSD Estimation for Speech Extraction in Noisy Environment
1642 -- 1653Donglai Zhu, Haizhou Li, Bin Ma, Chin-Hui Lee. Optimizing the Performance of Spoken Language Recognition With Discriminative Training
1654 -- 1661Kevin M. Indrebo, Richard J. Povinelli, Michael T. Johnson. Minimum Mean-Squared Error Estimation of Mel-Frequency Cepstral Coefficients Using a Novel Distortion Model
1662 -- 1674Xiong Xiao, Chng Eng Siong, Haizhou Li. Normalization of the Speech Modulation Spectra for Robust Speech Recognition
1675 -- 1684Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang, Ruei-Chuan Chang. Using Kernel Discriminant Analysis to Improve the Characterization of the Alternative Hypothesis for Speaker Verification
1685 -- 1695Ruohua Zhou, Marco Mattavelli, Giorgio Zoia. Music Onset Detection Based on Resonator Time Frequency Image
1696 -- 1705Nicola Bertoldi, Richard Zens, Marcello Federico, Wade Shen. Efficient Speech Translation Through Confusion Network Decoding
1706 -- 1710Ming Wu, Xiaojun Qiu, Guoyue Chen. An Overlap-Save Frequency-Domain Implementation of the Delayless Subband ANC Algorithm

Volume 16, Issue 7

1222 -- 1237Evgeny Matusov, Gregor Leusch, Rafael E. Banchs, Nicola Bertoldi, Daniel Dechelotte, Marcello Federico, M. Kolss, Young-Suk Lee, José B. Mariño, M. Paulik, Salim Roukos, Holger Schwenk, Hermann Ney. System Combination for Machine Translation of Spoken and Written Language
1238 -- 1248Mike Dowman, Virginia Savova, Thomas L. Griffiths, Konrad P. Körding, Joshua B. Tenenbaum, Matthew Purver. A Probabilistic Model of Meetings That Combines Words and Discourse Features
1249 -- 1259Srinivas Bangalore, Giuseppe Di Fabbrizio, Amanda Stent. Learning the Structure of Task-Driven Human-Human Dialogs
1260 -- 1273Hany Hassan, Khalil Sima'an, Andy Way. Syntactically Lexicalized Phrase-Based SMT
1274 -- 1286Christoph Tillmann, Tong Zhang. An Online Relevant Set Algorithm for Statistical Machine Translation
1287 -- 1302Minwoo Jeong, Gary Geunbae Lee. Triangular-Chain Conditional Random Fields
1303 -- 1314Alfred Dielmann, Steve Renals. Recognition of Dialogue Acts in Multiparty Meetings Using a Switching DBN
1315 -- 1329Min Zhang, Wanxiang Che, Guodong Zhou, AiTi Aw, Chew Lim Tan, Ting Liu, Sheng Li. Semantic Role Labeling Using a Grammar-Driven Convolution Tree Kernel
1330 -- 1339Ruhi Sarikaya, Mohamed Afify, Yonggang Deng, Hakan Erdogan, Yuqing Gao. Joint Morphological-Lexical Language Modeling for Processing Morphologically Rich Languages With Application to Dialectal Arabic
1340 -- 1354Francesc Alías, Xavier Sevillano, Joan Claudi Socoró, Xavi Gonzalvo. Towards High-Quality Next-Generation Text-to-Speech Synthesis: A Multidomain Approach by Automatic Domain Classification

Volume 16, Issue 6

1077 -- 1086Jia-Li You, Yining Chen, Min Chu, Frank K. Soong, Jin-Lin Wang. Identifying Language Origin of Named Entity With Multiple Information Sources
1087 -- 1096K. I. Nordstrom, George Tzanetakis, Peter F. Driessen. Transforming Perceived Vocal Effort and Breathiness Using Adaptive Pre-Emphasis Linear Prediction
1097 -- 1111Marco Grimaldi, Fred Cummins. Speaker Identification Using Instantaneous Frequencies
1112 -- 1123Jan S. Erkelens, Richard Heusdens. Tracking of Nonstationary Noise Based on Data-Driven Recursive Noise Power Estimation
1124 -- 1137H. Pulakka, Laura Laaksonen, M. Vainio, Jouni Pohjalainen, Paavo Alku. Evaluation of an Artificial Speech Bandwidth Extension Method in Three Languages
1138 -- 1151J. Serra, E. Gomez, Perfecto Herrera, Xavier Serra. Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification
1152 -- 1162Ioannis Karydis, Alexandros Nanopoulos, Apostolos N. Papadopoulos, Dimitrios Katsaros, Yannis Manolopoulos. Music Retrieval Over Wireless Ad-Hoc Networks
1163 -- 1172K. van den Doel, U. M. Ascher. Real-Time Numerical Solution of Webster's Equation on a Nonuniform Grid
1173 -- 1180T. S. Brandes. Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise
1181 -- 1193U. Manmontri, Patrick A. Naylor. A Class of Frobenius Norm-Based Algorithms Using Penalty Term and Natural Gradient for Blind Signal Separation
1194 -- 1206Manolis Perakakis, Alexandros Potamianos. A Study in Efficiency and Modality Usage in Multimodal Form Filling Systems
1207 -- 1214S. Yaman, Li Deng, Dong Yu, Ye-Yi Wang, Alex Acero. An Integrative and Discriminative Technique for Spoken Utterance Classification

Volume 16, Issue 5

881 -- 890Norman H. Adams, Gregory H. Wakefield. State-Space Synthesis of Virtual Auditory Space
891 -- 899Jianping Deng, Martin Bouchard, Tet Hin Yeap. Feature Enhancement for Noisy Speech Recognition With a Time-Variant Linear Predictive HMM Structure
900 -- 909P. Liu, C. Liu, H. Jiang, F. Soong, R.-H. Wang. A Constrained Line Search Optimization Method for Discriminative Training of HMMs
910 -- 919T. Gerkmann, C. Breithaupt, R. Martin. Improved A Posteriori Speech Presence Probability Estimation Based on a Likelihood Ratio With Fixed Priors
920 -- 933Margarita Kotti, Emmanouil Benetos, Costas Kotropoulos. Computationally Efficient and Robust BIC-Based Speaker Segmentation
934 -- 946H. Hacihabiboglu, B. Gunel, Ahmet M. Kondoz. Time-Domain Simulation of Directive Sources in 3-D Digital Waveguide Mesh-Based Acoustical Models
947 -- 956M. Karjalainen. Efficient Realization of Wave Digital Components for Physical Modeling and Sound Synthesis
957 -- 968Yiteng Huang, Jacob Benesty, Jingdong Chen. Analysis and Comparison of Multichannel Noise Reduction Methods in a Common Framework
969 -- 979Srivatsan Kandadai, Charles D. Creusere. Scalable Audio Compression at Low Bitrates
980 -- 988Patrick Kenny, Pierre Ouellet, N. Dehak, V. Gupta, Pierre Dumouchel. A Study of Interspeaker Variability in Speaker Verification
989 -- 999J. Paschedag, B. Lohmann. Error Convergence of the Filtered-X LMS Algorithm for Multiple Harmonic Excitation
1000 -- 1014Yegui Xiao, Akira Ikuta, Liying Ma, Khashayar Khorasani. Stochastic Analysis of the FXLMS-Based Narrowband Active Noise Control System
1015 -- 1028Michael Casey, Christophe Rhodes, Malcolm Slaney. Analysis of Minimum Distances in High-Dimensional Musical Spaces
1029 -- 1037Khe Chai Sim, Haizhou Li. On Acoustic Diversification Front-End for Spoken Language Identification
1038 -- 1046Rasool Tahmasbi, Sadegh Rezaei. Change Point Detection in GARCH Models for Voice Activity Detection
1047 -- 1060Valentin Ion, Reinhold Haeb-Umbach. A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition
1061 -- 1070Dong Yu, Li Deng, James Droppo, Jian Wu, Yifan Gong, Alex Acero. Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor

Volume 16, Issue 4

681 -- 695Chi-Min Liu, Han-Wen Hsu, Wen-Chieh Lee. Compression Artifacts in Perceptual Audio Coding
696 -- 710M. Yukawa, Rodrigo C. de Lamare, Raimundo Sampaio Neto. Efficient Acoustic Echo Cancellation With Reduced-Rank Adaptive Filtering Based on Selective Decimation and Adaptive Interpolation
711 -- 727Gal Reuven, Sharon Gannot, Israel Cohen. Dual-Source Transfer-Function Generalized Sidelobe Canceller
728 -- 739N. Roman, DeLiang Wang. Binaural Tracking of Multiple Moving Sources
740 -- 747Boaz Rafaely. The Spherical-Shell Microphone Array
748 -- 756B. Gunel, H. Hacihabiboglu, Ahmet M. Kondoz. Acoustic Source Separation of Convolutive Mixtures Based on Intensity Vector Statistics
757 -- 765Jacob Benesty, Jingdong Chen, Yiteng Huang. On the Importance of the Pearson Correlation Coefficient in Noise Reduction
766 -- 778Zhiyao Duan, Yungang Zhang, Changshui Zhang, Zhenwei Shi. Unsupervised Single-Channel Music Source Separation by Average Harmonic Structure Modeling
779 -- 789S. Yaman, Chin-Hui Lee. A Flexible Classifier Design Framework Based on Multiobjective Programming
790 -- 796Simon Tucker, Steve Whittaker. Temporal Compression of Speech: An Evaluation
797 -- 811Vivek Kumar Rangarajan Sridhar, Srinivas Bangalore, S. S. Narayanan. Exploiting Acoustic and Syntactic Features for Automatic Prosody Labeling in a Maximum Entropy Framework
812 -- 824F. Antonacci, M. Foco, Augusto Sarti, Stefano Tubaro. Fast Tracing of Acoustic Beams and Paths Through Visibility Lookup
825 -- 834T. Fingscheidt, S. Suhadi, S. Stan. Environment-Optimized Speech Enhancement
835 -- 846David Y. Zhao, W. Bastiaan Kleijn, A. Ypma, B. de Vries. Online Noise Estimation Using Stochastic-Gain HMM for Speech Enhancement
847 -- 858J. Grothendieck, A. Gorin. Towards Link Characterization From Content: Recovering Distributions From Classifier Output
859 -- 873Chia-Yu Wan, Lin-Shan Lee. Histogram-Based Quantization for Robust and/or Distributed Speech Recognition

Volume 16, Issue 3

481 -- 493Jingdong Chen, Jacob Benesty, Yiteng Huang. A Minimum Distortion Noise Reduction Algorithm With Multiple Microphones
494 -- 507Yonggang Deng, William J. Byrne. HMM Word and Phrase Alignment for Statistical Machine Translation
508 -- 518Giulia Garau, Steve Renals. Combining Spectral Representations for Large-Vocabulary Continuous Speech Recognition
519 -- 528Jian Xue, Yunxin Zhao. Random Forests of Phonetic Decision Trees for Acoustic Modeling in Conversational Speech Recognition
529 -- 540O. Gillet, G. Richard. Transcription and Separation of Drum Signals From Polyphonic Music
541 -- 553Richard C. Hendriks, Jesper Jensen, Richard Heusdens. Noise Tracking Using DFT Domain Subspace Decompositions
554 -- 562Haibin Huang, Pasi Fränti, Dong-Yan Huang, Susanto Rahardja. Cascaded RLS-LMS Prediction in MPEG-4 Lossless Audio Coding
563 -- 577Jeih-Weih Hung, Wei-Yi Tsai. Constructing Modulation Frequency Domain-Based Features for Robust Speech Recognition
578 -- 593A. Miguel, Eduardo Lleida, R. Rose, Luis Buera, O. Saz, Alfonso Ortega. Capturing Local Variability for Speaker Normalization in Speech Recognition
594 -- 606Norman Poh, Josef Kittler. Incorporating Model-Specific Score Distribution in Speaker Verification Systems
607 -- 616Yun Tang, R. Rose. Rapid Speaker Adaptation Using Clustered Maximum-Likelihood Linear Basis With Sparse Training Data
617 -- 628Jeremy Morris, Eric Fosler-Lussier. Conditional Random Fields for Integrating Local Discriminative Classifiers
629 -- 638Oscal T.-C. Chen, Wen-Chih Wu. Highly Robust, Secure, and Perceptual-Quality Echo Hiding Scheme
639 -- 650Shoichiro Saito, Hirokazu Kameoka, K. Takahashi, Takuya Nishimoto, Shigeki Sagayama. Specmurt Analysis of Polyphonic Music Signals
651 -- 665S. Shelley, D. T. Murphy. The Modeling of Diffuse Boundaries in the 2-D Digital Waveguide Mesh
666 -- 670Iain McCowan, Mike Lincoln, Ivan Himawan. Microphone Array Shape Calibration in Diffuse Noise Fields
671 -- 676Bob L. Sturm, John J. Shynk, Laurent Daudet, C. Roads. Dark Energy in Sparse Atomic Estimations

Volume 16, Issue 2

255 -- 266Anssi Klapuri. Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model
267 -- 277Mark R. Every. Discriminating Between Pitched Sources in Music Audio
278 -- 290Mathieu Lagrange, Luis Gustavo Martins, Jennifer Murdoch, George Tzanetakis. Normalized Cuts for Predominant Melodic Source Separation
291 -- 301Kyogu Lee, Malcolm Slaney. Acoustic Chord Transcription and Key Extraction From Audio Using Key-Dependent HMMs Trained on Synthesized Audio
302 -- 317Peter Jan O. Doets, Reginald L. Lagendijk. Distortion Estimation in Compressed Music Using Only Audio Fingerprints
318 -- 326M. Levy, M. Sandler. Structural Segmentation of Musical Audio by Constrained Clustering
327 -- 337Shlomo Dubnov. Unified View of Prediction and Repetition Structure in Audio Signals With Application to Interest Point Detection
338 -- 349Min-Yen Kan, Ye Wang, Denny Iskandar, Tin Lay Nwe, Arun Shenoy. LyricAlly: Automatic Synchronization of Textual Lyrics to Acoustic Music Signals
350 -- 358Jyh-Shing Roger Jang, Hong-Ru Lee. A General Framework of Progressive Filtering and Its Application to Query by Singing/Humming
359 -- 371Erdem Unal, Elaine Chew, Panayiotis G. Georgiou, Shrikanth S. Narayanan. Challenging Uncertainty in Query by Humming Systems: A Fingerprinting Approach
372 -- 381Iman S. H. Suyoto, Alexandra L. Uitdenbogerd, Falk Scholer. Searching Musical Audio Using Symbolic Queries
382 -- 395F. Kurth, M. Müller. Efficient Index-Based Audio Matching
396 -- 407Akihiro Kimura, Kunio Kashino, Takayuki Kurozumi, Hiroshi Murase. A Quick Search Method for Audio Signals Based on a Piecewise Linear Representation of Feature Trajectories
408 -- 423E. Pampalk, P. Herrera, M. Goto. Computational Models of Similarity for Drum Samples
424 -- 434A. Holzapfel, Y. Stylianou. Musical Genre Classification Using Nonnegative Matrix Factorization-Based Features
435 -- 447Kazuyoshi Yoshii, Masataka Goto, Kazuhiro Komatani, Tetsuya Ogata, Hiroshi G. Okuno. An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model
448 -- 457Yi-Hsuan Yang, Yu-Ching Lin, Ya-Fan Su, Homer H. Chen. A Regression Approach to Music Emotion Recognition
458 -- 466Luca Mion, Giovanni De Poli. Score-Independent Audio Features for Description of Music Expression
467 -- 476Douglas Turnbull, Luke Barrington, D. Torres, Gert R. G. Lanckriet. Semantic Annotation and Retrieval of Music and Sound Effects

Volume 16, Issue 1

1 -- 7Julio Vargas, Steve McLaughlin. Cascade Prediction Filters With Adaptive Zeros to Track the Time-Varying Resonances of the Vocal Tract
8 -- 22J. Tepperman, S. Narayanan. Using Articulatory Representations to Detect Segmental Errors in Nonnative Pronunciation
23 -- 33Ian Vince McLoughlin. Subjective Intelligibility Testing of Chinese Speech
34 -- 46N. Malyska, T. F. Quatieri. Spectral Representations of Nonmodal Phonation
47 -- 56Carlos Toshinori Ishi, K.-I. Sakakibara, Hiroshi Ishiguro, Norihiro Hagita. A Method for Automatic Detection of Vocal Fry
57 -- 64V. Grancharov, Jan H. Plasberg, J. Samuelsson, W. Bastiaan Kleijn. Generalized Postfilter for Speech Quality Enhancement
65 -- 73L. A. Ekman, W. Bastiaan Kleijn, M. N. Murthi. Regularized Linear Prediction of Speech
74 -- 82Jerome R. Bellegarda. Unit-Centric Feature Mapping for Inventory Pruning in Unit Selection Text-to-Speech Synthesis
83 -- 93Gerard Hotho, Lars F. Villemoes, Jeroen Breebaart. A Backward-Compatible Multichannel Audio Codec
94 -- 105Te Li, Susanto Rahardja, Soo Ngee Koh. Frequency Region-Based Prioritized Bit-Plane Coding for Scalable Audio
106 -- 115S. Grofit, Y. Lavner. Time-Scale Modification of Audio Signals Using Enhanced WSOLA With Management of Transients
116 -- 128Pierre Leveau, E. Vincent, G. Richard, Laurent Daudet. Instrument-Specific Harmonic Atoms for Mid-Level Music Representation
129 -- 136C. D. Creusere, K. D. Kallakuri, R. Vanam. An Objective Metric of Human Subjective Audio Quality Optimized for a Wide Range of Audio Fidelities
137 -- 150Wei Chu, B. Champagne. A Noise-Robust FFT-Based Auditory Spectrum With Application in Audio Classification
151 -- 161Heidi Christensen, Yoshihiko Gotoh, Steve Renals. A Cascaded Broadcast News Highlighter
162 -- 173. Adaptive System Identification in the Short-Time Fourier Transform Domain Using Cross-Multiplicative Transfer Function Approximation
174 -- 185Cédric Févotte, Bruno Torrésani, Laurent Daudet, Simon J. Godsill. Sparse Linear Regression With Structured Priors and Application to Denoising of Musical Audio
186 -- 197A. S. Park, J. R. Glass. Unsupervised Pattern Discovery in Speech
198 -- 207Jen-Tzung Chien, Meng-Sung Wu. Adaptive Bayesian Latent Semantic Analysis
208 -- 215Imed Zitouni. Constrained Minimization and Discriminative Training for Natural Language Call Routing
216 -- 228S. Ananthakrishnan, S. S. Narayanan. Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence
229 -- 238Yi Hu, Philipos C. Loizou. Evaluation of Objective Quality Measures for Speech Enhancement
239 -- 248Jen-Tzung Chien, Chuan-Wei Ting. Factor Analyzed Subspace Modeling and Selection