IEEE Transactions on Audio, Speech & Language Processing

researchr

You are not signed in
Sign in
Sign up

1361	--	1372	E. Ravelli, G. Richard, Laurent Daudet. Union of MDCT Bases for Audio Coding
1373	--	1382	Olivier Derrien, G. Richard. A New Model-Based Algorithm for Optimizing the MPEG-AAC in MS-Stereo
1383	--	1395	Alberto Carini, S. Malatini. Optimal Variable Step-Size NLMS Algorithms With Auxiliary Noise Power Scheduling for Feedforward Active Noise Control
1396	--	1408	Miguel Ferrer, Alberto Gonzalez, Maria de Diego, Gema Pinero. Fast Affine Projection Algorithms for Filtered-x Multichannel Active Noise Control
1409	--	1419	Ming Wu, Guoyue Chen, Xiaojun Qiu. An Improved Active Noise Control Algorithm Without Secondary Path Identification Based on the Frequency-Domain Subband Architecture
1420	--	1432	Jian-Wu Xu, José Carlos Príncipe. A Pitch Detector Based on a Generalized Correlation Function
1433	--	1451	Emanuel A. P. Habets, Sharon Gannot, Israel Cohen, P. Sommen. Joint Dereverberation and Residual Echo Suppression of Speech Signals in Noisy Environments
1452	--	1465	J. H. Gunther, G. Wilson. Mean-Squared Error Analysis of Adaptive Subband-Based System Identification
1466	--	1478	Constantin Paleologu, Jacob Benesty, Silviu Ciochina. A Variable Step-Size Affine Projection Algorithm Designed for Acoustic Echo Cancellation
1479	--	1489	J. Scheuing, Bin Yang. Disambiguation of TDOA Estimation for Multiple Sources in Reverberant Environments
1490	--	1502	Jacek Dmochowski, Jacob Benesty, Sofiène Affes. Linearly Constrained Minimum Variance Source Localization and Spectral Estimation
1503	--	1511	Jeroen Breebaart, Erik Schuijers. Phantom Materialization: A Novel Method to Enhance Stereo Audio Reproduction on Headphones
1512	--	1527	Tomohiro Nakatani, Biing-Hwang Juang, Takuya Yoshioka, Keisuke Kinoshita, Marc Delcroix, Masato Miyoshi. Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model
1528	--	1540	A. Abramson, I. Cohen. Single-Sensor Audio Source Separation Using Classification and Estimation Approach and GARCH Modeling
1541	--	1550	Chang-Hsing Lee, Chin-Chuan Han, Ching-Chien Chuang. Automatic Classification of Bird Species From Their Sounds Using Two-Dimensional Cepstral Coefficients
1551	--	1564	Shahram Khadivi, Hermann Ney. Integration of Speech Recognition and Machine Translation in Computer-Assisted Translation
1565	--	1578	Juan Manuel Górriz, Javier Ramírez, Elmar Wolfgang Lang, Carlos García Puntonet. Jointly Gaussian PDF-Based Likelihood Ratio Test for Voice Activity Detection
1579	--	1589	Tiago H. Falk, Wai-Yip Chan. Hybrid Signal-and-Link-Parametric Speech Quality Measurement for VoIP Communications
1590	--	1601	K. J. Han, S. Kim, S. S. Narayanan. Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarization
1602	--	1613	K. S. R. Murty, B. Yegnanarayana. Epoch Extraction From Speech Signals
1614	--	1623	E. Plourde, B. Champagne. Auditory-Based Spectral Amplitude Estimators for Speech Enhancement
1624	--	1632	Benny Sallberg, Nedelko Grbic, Ingvar Claesson. Complex-Valued Independent Component Analysis for Online Blind Speech Extraction
1633	--	1641	Hai Huyen Dam, Hai Quang Dam, Sven Nordholm. Noise Statistics Update Adaptive Beamformer With PSD Estimation for Speech Extraction in Noisy Environment
1642	--	1653	Donglai Zhu, Haizhou Li, Bin Ma, Chin-Hui Lee. Optimizing the Performance of Spoken Language Recognition With Discriminative Training
1654	--	1661	Kevin M. Indrebo, Richard J. Povinelli, Michael T. Johnson. Minimum Mean-Squared Error Estimation of Mel-Frequency Cepstral Coefficients Using a Novel Distortion Model
1662	--	1674	Xiong Xiao, Chng Eng Siong, Haizhou Li. Normalization of the Speech Modulation Spectra for Robust Speech Recognition
1675	--	1684	Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang, Ruei-Chuan Chang. Using Kernel Discriminant Analysis to Improve the Characterization of the Alternative Hypothesis for Speaker Verification
1685	--	1695	Ruohua Zhou, Marco Mattavelli, Giorgio Zoia. Music Onset Detection Based on Resonator Time Frequency Image
1696	--	1705	Nicola Bertoldi, Richard Zens, Marcello Federico, Wade Shen. Efficient Speech Translation Through Confusion Network Decoding
1706	--	1710	Ming Wu, Xiaojun Qiu, Guoyue Chen. An Overlap-Save Frequency-Domain Implementation of the Delayless Subband ANC Algorithm

1222	--	1237	Evgeny Matusov, Gregor Leusch, Rafael E. Banchs, Nicola Bertoldi, Daniel Dechelotte, Marcello Federico, M. Kolss, Young-Suk Lee, José B. Mariño, M. Paulik, Salim Roukos, Holger Schwenk, Hermann Ney. System Combination for Machine Translation of Spoken and Written Language
1238	--	1248	Mike Dowman, Virginia Savova, Thomas L. Griffiths, Konrad P. Körding, Joshua B. Tenenbaum, Matthew Purver. A Probabilistic Model of Meetings That Combines Words and Discourse Features
1249	--	1259	Srinivas Bangalore, Giuseppe Di Fabbrizio, Amanda Stent. Learning the Structure of Task-Driven Human-Human Dialogs
1260	--	1273	Hany Hassan, Khalil Sima'an, Andy Way. Syntactically Lexicalized Phrase-Based SMT
1274	--	1286	Christoph Tillmann, Tong Zhang. An Online Relevant Set Algorithm for Statistical Machine Translation
1287	--	1302	Minwoo Jeong, Gary Geunbae Lee. Triangular-Chain Conditional Random Fields
1303	--	1314	Alfred Dielmann, Steve Renals. Recognition of Dialogue Acts in Multiparty Meetings Using a Switching DBN
1315	--	1329	Min Zhang, Wanxiang Che, Guodong Zhou, AiTi Aw, Chew Lim Tan, Ting Liu, Sheng Li. Semantic Role Labeling Using a Grammar-Driven Convolution Tree Kernel
1330	--	1339	Ruhi Sarikaya, Mohamed Afify, Yonggang Deng, Hakan Erdogan, Yuqing Gao. Joint Morphological-Lexical Language Modeling for Processing Morphologically Rich Languages With Application to Dialectal Arabic
1340	--	1354	Francesc Alías, Xavier Sevillano, Joan Claudi Socoró, Xavi Gonzalvo. Towards High-Quality Next-Generation Text-to-Speech Synthesis: A Multidomain Approach by Automatic Domain Classification

1077	--	1086	Jia-Li You, Yining Chen, Min Chu, Frank K. Soong, Jin-Lin Wang. Identifying Language Origin of Named Entity With Multiple Information Sources
1087	--	1096	K. I. Nordstrom, George Tzanetakis, Peter F. Driessen. Transforming Perceived Vocal Effort and Breathiness Using Adaptive Pre-Emphasis Linear Prediction
1097	--	1111	Marco Grimaldi, Fred Cummins. Speaker Identification Using Instantaneous Frequencies
1112	--	1123	Jan S. Erkelens, Richard Heusdens. Tracking of Nonstationary Noise Based on Data-Driven Recursive Noise Power Estimation
1124	--	1137	H. Pulakka, Laura Laaksonen, M. Vainio, Jouni Pohjalainen, Paavo Alku. Evaluation of an Artificial Speech Bandwidth Extension Method in Three Languages
1138	--	1151	J. Serra, E. Gomez, Perfecto Herrera, Xavier Serra. Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification
1152	--	1162	Ioannis Karydis, Alexandros Nanopoulos, Apostolos N. Papadopoulos, Dimitrios Katsaros, Yannis Manolopoulos. Music Retrieval Over Wireless Ad-Hoc Networks
1163	--	1172	K. van den Doel, U. M. Ascher. Real-Time Numerical Solution of Webster s Equation on A Nonuniform Grid
1173	--	1180	T. S. Brandes. Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise
1181	--	1193	U. Manmontri, Patrick A. Naylor. A Class of Frobenius Norm-Based Algorithms Using Penalty Term and Natural Gradient for Blind Signal Separation
1194	--	1206	Manolis Perakakis, Alexandros Potamianos. A Study in Efficiency and Modality Usage in Multimodal Form Filling Systems
1207	--	1214	S. Yaman, Li Deng, Dong Yu, Ye-Yi Wang, Alex Acero. An Integrative and Discriminative Technique for Spoken Utterance Classification

881	--	890	Norman H. Adams, Gregory H. Wakefield. State-Space Synthesis of Virtual Auditory Space
891	--	899	Jianping Deng, Martin Bouchard, Tet Hin Yeap. Feature Enhancement for Noisy Speech Recognition With a Time-Variant Linear Predictive HMM Structure
900	--	909	P. Liu, C. Liu, H. Jiang, F. Soong, R.-H. Wang. A Constrained Line Search Optimization Method for Discriminative Training of HMMs
910	--	919	T. Gerkmann, C. Breithaupt, R. Martin. Improved A Posteriori Speech Presence Probability Estimation Based on a Likelihood Ratio With Fixed Priors
920	--	933	Margarita Kotti, Emmanouil Benetos, Costas Kotropoulos. Computationally Efficient and Robust BIC-Based Speaker Segmentation
934	--	946	H. Hacihabiboglu, B. Gunel, Ahmet M. Kondoz. Time-Domain Simulation of Directive Sources in 3-D Digital Waveguide Mesh-Based Acoustical Models
947	--	956	M. Karjalainen. Efficient Realization of Wave Digital Components for Physical Modeling and Sound Synthesis
957	--	968	Yiteng Huang, Jacob Benesty, Jingdong Chen. Analysis and Comparison of Multichannel Noise Reduction Methods in a Common Framework
969	--	979	Srivatsan Kandadai, Charles D. Creusere. Scalable Audio Compression at Low Bitrates
980	--	988	Patrick Kenny, Pierre Ouellet, N. Dehak, V. Gupta, Pierre Dumouchel. A Study of Interspeaker Variability in Speaker Verification
989	--	999	J. Paschedag, B. Lohmann. Error Convergence of the Filtered-X LMS Algorithm for Multiple Harmonic Excitation
1000	--	1014	Yegui Xiao, Akira Ikuta, Liying Ma, Khashayar Khorasani. Stochastic Analysis of the FXLMS-Based Narrowband Active Noise Control System
1015	--	1028	Michael Casey, Christophe Rhodes, Malcolm Slaney. Analysis of Minimum Distances in High-Dimensional Musical Spaces
1029	--	1037	Khe Chai Sim, Haizhou Li. On Acoustic Diversification Front-End for Spoken Language Identification
1038	--	1046	Rasool Tahmasbi, Sadegh Rezaei. Change Point Detection in GARCH Models for Voice Activity Detection
1047	--	1060	Valentin Ion, Reinhold Haeb-Umbach. A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition
1061	--	1070	Dong Yu, Li Deng, James Droppo, Jian Wu, Yifan Gong, Alex Acero. Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor

681	--	695	Chi-Min Liu, Han-Wen Hsu, Wen-Chieh Lee. Compression Artifacts in Perceptual Audio Coding
696	--	710	M. Yukawa, Rodrigo C. de Lamare, Raimundo Sampaio Neto. Efficient Acoustic Echo Cancellation With Reduced-Rank Adaptive Filtering Based on Selective Decimation and Adaptive Interpolation
711	--	727	Gal Reuven, Sharon Gannot, Israel Cohen. Dual-Source Transfer-Function Generalized Sidelobe Canceller
728	--	739	N. Roman, DeLiang Wang. Binaural Tracking of Multiple Moving Sources
740	--	747	Boaz Rafaely. The Spherical-Shell Microphone Array
748	--	756	B. Gunel, H. Hachabiboglu, Ahmet M. Kondoz. Acoustic Source Separation of Convolutive Mixtures Based on Intensity Vector Statistics
757	--	765	Jacob Benesty, Jingdong Chen, Yiteng Huang. On the Importance of the Pearson Correlation Coefficient in Noise Reduction
766	--	778	Zhiyao Duan, Yungang Zhang, Changshui Zhang, Zhenwei Shi. Unsupervised Single-Channel Music Source Separation by Average Harmonic Structure Modeling
779	--	789	S. Yaman, Chin-Hui Lee. A Flexible Classifier Design Framework Based on Multiobjective Programming
790	--	796	Simon Tucker, Steve Whittaker. Temporal Compression Of Speech: An Evaluation
797	--	811	Vivek Kumar Rangarajan Sridhar, Srinivas Bangalore, S. S. Narayanan. Exploiting Acoustic and Syntactic Features for Automatic Prosody Labeling in a Maximum Entropy Framework
812	--	824	F. Antonacci, M. Foco, Augusto Sarti, Stefano Tubaro. Fast Tracing of Acoustic Beams and Paths Through Visibility Lookup
825	--	834	T. Fingscheidt, S. Suhadi, S. Stan. Environment-Optimized Speech Enhancement
835	--	846	David Y. Zhao, W. Bastiaan Kleijn, A. Ypma, B. de Vries. Online Noise Estimation Using Stochastic-Gain HMM for Speech Enhancement
847	--	858	J. Grothendieck, A. Gorin. Towards Link Characterization From Content: Recovering Distributions From Classifier Output
859	--	873	Chia-Yu Wan, Lin-Shan Lee. Histogram-Based Quantization for Robust and/or Distributed Speech Recognition

481	--	493	Jingdong Chen, Jacob Benesty, Yiteng Huang. A Minimum Distortion Noise Reduction Algorithm With Multiple Microphones
494	--	507	Yonggang Deng, William J. Byrne. HMM Word and Phrase Alignment for Statistical Machine Translation
508	--	518	Giulia Garau, Steve Renals. Combining Spectral Representations for Large-Vocabulary Continuous Speech Recognition
519	--	528	Jian Xue, Yunxin Zhao. Random Forests of Phonetic Decision Trees for Acoustic Modeling in Conversational Speech Recognition
529	--	540	O. Gillet, G. Richard. Transcription and Separation of Drum Signals From Polyphonic Music
541	--	553	Richard C. Hendriks, Jesper Jensen, Richard Heusdens. Noise Tracking Using DFT Domain Subspace Decompositions
554	--	562	Haibin Huang, Pasi Fränti, Dong-Yan Huang, Susanto Rahardja. Cascaded RLS-LMS Prediction in MPEG-4 Lossless Audio Coding
563	--	577	Jeih-Weih Hung, Wei-Yi Tsai. Constructing Modulation Frequency Domain-Based Features for Robust Speech Recognition
578	--	593	A. Miguel, Eduardo Lleida, R. Rose, Luis Buera, O. Saz, Alfonso Ortega. Capturing Local Variability for Speaker Normalization in Speech Recognition
594	--	606	Norman Poh, Josef Kittler. Incorporating Model-Specific Score Distribution in Speaker Verification Systems
607	--	616	Yun Tang, R. Rose. Rapid Speaker Adaptation Using Clustered Maximum-Likelihood Linear Basis With Sparse Training Data
617	--	628	Jeremy Morris, Eric Fosler-Lussier. Conditional Random Fields for Integrating Local Discriminative Classifiers
629	--	638	Oscal T.-C. Chen, Wen-Chih Wu. Highly Robust, Secure, and Perceptual-Quality Echo Hiding Scheme
639	--	650	Shoichiro Saito, Hirokazu Kameoka, K. Takahashi, Takuya Nishimoto, Shigeki Sagayama. Specmurt Analysis of Polyphonic Music Signals
651	--	665	S. Shelley, D. T. Murphy. The Modeling of Diffuse Boundaries in the 2-D Digital Waveguide Mesh
666	--	670	Iain McCowan, Mike Lincoln, Ivan Himawan. Microphone Array Shape Calibration in Diffuse Noise Fields
671	--	676	Bob L. Sturm, John J. Shynk, Laurent Daudet, C. Roads. Dark Energy in Sparse Atomic Estimations

255	--	266	Anssi Klapuri. Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model
267	--	277	Mark R. Every. Discriminating Between Pitched Sources in Music Audio
278	--	290	Mathieu Lagrange, Luis Gustavo Martins, Jennifer Murdoch, George Tzanetakis. Normalized Cuts for Predominant Melodic Source Separation
291	--	301	Kyogu Lee, Malcolm Slaney. Acoustic Chord Transcription and Key Extraction From Audio Using Key-Dependent HMMs Trained on Synthesized Audio
302	--	317	Peter Jan O. Doets, Reginald L. Lagendijk. Distortion Estimation in Compressed Music Using Only Audio Fingerprints
318	--	326	M. Levy, M. Sandler. Structural Segmentation of Musical Audio by Constrained Clustering
327	--	337	Shlomo Dubnov. Unified View of Prediction and Repetition Structure in Audio Signals With Application to Interest Point Detection
338	--	349	Min-Yen Kan, Ye Wang, Denny Iskandar, Tin Lay Nwe, Arun Shenoy. LyricAlly: Automatic Synchronization of Textual Lyrics to Acoustic Music Signals
350	--	358	Jyh-Shing Roger Jang, Hong-Ru Lee. A General Framework of Progressive Filtering and Its Application to Query by Singing/Humming
359	--	371	Erdem Unal, Elaine Chew, Panayiotis G. Georgiou, Shrikanth S. Narayanan. Challenging Uncertainty in Query by Humming Systems: A Fingerprinting Approach
372	--	381	Iman S. H. Suyoto, Alexandra L. Uitdenbogerd, Falk Scholer. Searching Musical Audio Using Symbolic Queries
382	--	395	F. Kurth, M. Muler. Efficient Index-Based Audio Matching
396	--	407	Akihiro Kimura, Kunio Kashino, Takayuki Kurozumi, Hiroshi Murase. A Quick Search Method for Audio Signals Based on a Piecewise Linear Representation of Feature Trajectories
408	--	423	E. Pampalk, P. Herrera, M. Goto. Computational Models of Similarity for Drum Samples
424	--	434	A. Holzapfel, Y. Stylianou. Musical Genre Classification Using Nonnegative Matrix Factorization-Based Features
435	--	447	Kazuyoshi Yoshii, Masataka Goto, Kazuhiro Komatani, Tetsuya Ogata, Hiroshi G. Okuno. An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model
448	--	457	Yi-Hsuan Yang, Yu-Ching Lin, Ya-Fan Su, Homer H. Chen. A Regression Approach to Music Emotion Recognition
458	--	466	Luca Mion, Giovanni De Poli. Score-Independent Audio Features for Description of Music Expression
467	--	476	Douglas Turnbull, Luke Barrington, D. Torres, Gert R. G. Lanckriet. Semantic Annotation and Retrieval of Music and Sound Effects

1	--	7	Julio Vargas, Steve McLaughlin. Cascade Prediction Filters With Adaptive Zeros to Track the Time-Varying Resonances of the Vocal Tract
8	--	22	J. Tepperman, S. Narayanan. Using Articulatory Representations to Detect Segmental Errors in Nonnative Pronunciation
23	--	33	Ian Vince McLoughlin. Subjective Intelligibility Testing of Chinese Speech
34	--	46	N. Malyska, T. F. Quatieri. Spectral Representations of Nonmodal Phonation
47	--	56	Carlos Toshinori Ishi, K.-I. Sakakibara, Hiroshi Ishiguro, Norihiro Hagita. A Method for Automatic Detection of Vocal Fry
57	--	64	V. Grancharov, Jan H. Plasberg, J. Samuelsson, W. Bastiaan Kleijn. Generalized Postfilter for Speech Quality Enhancement
65	--	73	L. A. Ekman, W. Bastiaan Kleijn, M. N. Murthi. Regularized Linear Prediction of Speech
74	--	82	Jerome R. Bellegarda. Unit-Centric Feature Mapping for Inventory Pruning in Unit Selection Text-to-Speech Synthesis
83	--	93	Gerard Hotho, Lars F. Villemoes, Jeroen Breebaart. A Backward-Compatible Multichannel Audio Codec
94	--	105	Te Li, Susanto Rahardja, Soo Ngee Koh. Frequency Region-Based Prioritized Bit-Plane Coding for Scalable Audio
106	--	115	S. Grofit, Y. Lavner. Time-Scale Modification of Audio Signals Using Enhanced WSOLA With Management of Transients
116	--	128	Pierre Leveau, E. Vincent, G. Richard, Laurent Daudet. Instrument-Specific Harmonic Atoms for Mid-Level Music Representation
129	--	136	C. D. Creusere, K. D. Kallakuri, R. Vanam. An Objective Metric of Human Subjective Audio Quality Optimized for a Wide Range of Audio Fidelities
137	--	150	Wei Chu, B. Champagne. A Noise-Robust FFT-Based Auditory Spectrum With Application in Audio Classification
151	--	161	Heidi Christensen, Yoshihiko Gotoh, Steve Renals. A Cascaded Broadcast News Highlighter
162	--	173	. Adaptive System Identification in the Short-Time Fourier Transform Domain Using Cross-Multiplicative Transfer Function Approximation
174	--	185	Cédric Févotte, Bruno Torrésani, Laurent Daudet, Simon J. Godsill. Sparse Linear Regression With Structured Priors and Application to Denoising of Musical Audio
186	--	197	A. S. Park, J. R. Glass. Unsupervised Pattern Discovery in Speech
198	--	207	Jen-Tzung Chien, Meng-Sung Wu. Adaptive Bayesian Latent Semantic Analysis
208	--	215	Imed Zitouni. Constrained Minimization and Discriminative Training for Natural Language Call Routing
216	--	228	S. Ananthakrishnan, S. S. Narayanan. Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence
229	--	238	Yi Hu, Philipos C. Loizou. Evaluation of Objective Quality Measures for Speech Enhancement
239	--	248	Jen-Tzung Chien, Chuan-Wei Ting. Factor Analyzed Subspace Modeling and Selection

External Links

Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 16, Issue 8

Volume 16, Issue 7

Volume 16, Issue 6

Volume 16, Issue 5

Volume 16, Issue 4

Volume 16, Issue 3

Volume 16, Issue 2

Volume 16, Issue 1

External Links

Journal: IEEE Transactions on Audio, Speech &amp; Language Processing

Volume 16, Issue 8

Volume 16, Issue 7

Volume 16, Issue 6

Volume 16, Issue 5

Volume 16, Issue 4

Volume 16, Issue 3

Volume 16, Issue 2

Volume 16, Issue 1

Journal: IEEE Transactions on Audio, Speech & Language Processing