IEEE Transactions on Audio, Speech & Language Processing

researchr

You are not signed in
Sign in
Sign up

1777	--	1790	Zhanyu Ma, Arne Leijon, W. Bastiaan Kleijn. Vector quantization of LSF parameters with a mixture of dirichlet distributions
1791	--	1804	Liang Lu, K. K. Chin, Arnab Ghoshal, Stephen Renals. Joint Uncertainty Decoding for Noise Robust Subspace Gaussian Mixture Models
1805	--	1817	Dimitrios Giannoulis, Anssi Klapuri. Musical Instrument Recognition in Polyphonic Audio Using Missing Feature Approach
1818	--	1829	Freddy William, Abhijeet Sangwan, John H. L. Hansen. Automatic Accent Assessment Using Phonetic Mismatch and Human Perception
1830	--	1840	Stanislaw Andrzej Raczynski, Emmanuel Vincent, Shigeki Sagayama. Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
1841	--	1853	Raymond W. M. Ng, Tan Lee, Cheung Chi Leung, Bin Ma, Haizhou Li. Spoken Language Recognition With Prosodic Features
1854	--	1866	Benoit Fuentes, Roland Badeau, Gaël Richard. Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription
1867	--	1878	Jose Manuel Gil-Cacho, Marco Signoretto, Toon van Waterschoot, Marc Moonen, Søren Holdt Jensen. Nonlinear Acoustic Echo Cancellation Based on a Sliding-Window Leaky Kernel Affine Projection Algorithm
1879	--	1890	Ina Kodrasi, Stefan Goetze, Simon Doclo. Regularization for Partial Multichannel Equalization for Speech Dereverberation
1891	--	1899	Roman Scharrer, Michael Vorländer. Sound Field Classification in Small Microphone Arrays Using Spatial Coherences
1900	--	1912	Muhammad Salman Khan, Syed M. Naqvi, Ata ur-Rehman, Wenwu Wang, Jonathon A. Chambers. Video-Aided Model-Based Source Separation in Real Reverberant Rooms
1913	--	1928	Mehrez Souden, Shoko Araki, Keisuke Kinoshita, Tomohiro Nakatani, Hiroshi Sawada. A Multichannel MMSE-Based Framework for Speech Source Separation and Noise Reduction
1929	--	1939	Matías Zanartu, Julio C. Ho, Daryush D. Mehta, Robert E. Hillman, George R. Wodicka. Subglottal Impedance-Based Inverse Filtering of Voiced Sounds Using Neck Surface Acceleration
1940	--	1952	Alex Southern, Samuel Siltanen, Damian T. Murphy, Lauri Savioja. Room Impulse Response Synthesis and Validation Using a Hybrid Acoustic Model
1953	--	1965	Futoshi Asano, Hideki Asoh, Kazuhiro Nakadai. Sound Source Localization Using Joint Bayesian Estimation With a Hierarchical Noise Model
1966	--	1978	Neil Wachowski, Mahmood R. Azimi-Sadjadi. Characterization of Multiple Transient Acoustical Sources From Time-Transform Representations
1979	--	1986	Cheng-Yuan Chang, Sen M. Kuo. Complete Parallel Narrowband Active Noise Control Systems

1539	--	1549	Constantin Paleologu, Jacob Benesty, Silviu Ciochina. Study of the General Kalman Filter for Echo Cancellation
1550	--	1559	Anaik Olivero, Bruno Torrésani, Richard Kronland-Martinet. A Class of Algorithms for Time-Frequency Multiplier Estimation
1560	--	1572	Olaf Schleusing, Tomi Kinnunen, Brad H. Story, Jean-Marc Vesin. Joint Source-Filter Optimization for Accurate Vocal Tract Estimation Using Differential Evolution
1573	--	1585	Dumidu S. Talagala, Wen Zhang 0002, Thushara D. Abhayapala. Broadband DOA Estimation Using Sensor Arrays on Complex-Shaped Rigid Bodies
1586	--	1597	Jiajun Zhang, Feifei Zhai, Chengqing Zong. Syntax-Based Translation With Bilingually Lexicalized Synchronous Tree Substitution Grammars
1598	--	1611	Chang Woo Han, Shin Jae Kang, Nam Soo Kim. Reverberation and Noise Robust Feature Compensation Based on IMM
1612	--	1621	Anoop Deoras, Gökhan Tür, Ruhi Sarikaya, Dilek Z. Hakkani-Tür. Joint Discriminative Decoding of Words and Semantic Tags for Spoken Language Understanding
1622	--	1631	Ville Hautamäki, Tomi Kinnunen, Filip Sedlak, Kong-Aik Lee, Bin Ma, Haizhou Li. Sparse Classifier Fusion for Speaker Verification
1632	--	1639	Sarthak Khanal, Harvey F. Silverman, Rahul R. Shakya. A Free-Source Method (FrSM) for Calibrating a Large-Aperture Microphone Array
1640	--	1652	Volker Leutnant, Alexander Krueger, Reinhold Haeb-Umbach. Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition
1653	--	1665	Enzo De Sena, Hüseyin Hacihabiboglu, Zoran Cvetkovic. Analysis and Design of Multichannel Systems for Perceptual Sound Field Reconstruction
1666	--	1675	Marcelo F. Caetano, Xavier Rodet. Musical Instrument Sound Morphing Guided by Perceptually Motivated Features
1676	--	1688	Bin Cheng, Christian Ritz, Ian S. Burnett, Xiguang Zheng. A General Compression Approach to Multi-Channel Three-Dimensional Audio
1689	--	1698	Weifeng Li, Longbiao Wang, Yicong Zhou, Hervé Bourlard, Qingmin Liao. Robust Log-Energy Estimation and its Dynamic Change Enhancement for In-car Speech Recognition
1699	--	1712	Alexey Ozerov, Antoine Liutkus, Roland Badeau, Gaël Richard. Coding-Based Informed Source Separation: Nonnegative Tensor Factorization Approach
1713	--	1726	David Imseng, Hervé Bourlard, John Dines, Philip N. Garner, Mathew Magimai-Doss. Applying Multi- and Cross-Lingual Stochastic Phone Space Transformations to Non-Native Speech Recognition
1727	--	1741	Eleftheria Georganti, Tobias May, Steven van de Par, John Mourjopoulos. Sound Source Distance Estimation in Rooms based on Statistical Properties of Binaural Signals
1742	--	1754	Michael Wohlmayr, Franz Pernkopf. Model-Based Multiple Pitch Tracking Using Factorial HMMs: Model Adaptation and Inference
1755	--	1759	Ron M. Hecht, Elad Noor, Gil Dobry, Yaniv Zigel, Aharon Bar-Hillel, Naftali Tishby. Effective Model Representation by Information Bottleneck Principle
1760	--	1765	Luis Weruaga, Leonid Dimitrov. The Spectral Nature of Maximum Likelihood Noise Compensated Linear Prediction

1317	--	1329	Charles Verron, P.-A. Gauthier, Jennifer Langlois, Catherine Guastavino. Spectral and Spatial Multichannel Analysis/Synthesis of Interior Aircraft Sounds
1330	--	1342	Chun-an Chan, Lin-Shan Lee. Model-Based Unsupervised Spoken Term Detection with Spoken Queries
1343	--	1354	Jingdong Chen, Jacob Benesty. On the Time-Domain Widely Linear LCMV Filter for Noise Reduction With a Stereo System
1355	--	1368	Ji Ming, Ramji Srinivasan, Danny Crookes, Ayeh Jafari. CLOSE - A Data-Driven Approach to Speech Separation
1369	--	1380	Masahito Togami, Yohei Kawaguchi, Ryu Takeda, Yasunari Obuchi, Nobuo Nukaga. Optimized Speech Dereverberation From Probabilistic Perspective for Time Varying Acoustic Transfer Function
1381	--	1390	Yuxuan Wang, DeLiang Wang. Towards Scaling Up Classification-Based Speech Separation
1391	--	1402	Simon Arberet, Pierre Vandergheynst, Rafael E. Carrillo, Jean-Philippe Thiran, Yves Wiaux. Sparse Reverberant Audio Source Separation via Reweighted Analysis
1403	--	1414	Shuhua Zhang, Weibei Dou, Huazhong Yang. MDCT Sinusoidal Analysis for Audio Signals Analysis and Processing
1415	--	1423	Balaji Vasan Srinivasan, Yuancheng Luo, Daniel Garcia-Romero, Dmitry N. Zotkin, Ramani Duraiswami. A Symmetric Kernel Partial Least Squares Framework for Speaker Recognition
1424	--	1433	X. Cai, W. Li. Ranking Through Clustering: An Integrated Approach to Multi-Document Summarization
1434	--	1444	Stanislaw Gorlow, J. D. Reiss. Model-Based Inversion of Dynamic Range Compression
1445	--	1457	Matthew McCallum, Bernard J. Guillemin. Stochastic-Deterministic MMSE STFT Speech Enhancement With General A Priori Information
1458	--	1468	Ali Hassan, Robert I. Damper, Mahesan Niranjan. On Acoustic Emotion Recognition: Compensating for Covariate Shift
1469	--	1480	F. Liu, Y. Liu. Towards Abstractive Speech Summarization: Exploring Unsupervised and Supervised Approaches for Spoken Utterance Compression
1481	--	1488	Vesa Välimäki, Heidi-Maria Lehtonen, Marko Takanen. A Perceptual Study on Velvet Noise and Its Variants at Different Pulse Densities
1489	--	1501	Olivier Derrien, Roland Badeau, Gaël Richard. Parametric Audio Coding With Exponentially Damped Sinusoids
1502	--	1512	Danilo Comminiello, Michele Scarpiniti, Luis Antonio Azpicueta-Ruiz, Jerónimo Arenas-García, Aurelio Uncini. Functional Link Adaptive Filters for Nonlinear Acoustic Echo Cancellation
1513	--	1523	Shmulik Markovich Golan, Sharon Gannot, Israel Cohen. Performance of the SDW-MWF With Randomly Located Microphones in a Reverberant Enclosure
1524	--	1533	Stefan Bilbao. Modeling of Complex Geometries and Boundary Conditions in Finite Difference/Finite Volume Time Domain Room Acoustics Simulation

1123	--	1133	Emanuel A. P. Habets, Jacob Benesty. Multi-Microphone Noise Reduction Based on Orthogonal Noise Signal Decompositions
1134	--	1144	Ping Xu, Pascale Fung. Cross-Lingual Language Modeling for Low-Resource Speech Recognition
1145	--	1157	Dongwen Ying, YongHong Yan. Noise Estimation Using a Constrained Sequential Hidden Markov Model in the Log-Spectral Domain
1158	--	1169	Ciprian Chelba, Peng Xu, Fernando Pereira, Thomas Richardson. Large Scale Distributed Acoustic Modeling With Back-Off ℕ-Grams
1170	--	1179	John Kane, Christer Gobl. Wavelet Maxima Dispersion for Breathy to Tense Voice Discrimination
1180	--	1189	Bin Zhang 0009, Alex Marin, Brian Hutchinson, Mari Ostendorf. Learning Phrase Patterns for Text Classification
1190	--	1200	Ilker Bayram, Mustafa E. Kamasak. A Simple Prior for Audio Signals
1201	--	1216	Lars-Johan Brännmark, Adrian Bahne, Anders Ahlén. Compensation of Loudspeaker-Room Responses in a Robust MIMO Control Framework
1228	--	1239	Ki-Seung Lee. Position-Dependent Crosstalk Cancellation Using Space Partitioning
1240	--	1250	Yusuke Hioka, Ken'ichi Furuya, Kazunori Kobayashi, Kenta Niwa, Yoichi Haneda. Underdetermined Sound Source Separation Using Power Spectrum Density Estimated by Combination of Directivity Gain
1251	--	1260	Benjamin Lecouteux, Georges Linares, Yannick Estève, Guillaume Gravier. Dynamic Combination of Automatic Speech Recognition Systems by Driven Decoding
1261	--	1271	Saman Mousazadeh, Israel Cohen. Voice Activity Detection in Presence of Transient Noise Using Spectral Clustering
1272	--	1284	Hung-yi Lee, Lin-Shan Lee. Enhanced Spoken Term Detection Using Support Vector Machines and Weighted Pseudo Examples
1285	--	1294	Tom Ko, Brian Mak. Eigentriphones for Context-Dependent Acoustic Modeling
1295	--	1307	David Rybach, Hermann Ney, Ralf Schlüter. Lexical Prefix Tree and WFST: A Comparison of Two Dynamic Search Concepts for LVCSR

899	--	906	Guangzhao Bao, Zhongfu Ye, Xu Xu, Yingyue Zhou. A Compressed Sensing Approach to Blind Separation of Speech Mixture Based on a Two-Layer Sparsity Model
907	--	922	Shing-Chow Chan, Y. J. Chu, Z. G. Zhang, Kai Man Tsui. A New Variable Regularized QR Decomposition-Based Recursive Least M-Estimate Algorithm - Performance Analysis and Acoustic Applications
923	--	933	Jesper Rindom Jensen, Mads Græsbøll Christensen, Søren Holdt Jensen. Nonlinear Least Squares Methods for Joint DOA and Pitch Estimation
934	--	944	Sandro Cumani, Pietro Laface. Memory and Computation Trade-Offs for Efficient I-Vector Extraction
945	--	958	Emanuel A. P. Habets, Jacob Benesty. A Two-Stage Beamforming Approach for Noise Reduction and Dereverberation
959	--	970	Marco Liuni, Axel Röbel, Ewa Matusiak, Marco Romito, Xavier Rodet. Automatic Adaptation of the Time-Frequency Resolution for Sound Analysis and Re-Synthesis
971	--	982	Hiroshi Sawada, Hirokazu Kameoka, Shoko Araki, Naonori Ueda. Multichannel Extensions of Non-Negative Matrix Factorization With Complex-Valued Data
983	--	997	Robert M. Nickel, Ramón Fernandez Astudillo, Dorothea Kolossa, Rainer Martin. Corpus-Based Speech Enhancement With Uncertainty Modeling and Cepstral Smoothing
998	--	1011	Nasser Mohammadiha, Arne Leijon. Nonnegative HMM for Babble Noise Derived From Speech HMM: Application to Speech Enhancement
1012	--	1022	Wei Rao, Man-Wai Mak. Boosting the Performance of I-Vector Based Speaker Verification via Utterance Partitioning
1023	--	1034	Ramón Fernandez Astudillo, Reinhold Orglmeister. Computing MMSE Estimates and Residual Uncertainty Directly in the Feature Domain of ASR using STFT Domain Speech Distortion Models
1035	--	1045	Petko N. Petkov, Gustav Eje Henter, W. Bastiaan Kleijn. Maximizing Phoneme Recognition Accuracy for Enhanced Speech Intelligibility in Noise
1046	--	1059	Maciej Niedzwiecki, Marcin Ciolek. Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
1060	--	1089	Li Deng, Xiao Li. Machine Learning Paradigms for Speech Recognition: An Overview
1090	--	1101	Coskun Mermer, Murat Saraclar, Ruhi Sarikaya. Improving Statistical Machine Translation Using Bayesian Word Alignment and Gibbs Sampling
1102	--	1112	Xiaodan Zhu, Colin Cherry, Gerald Penn. A Graph-Partitioning Framework for Aligning Hierarchical Topic Structures to Presentations
1113	--	1118	Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen. Binaural Integrated Active Noise Control and Noise Reduction in Hearing Aids

685	--	696	Shoichi Koyama, Ken'ichi Furuya, Yusuke Hiwasaki, Yoichi Haneda. Analytical Approach to Wave Field Reconstruction Filtering in Spatio-Temporal Frequency Domain
697	--	710	Xiao-lei Zhang, Ji Wu. Deep Belief Networks Based Voice Activity Detection
711	--	724	Emmanuel Ravelli, Vinay Melkote, Tejaswi Nanjundaswamy, Kenneth Rose. Joint Optimization of Base and Enhancement Layers in Scalable Audio Coding
725	--	736	Cemil Demir, Murat Saraclar, Ali Taylan Cemgil. Single-Channel Speech-Music Separation for Robust ASR With Mixture Models
737	--	748	Athanasia Zlatintsi, Petros Maragos. Multiscale Fractal Analysis of Musical Instrument Signals With Application to Recognition
749	--	761	Shakeel Ahmed, Muhammad Tahir Akhtar, Xi Zhang. Robust Auxiliary-Noise-Power Scheduling in Active Noise Control Systems With Online Secondary Path Modeling
762	--	774	Yakun Hu, Dapeng Wu, Antonio Nucci. Fuzzy-Clustering-Based Decision Tree Approach for Large Population Speaker Identification
775	--	785	Nicki Holighaus, Monika Dörfler, Gino Angelo M. Velasco, Thomas Grill. A Framework for Invertible, Real-Time Constant-Q Transforms
786	--	797	Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee. A Bottom-Up Modular Search Approach to Large Vocabulary Continuous Speech Recognition
798	--	805	JiHoon Park, Kwang-Ki Kim, Minsoo Hahn. Vocal Removal From Multiobject Audio Using Harmonic Information for Karaoke Service
806	--	815	John Woodruff, DeLiang Wang. Binaural Detection, Localization, and Segregation in Reverberant Environments Based on Joint Pitch and Azimuth Cues
816	--	827	Carlos Vaquero, Alfonso Ortega, Antonio Miguel, Eduardo Lleida. Quality Assessment for Speaker Diarization and Its Application in Speaker Characterization
828	--	841	Alireza Masnadi-Shirazi, Bhaskar D. Rao. An ICA-SCT-PHD Filter Approach for Tracking and Separation of Unknown Time-Varying Number of Sources
842	--	853	Taufiq Hasan, John H. L. Hansen. Acoustic Factor Analysis for Robust Speaker Verification
854	--	867	Gayadhar Pradhan, S. R. Mahadeva Prasanna. Speaker Verification by Vowel and Nonvowel Like Segmentation
868	--	878	Shing-Chow Chan, Y. J. Chu, Z. G. Zhang. A New Variable Regularized Transform Domain NLMS Adaptive Filtering Algorithm - Acoustic Applications and Performance Analysis
879	--	888	Nicolas Ellaham, Christian Giguere, Wail Gueaieb. Evaluation of the Phase-Inversion Signal Separation Method When Using Nonlinear Hearing Aids

463	--	475	Hongsen He, Lifu Wu, Jing Lu, Xiaojun Qiu, Jingdong Chen. Time Difference of Arrival Estimation Exploiting Multichannel Spatio-Temporal Prediction
476	--	487	Shan Liang, Wenju Liu, Wei Jiang. A New Bayesian Method Incorporating With Local Correlation for IBM Estimation
488	--	497	Stephan Tassart. Band-Limited Impulse Train Generation Using Sampled Infinite Impulse Responses of Analog Filters
498	--	507	Xin Chen, Yunxin Zhao. Building Acoustic Model Ensembles by Data Sampling With Enhanced Trainings and Features
508	--	519	Simone Spagnol, Michele Geronazzo, Federico Avanzini. On the Relation Between Pinna Reflection Patterns and Head-Related Transfer Function Features
520	--	530	Vipul Arora, Laxmidhar Behera. On-Line Melody Extraction From Polyphonic Audio Using Harmonic Cluster Tracking
531	--	543	Meinard Müller, Nanzhu Jiang, Peter Grosche. A Robust Fitness Measure for Capturing Repetitions in Music Recordings With Applications to Audio Thumbnailing
544	--	555	Shi-Xiong Zhang, Mark J. F. Gales. Structured SVMs for Automatic Speech Recognition
556	--	566	Daniel Erro, Eva Navas, Inma Hernáez. Parametric Voice Conversion Based on Bilinear Frequency Warping Plus Amplitude Scaling
567	--	578	Shuhua Zhang, Laurent Girin. Fast and Accurate Direct MDCT to DFT Conversion With Arbitrary Window Functions
579	--	586	Ladan Baghai-Ravary. The Inherent Temporal Precision of Phoneme Transitions
587	--	597	Matt Shannon, Heiga Zen, William Byrne. Autoregressive Models for Statistical Parametric Speech Synthesis
598	--	610	Jesper Kjær Nielsen, Mads Græsbøll Christensen, Søren Holdt Jensen. Default Bayesian Estimation of the Fundamental Frequency
611	--	623	Ziqiang Shi, Jiqing Han, Tieran Zheng, Ji Li. Identification of Objectionable Audio Segments Based on Pseudo and Heterogeneous Mixture Models
624	--	635	José A. González, Antonio M. Peinado, Ning Ma, Angel M. Gomez, Jon Barker. MMSE-Based Missing-Feature Reconstruction With Temporal Modeling for Robust Speech Recognition
636	--	648	Bassam Jabaian, Laurent Besacier, Fabrice Lefevre. Comparison and Combination of Lightly Supervised Approaches for Language Portability of a Spoken Language Understanding System
649	--	658	Renxian Zhang, Wenjie Li, Dehong Gao, Ouyang You. Automatic Twitter Topic Summarization With Speech Acts
659	--	668	Weibin Zhang, Pascale Fung. Sparse Inverse Covariance Matrices for Low Resource Speech Recognition
669	--	674	Jonathan Botts, José Escolano, Ning Xiang. Design of IIR Filters With Bayesian Model Selection and Parameter Estimation
675	--	680	Kais Khaldi, Abdel-Ouahab Boudraa. Audio Watermarking Via EMD

223	--	233	Alexandre Trilla, Francesc Alías. Sentence-Based Sentiment Analysis for Expressive Text-to-Speech
234	--	246	Paolo Annibale, Jason Filos, Patrick A. Naylor, Rudolf Rabenstein. TDOA-Based Speed of Sound Estimation for Air Temperature and Room Geometry Inference
247	--	259	Jung-Woo Choi, Yang-Hann Kim. Sound Field Reproduction of a Virtual Source Inside a Loudspeaker Array With Minimal External Radiation
260	--	269	Wei Wu, Mari Ostendorf. Graph-Based Query Strategies for Active Learning
270	--	279	Yuxuan Wang, Kun Han, DeLiang Wang. Exploring Monaural Features for Classification-Based Speech Segregation
280	--	290	Yao Qian, Frank K. Soong, Zhi-Jie Yan. A Unified Trajectory Tiling Approach to High Quality Speech Rendering
291	--	300	Erinç Dikici, Murat Semerci, Murat Saraclar, Ethem Alpaydin. Classification and Ranking Approaches to Discriminative Language Modeling for ASR
301	--	312	Boyan Huang, Yegui Xiao, Jinwei Sun, Guo Wei. A Variable Step-Size FXLMS Algorithm for Narrowband Active Noise Control
313	--	321	Stefano D'Angelo, Jyri Pakarinen, Vesa Välimäki. New Family of Wave-Digital Triode Models
322	--	335	Saeed Mosayyebpour, Morteza Esmaeili, T. Aaron Gulliver. Single-Microphone Early and Late Reverberation Suppression in Noisy Speech
336	--	342	Chengshi Zheng, Hao Liu, Renhua Peng, Xiaodong Li. A Statistical Analysis of Two-Channel Post-Filter Estimators in Isotropic Noise Fields
343	--	356	Shmulik Markovich Golan, Sharon Gannot, Israel Cohen. Distributed Multiple Constraints Generalized Sidelobe Canceler for Fully Connected Wireless Acoustic Sensor Networks
357	--	366	Ian McGraw, Ibrahim Badr, James R. Glass. Learning Lexicons From Speech Using a Pronunciation Mixture Model
367	--	377	Jonathan Dennis, Tran Huy Dat, Engsiong Chng. Image Feature Representation of the Subband Power Distribution for Robust Sound Event Classification
378	--	387	Nasim Radmanesh, Ian S. Burnett. Generation of Isolated Wideband Sound Fields Using a Combined Two-stage Lasso-LS Algorithm
388	--	396	Dong Yu, Li Deng, Frank Seide. The Deep Tensor Neural Network With Applications to Large Vocabulary Speech Recognition
397	--	406	Manas A. Pathak, Bhiksha Raj. Privacy-Preserving Speaker Verification and Identification Using Gaussian Mixture Models
407	--	415	Abigail A. Kressner, David V. Anderson, Christopher J. Rozell. Evaluating the Generalization of the Hearing Aid Speech Quality Index (HASQI)
416	--	426	Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali. A Multistream Feature Framework Based on Bandpass Modulation Filtering for Robust Speech Recognition
427	--	438	Chiong-Ching Lai, Sven Nordholm, Yee Hong Leung. Design of Steerable Spherical Broadband Beamformers With Flexible Sensor Configurations
439	--	443	Antonio Canclini, Fabio Antonacci, Augusto Sarti, Stefano Tubaro. Acoustic Source Localization With Distributed Asynchronous Microphone Networks
444	--	448	Stefano Gaiotto. A Tuning-Less Approach in Secondary Path Modeling in Active Noise Control Systems
449	--	455	Matt Speed, Damian T. Murphy, David M. Howard. Three-Dimensional Digital Waveguide Mesh Simulation of Cylindrical Vocal Tract Analogs

2471	--	2480	A. P. Prathosh, T. V. Ananthapadmanabha, A. G. Ramakrishnan. Epoch Extraction Based on Integrated Linear Prediction Residual Using Plosion Index
2481	--	2492	Tomas Dekens, Werner Verhelst. Body Conducted Speech Enhancement by Equalization and Signal Fusion
2493	--	2505	Dejan Markovic, Fabio Antonacci, Augusto Sarti, Stefano Tubaro. Soundfield Imaging in the Ray Space
2506	--	2515	Partha Lal, Simon King. Cross-Lingual Automatic Speech Recognition Using Tandem Features
2516	--	2531	Tomohiro Nakatani, Shoko Araki, Takuya Yoshioka, Marc Delcroix, Masakiyo Fujimoto. Dominance Based Integration of Spatial and Spectral Features for Speech Enhancement
2532	--	2540	Yotam Peled, Boaz Rafaely. Linearly-Constrained Minimum-Variance Method for Spherical Microphone Arrays Based on Plane-Wave Decomposition of the Sound Field
2541	--	2553	Jean-Louis Durrieu, Jean-Philippe Thiran. Source/Filter Factorial Hidden Markov Model, With Application to Pitch and Formant Tracking
2554	--	2569	Katherine Ellis, Emanuele Coviello, Antoni B. Chan, Gert R. G. Lanckriet. A Bag of Systems Representation for Music Auto-Tagging
2570	--	2582	Aroor Dinesh Dileep, C. Chandra Sekhar. HMM Based Intermediate Matching Kernel for Classification of Sequential Patterns of Speech Using Support Vector Machines
2583	--	2594	Oliver Thiergart, Giovanni Del Galdo, Maja Taseska, Emanuel A. P. Habets. Geometry-Based Spatial Sound Acquisition Using Distributed Microphone Arrays
2595	--	2606	Jesper Rindom Jensen, Jacob Benesty, Mads Græsbøll Christensen, Jingdong Chen. A Class of Optimal Rectangular Filtering Matrices for Single-Channel Signal Enhancement in the Time Domain
2607	--	2615	Yizhao Ni, Matt McVicar, Raúl Santos-Rodríguez, Tijl De Bie. Understanding Effects of Subjectivity in Measuring Chord Estimation Accuracy
2616	--	2626	Georg Heigold, Hermann Ney, Ralf Schlüter. Investigations on an EM-Style Optimization Algorithm for Discriminative Training of HMMs
2627	--	2637	Bruno Defraene, Naim Mansour, Steven De Hertogh, Toon van Waterschoot, Moritz Diehl, Marc Moonen. Declipping of Audio Signals Using Perceptual Compressed Sensing

2231	--	2243	Stephen J. Wright, Dimitri Kanevsky, Li Deng, Xiaodong He, Georg Heigold, Haizhou Li. Optimization Algorithms and Applications for Speech and Language Processing
2244	--	2254	Gillian M. Chin, Jorge Nocedal, Peder A. Olsen, Steven J. Rennie. Second Order Methods for Optimizing Convex Matrix Functions and Sparse Covariance Clustering
2255	--	2266	Theodoros Tsiligkaridis, Etienne Marcheret, Vaibhava Goel. A Difference of Convex Functions Approach to Large-Scale Log-Linear Model Estimation
2267	--	2276	Tara N. Sainath, Brian Kingsbury, Hagen Soltau, Bhuvana Ramabhadran. Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks
2277	--	2289	Tuomas Virtanen, Jort Florent Gemmeke, Bhiksha Raj. Active-Set Newton Algorithm for Overcomplete Non-Negative Representations of Audio
2290	--	2300	Patrick Cardinal, Pierre Dumouchel, Gilles Boulianne. Large Vocabulary Speech Recognition on Parallel Architectures
2301	--	2312	Rémi Mignot, Laurent Daudet, François Ollivier. Room Reverberation Reconstruction: Interpolation of the Early Part Using Compressed Sensing
2313	--	2323	Min Zhang 0005, Wenliang Chen, Xiangyu Duan, Rong Zhang. Improving Graph-Based Dependency Parsing Models With Dependency Language Models
2324	--	2336	Florian Pflug, Tim Fingscheidt. Robust Ultra-Low Latency Soft-Decision Decoding of Linear PCM Audio
2337	--	2345	Koji Seto, Tokunbo Ogunfunmi. Scalable Speech Coding for IP Networks: Beyond iLBC
2346	--	2355	Kenta Niwa, Yusuke Hioka, Ken'ichi Furuya, Yoichi Haneda. Diffused Sensing for Sharp Directive Beamforming
2356	--	2367	Symeon Delikaris-Manias, Ville Pulkki. Cross Pattern Coherence Algorithm for Spatial Filtering Applications Utilizing Microphone Arrays
2368	--	2378	Bai Ying Lei, Ing Yann Soon, Ee-Leng Tan. Robust SVD-Based Audio Watermarking Scheme With Differential Evolution Optimization
2379	--	2392	Nikos Malandrakis, Alexandros Potamianos, Elias Iosif, Shrikanth S. Narayanan. Distributional Semantic Models for Affective Text Analysis
2393	--	2402	Pasi Pertilä, Matti S. Hämäläinen, Mikael Mieskolainen. Passive Temporal Offset Estimation of Multichannel Recordings of an Ad-Hoc Microphone Array
2403	--	2411	Liang Vincent Wang, Woon-Seng Gan, Andy W. H. Khong, Sen M. Kuo. Convergence Analysis of Narrowband Feedback Active Noise Control System With Imperfect Secondary Path Estimation
2412	--	2424	Chi-Man Pun, Xiaochen Yuan. Robust Segments Detector for De-Synchronization Resilient Audio Watermarking
2425	--	2438	Miranti Indar Mandasari, Rahim Saeidi, Mitchell McLaren, David A. van Leeuwen. Quality Measure Functions for Calibration of Speaker Recognition Systems in Various Duration Conditions
2439	--	2450	Fabian Triefenbach, Azarakhsh Jalalvand, Kris Demuynck, Jean-Pierre Martens. Acoustic Modeling With Hierarchical Reservoirs
2451	--	2464	Donghyeon Lee, Minwoo Jeong, Kyungduk Kim, Seonghan Ryu, Gary Geunbae Lee. Unsupervised Spoken Language Understanding for a Multi-Domain Dialog System

1993	--	2005	William Hartmann, Arun Narayanan, Eric Fosler-Lussier, DeLiang Wang. A Direct Masking Approach to Robust ASR
2006	--	2014	Yow-Bang Wang, Shang-wen Li, Lin-Shan Lee. An Experimental Analysis on Integrating Multi-Stream Spectro-Temporal, Cepstral and Pitch Information for Mandarin Speech Recognition
2015	--	2028	Stephen Shum, Najim Dehak, Réda Dehak, James R. Glass. Unsupervised Methods for Speaker Diarization: An Integrated and Iterative Approach
2029	--	2041	Zbynek Koldovský, Jirí Málek, Petr Tichavský, Francesco Nesta. Semi-Blind Noise Extraction Using Partially Known Position of the Target Source
2042	--	2056	Mads Graesboll Christensen. Accurate Estimation of Low Fundamental Frequencies From Real-Valued Measurements
2057	--	2072	Philippe Esling, Carlos Agon. Multiobjective Time Series Matching for Audio Classification and Retrieval
2073	--	2084	Chao Zhang, Yi Liu, Yunqing Xia, Xuan Wang, Chin-Hui Lee. Reliable Accent-Specific Unit Generation With Discriminative Dynamic Gaussian Mixture Selection for Multi-Accent Chinese Speech Recognition
2085	--	2095	Gilles Degottex, Yannis Stylianou. Analysis and Synthesis of Speech Using an Adaptive Full-Band Harmonic Model
2096	--	2107	Bilei Zhu, Wei Li, Ruijiang Li, Xiangyang Xue. Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation
2108	--	2117	Sadao Hiroya. Non-Negative Temporal Decomposition of Speech Parameters by Multiplicative Update Rules
2118	--	2128	Cyril Joder, Slim Essid, Gaël Richard. Learning Optimal Features for Polyphonic Audio-to-Score Alignment
2129	--	2139	Zhen-Hua Ling, Li Deng, Dong Yu. Modeling Spectral Envelopes Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis
2140	--	2151	Nasser Mohammadiha, Paris Smaragdis, Arne Leijon. Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization
2152	--	2161	Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee. Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems
2162	--	2171	Nikolay D. Gaubitch, Mike Brookes, Patrick A. Naylor. Blind Channel Magnitude Response Estimation in Speech Using Spectrum Classification
2172	--	2181	Masayuki Suzuki, Takuya Yoshioka, Shinji Watanabe, Nobuaki Minematsu, Keikichi Hirose. Feature Enhancement With Joint Use of Consecutive Corrupted and Noise Feature Vectors With Discriminative Region Weighting
2182	--	2192	Takuya Yoshioka, Tomohiro Nakatani. Noise Model Transfer: Novel Approach to Robustness Against Nonstationary Noise
2193	--	2206	Despoina Pavlidi, Anthony Griffin, Matthieu Puigt, Athanasios Mouchtaris. Real-Time Multiple Sound Source Localization and Counting Using a Circular Microphone Array
2207	--	2220	Sefki Kolozali, Mathieu Barthet, György Fazekas, Mark Sandler. Automatic Ontology Generation for Musical Instruments Based on Audio Analysis

1	--	11	Stanislaw Gorlow, Sylvain Marchand. Informed Audio Source Separation Using Linearly Constrained Spatial Filters
12	--	26	Florin Ghido, Ioan Tabus. Sparse Modeling for Lossless Audio Compression
27	--	36	Xiguang Zheng, Christian Ritz, Jiangtao Xi. Encoding Navigable Speech Sources: A Psychoacoustic-Based Analysis-by-Synthesis Approach
37	--	48	Jwu-Sheng Hu, Ming-Tang Lee, Chia-Hsing Yang. ∞ Filter
49	--	60	Yi-Chin Huang, Chung-Hsien Wu, Yu-Ting Chao. Personalized Spectral and Prosody Conversion Using Frame-Based Codeword Distribution and Adaptive CRF
61	--	70	Nilesh Madhu, Ann Spriet, Sofie Jansen, Raphael Koning, Jan Wouters. The Potential for Speech Intelligibility Improvement Using the Ideal Binary Mask and the Ideal Wiener Filter in Single Channel Noise Reduction Systems: Application to Auditory Prostheses
71	--	82	Zafar Rafii, Bryan Pardo. REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation
83	--	96	Sree Hari Krishnan Parthasarathi, Hervé Bourlard, Daniel Gatica-Perez. Wordless Sounds: Robust Speaker Diarization Using Privacy-Preserving Audio Representations
97	--	107	Feng Huang, Tan Lee. Pitch Estimation in Noisy Speech Using Accumulated Peak Spectrum and Sparse Estimation Technique
108	--	119	Néstor Becerra Yoma, Claudio Garretón, Fernando Huenupán, Ignacio Catalan, Jorge Wuth Sepúlveda. On Reducing Harmonic and Sampling Distortion in Vocal Tract Length Normalization
120	--	129	Ke Hu, DeLiang Wang. An Unsupervised Approach to Cochannel Speech Separation
130	--	142	Ronen Talmon, Israel Cohen, Sharon Gannot. Single-Channel Transient Interference Suppression With Diffusion Maps
143	--	153	Nima Yousefian, Philipos C. Loizou. A Dual-Microphone Algorithm That Can Cope With Competing-Talker Scenarios
154	--	165	Miguel Ferrer, Alberto González, Maria de Diego, Gema Pinero. Convex Combination Filtered-X Algorithms for Active Noise Control Systems
166	--	175	Kun Han, DeLiang Wang. Towards Generalizing Classification Based Speech Separation
176	--	183	Nicolas Sturmel, Laurent Daudet. Informed Source Separation Using Iterative Reconstruction
184	--	194	Ziqiang Shi, Jiqing Han, Tieran Zheng, Shiwen Deng. Audio Segment Classification Using Online Learning Based Tensor Representation Feature Discrimination
195	--	204	Hai-son Le, Ilya Oparin, Alexandre Allauzen, Jean-Luc Gauvain, François Yvon. Structured Output Layer Neural Network Language Models for Speech Recognition
205	--	217	Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi. Articulatory Control of HMM-Based Parametric Speech Synthesis Using Feature-Space-Switched Multiple Regression

External Links

Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 21, Issue 9

Volume 21, Issue 8

Volume 21, Issue 7

Volume 21, Issue 6

Volume 21, Issue 5

Volume 21, Issue 4

Volume 21, Issue 3

Volume 21, Issue 2

Volume 21, Issue 12

Volume 21, Issue 11

Volume 21, Issue 10

Volume 21, Issue 1

External Links

Journal: IEEE Transactions on Audio, Speech &amp; Language Processing

Volume 21, Issue 9

Volume 21, Issue 8

Volume 21, Issue 7

Volume 21, Issue 6

Volume 21, Issue 5

Volume 21, Issue 4

Volume 21, Issue 3

Volume 21, Issue 2

Volume 21, Issue 12

Volume 21, Issue 11

Volume 21, Issue 10

Volume 21, Issue 1

Journal: IEEE Transactions on Audio, Speech & Language Processing