Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 24, Issue 12

2218 -- 2230Andrea Cogliati, Zhiyao Duan, Brendt Wohlberg. Context-Dependent Piano Music Transcription With Convolutional Sparse Coding
2231 -- 2240Yanmin Qian, Tian Tan, Dong Yu. Neural Network Based Multi-Factor Aware Joint Training for Robust Speech Recognition
2241 -- 2250Lahiru Samarakoon, Khe Chai Sim. Factorized Hidden Layer Adaptation for Deep Neural Network Based Acoustic Modeling
2251 -- 2262Martin Krawczyk-Becker, Timo Gerkmann. On MMSE-Based Estimation of Amplitude and Complex Speech Spectral Coefficients Under Phase-Uncertainty
2263 -- 2276Yanmin Qian, Mengxiao Bi, Tian Tan, Kai Yu. Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition
2277 -- 2287Yi-Chan Wu, Homer H. Chen. Generation of Affective Accompaniment in Accordance With Emotion Flow
2288 -- 2300Mahmood Movassagh, Peter Kabal. Scalable Audio Coding Using Trellis-Based Optimized Joint Entropy Coding and Quantization
2301 -- 2312Milos Cernak, Alexandros Lazaridis, Afsaneh Asaei, Philip N. Garner. Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding
2313 -- 2326David Dov, Ronen Talmon, Israel Cohen. Kernel Method for Voice Activity Detection in the Presence of Transients
2327 -- 2340Jesús Antonio Villalba López, Antonio Miguel, Alfonso Ortega, Eduardo Lleida. Bayesian Networks to Model the Variability of Speaker Verification Scores in Adverse Environments
2341 -- 2353Hardik B. Sailor, Hemant A. Patil. Novel Unsupervised Auditory Filterbank Learning Using Convolutional RBM for Speech Recognition
2354 -- 2367Sidsel Marie Nørholm, Jesper Rindom Jensen, Mads Græsbøll Christensen. Instantaneous Fundamental Frequency Estimation With Optimal Segmentation for Nonstationary Voiced Speech
2368 -- 2376Sheng Zhang, Jiashu Zhang, Hongyu Han. Robust Variable Step-Size Decorrelation Normalized Least-Mean-Square Algorithm and its Application to Acoustic Echo Cancellation
2377 -- 2389Tom Barker, Tuomas Virtanen. Blind Separation of Audio Mixtures Through Nonnegative Tensor Factorization of Modulation Spectrograms
2390 -- 2399Jinxin Liu, XueFeng Chen. Adaptive Compensation of Misequalization in Narrowband Active Noise Equalizer Systems
2400 -- 2413Atsunori Ogawa, Takaaki Hori, Atsushi Nakamura. Estimating Speech Recognition Accuracy Based on Error Type Classification
2414 -- 2424Finnian Kelly, John H. L. Hansen. Score-Aging Calibration for Speaker Verification
2425 -- 2438Bochen Li, Zhiyao Duan. An Approach to Score Following for Piano Performances With the Sustained Effect
2439 -- 2452Niko Moritz, Birger Kollmeier, Jörn Anemüller. Integration of Optimized Modulation Filter Sets Into Deep Neural Networks for Automatic Speech Recognition
2453 -- 2465Simon Leglaive, Roland Badeau, Gaël Richard. Multichannel Audio Source Separation With Probabilistic Reverberation Priors
2466 -- 2480Sakari Tervo. Single Snapshot Detection and Estimation of Reflections From Room Impulse Responses in the Spherical Harmonic Domain
2481 -- 2494Dejan Markovic, Fabio Antonacci, Lucio Bianchi, Stefano Tubaro, Augusto Sarti. Extraction of Acoustic Sources Through the Processing of Sound Field Maps in the Ray Space
2495 -- 2506Pavlos Papadopoulos, Andreas Tsiartas, Shrikanth Narayanan. Long-Term SNR Estimation of Speech Signals in Known and Unknown Channel Conditions
2507 -- 2515Ingo R. Titze, Anil Palaparthi. Sensitivity of Source-Filter Interaction to Specific Vocal Tract Shapes
2516 -- 2530Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot. A Hybrid Approach for Speech Enhancement Using MoG Model and Neural Network Phoneme Classifier
2531 -- 2543Gongping Huang, Jacob Benesty, Jingdong Chen. Superdirective Beamforming Based on the Krylov Matrix