Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 25, Issue 11

2045 -- 2058Qinghua Huang, Lin Zhang, Yong Fang. Two-Stage Decoupled DOA Estimation Based on Real Spherical Harmonics for Spherical Arrays
2059 -- 2070Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Takaaki Hori, Jonathan Le Roux, Kazuya Takeda. Duration-Controlled LSTM for Polyphonic Sound Event Detection
2071 -- 2084Monisankha Pal, Goutam Saha. Spectral Mapping Using Prior Re-Estimation of i-Vectors and System Fusion for Voice Conversion
2085 -- 2097Seppo Enarvi, Peter Smit, Sami Virpioja, Mikko Kurimo. Automatic Speech Recognition With Very Large Conversational Finnish and Estonian Vocabularies
2098 -- 2111Hannah Muckenhirn, Pavel Korshunov, Mathew Magimai-Doss, Sébastien Marcel. Long-Term Spectral Statistics for Voice Presentation Attack Detection
2112 -- 2124Brian Hamilton, Stefan Bilbao. FDTD Methods for 3-D Room Acoustics Simulation With High-Order Accuracy in Space and Time
2125 -- 2137Pejman Mowlaee, Martin Blass, W. Bastiaan Kleijn. New Results in Modulation-Domain Single-Channel Speech Enhancement
2138 -- 2151Dylan Menzies, Filippo Maria Fazi. Decoding and Compression of Channel and Scene Objects for Spatial Audio
2152 -- 2161Eunwoo Song, Frank K. Soong, Hong-Goo Kang. Effective Spectral and Excitation Modeling Techniques for LSTM-RNN-Based Speech Synthesis Systems
2162 -- 2175Pulkit Sharma, Vinayak Abrol, Anil Kumar Sao. Deep-Sparse-Representation-Based Features for Speech Recognition
2176 -- 2187Iynkaran Natgunanathan, Yong Xiang, Guang Hua, Gleb Beliakov, John Yearwood. Patchwork-Based Multilayer Audio Watermarking
2188 -- 2198Chengzhu Yu, John H. L. Hansen. Active Learning Based Constrained Clustering For Speaker Diarization
2199 -- 2208Emil Solsbæk Ottosen, Monika Dörfler. A Phase Vocoder Based on Nonstationary Gabor Frames
2209 -- 2222Boaz Schwartz, Sharon Gannot, Emanuel A. P. Habets. Two Model-Based EM Algorithms for Blind Source Separation in Noisy Environments
2223 -- 2236Maja Taseska, Emanuel A. P. Habets. Nonstationary Noise PSD Matrix Estimation for Multichannel Blind Speech Extraction
2237 -- 2250Bruno Di Giorgi, Simon Dixon, Massimiliano Zanoni, Augusto Sarti. A Data-Driven Model of Tonal Chord Sequence Complexity
2251 -- 0Nikolaos Stefanakis, Despoina Pavlidi, Athanasios Mouchtaris. Corrections to "Perpendicular Cross-Spectra Fusion for Sound Source Localization With a Planar Microphone Array"