2045 | -- | 2058 | Qinghua Huang, Lin Zhang, Yong Fang. Two-Stage Decoupled DOA Estimation Based on Real Spherical Harmonics for Spherical Arrays |
2059 | -- | 2070 | Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Takaaki Hori, Jonathan Le Roux, Kazuya Takeda. Duration-Controlled LSTM for Polyphonic Sound Event Detection |
2071 | -- | 2084 | Monisankha Pal, Goutam Saha. Spectral Mapping Using Prior Re-Estimation of i-Vectors and System Fusion for Voice Conversion |
2085 | -- | 2097 | Seppo Enarvi, Peter Smit, Sami Virpioja, Mikko Kurimo. Automatic Speech Recognition With Very Large Conversational Finnish and Estonian Vocabularies |
2098 | -- | 2111 | Hannah Muckenhirn, Pavel Korshunov, Mathew Magimai-Doss, Sébastien Marcel. Long-Term Spectral Statistics for Voice Presentation Attack Detection |
2112 | -- | 2124 | Brian Hamilton, Stefan Bilbao. FDTD Methods for 3-D Room Acoustics Simulation With High-Order Accuracy in Space and Time |
2125 | -- | 2137 | Pejman Mowlaee, Martin Blass, W. Bastiaan Kleijn. New Results in Modulation-Domain Single-Channel Speech Enhancement |
2138 | -- | 2151 | Dylan Menzies, Filippo Maria Fazi. Decoding and Compression of Channel and Scene Objects for Spatial Audio |
2152 | -- | 2161 | Eunwoo Song, Frank K. Soong, Hong-Goo Kang. Effective Spectral and Excitation Modeling Techniques for LSTM-RNN-Based Speech Synthesis Systems |
2162 | -- | 2175 | Pulkit Sharma, Vinayak Abrol, Anil Kumar Sao. Deep-Sparse-Representation-Based Features for Speech Recognition |
2176 | -- | 2187 | Iynkaran Natgunanathan, Yong Xiang, Guang Hua, Gleb Beliakov, John Yearwood. Patchwork-Based Multilayer Audio Watermarking |
2188 | -- | 2198 | Chengzhu Yu, John H. L. Hansen. Active Learning Based Constrained Clustering for Speaker Diarization |
2199 | -- | 2208 | Emil Solsbæk Ottosen, Monika Dörfler. A Phase Vocoder Based on Nonstationary Gabor Frames |
2209 | -- | 2222 | Boaz Schwartz, Sharon Gannot, Emanuel A. P. Habets. Two Model-Based EM Algorithms for Blind Source Separation in Noisy Environments |
2223 | -- | 2236 | Maja Taseska, Emanuel A. P. Habets. Nonstationary Noise PSD Matrix Estimation for Multichannel Blind Speech Extraction |
2237 | -- | 2250 | Bruno Di Giorgi, Simon Dixon, Massimiliano Zanoni, Augusto Sarti. A Data-Driven Model of Tonal Chord Sequence Complexity |
2251 | -- | 0 | Nikolaos Stefanakis, Despoina Pavlidi, Athanasios Mouchtaris. Corrections to "Perpendicular Cross-Spectra Fusion for Sound Source Localization With a Planar Microphone Array" |