Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 22, Issue 9

1345 -- 1354Bruno Masiero, Michael Vorländer. A Framework for the Calculation of Dynamic Crosstalk Cancellation Filters
1355 -- 1365Alexander Schasse, Rainer Martin. Estimation of Subband Speech Correlations for Noise Reduction via MVDR Processing
1366 -- 1378Michal Novotny, Jan Rusz, Roman Cmejla, Evzen Ruzicka. Automatic Evaluation of Articulatory Disorders in Parkinson's Disease
1379 -- 1390Felicia Lim, Wancheng Zhang, Emanuel A. P. Habets, Patrick A. Naylor. Robust Multichannel Dereverberation using Relaxed Multichannel Least Squares
1391 -- 1402Sina Hamidi Ghalehjegh, Richard C. Rose. Linear Regression Based Acoustic Adaptation for the Subspace Gaussian Mixture Model
1403 -- 1412Jonathan Botts, Lauri Savioja. Spectral and Pseudospectral Properties of Finite Difference Models Used in Audio and Room Acoustics
1413 -- 1423Yong Xiang, Iynkaran Natgunanathan, Song Guo, Wanlei Zhou, Saeid Nahavandi. Patchwork-Based Audio Watermarking Method Robust to De-synchronization Attacks
1424 -- 1433Ian Vince McLoughlin. Super-Audible Voice Activity Detection
1434 -- 1448Atiyeh Alinaghi, Philip J. B. Jackson, Qingju Liu, Wenwu Wang. Joint Mixing Vector and Binaural Model Based Stereo Source Separation

Volume 22, Issue 8

1225 -- 1235Zhibao Li, Ka Fai Cedric Yiu, Sven Nordholm. On the Indoor Beamformer Design With Reverberation
1236 -- 1247Matthew B. Hawes, Wei Liu. Sparse Array Design for Wideband Beamforming With Reduced Complexity in Tapped Delay-Lines
1248 -- 1259Yi FanChiang, Cheng-Wen Wei, Yi-Le Meng, Yu-Wen Lin, Shyh-Jye Jou, Tian-Sheuan Chang. Low Complexity Formant Estimation Adaptive Feedback Cancellation for Hearing Aids Using Pitch Based Processing
1260 -- 1273Simon Conan, Olivier Derrien, Mitsuko Aramaki, Sølvi Ystad, Richard Kronland-Martinet. A Synthesis Model With Intuitive Control Capabilities for Rolling Sounds
1274 -- 1284Christian Schüldt, Peter Händel. Decay Rate Estimators and Their Performance for Blind Reverberation Time Estimation
1285 -- 1295Sriram Ganapathy, Sri Harish Reddy Mallidi, Hynek Hermansky. Robust Feature Extraction Using Modulation Filtering of Autoregressive Models
1296 -- 1305Bo Li, Khe Chai Sim. A Spectral Masking Approach to Noise-Robust Speech Recognition Using Deep Neural Networks
1306 -- 1319Emre Yilmaz, Jort Florent Gemmeke, Hugo Van Hamme. Noise Robust Exemplar Matching Using Sparse Representations of Speech
1320 -- 1335Dominic Schmid, Gerald Enzner, Sarmad Malik, Dorothea Kolossa, Rainer Martin. Variational Bayesian Inference for Multichannel Dereverberation and Noise Reduction

Volume 22, Issue 7

1117 -- 1129Mohamad Hasan Bahari, Najim Dehak, Hugo Van Hamme, Lukas Burget, Ahmed M. Ali, Jim Glass. Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition
1130 -- 1138Guangzhao Bao, Yangfei Xu, Zhongfu Ye. Learning a Discriminative Dictionary for Single-Channel Speech Separation
1139 -- 1147Ian J. Kelly, Francis M. Boland. Detecting Arrivals in Room Impulse Responses With Dynamic Time Warping
1148 -- 1157Markus Guldenschuh, Raymond A. de Callafon. Detection of Secondary-Path Irregularities in Active Noise Control Headphones
1158 -- 1171Sin-Horng Chen, Chiao-Hua Hsieh, Chen-Yu Chiang, Hsi-Chun Hsiao, Yih-Ru Wang, Yuan-Fu Liao, Hsiu-Min Yu. Modeling of Speaking Rate Influences on Mandarin Speech Prosody and Its Application to Speaking Rate-controlled TTS
1172 -- 1183Danilo Comminiello, Michele Scarpiniti, Luis Antonio Azpicueta-Ruiz, Jerónimo Arenas-García, Aurelio Uncini. Nonlinear Acoustic Echo Cancellation Based on Sparse Functional Link Representations
1184 -- 1194Wen Zhang 0002, Thushara D. Abhayapala. Three Dimensional Sound Field Reproduction using Multiple Circular Loudspeaker Arrays: Functional Analysis Guided Approach
1195 -- 1207Maja Taseska, Emanuel A. P. Habets. Informed Spatial Filtering for Sound Extraction Using Distributed Microphone Arrays
1208 -- 1218Mo Shen, Daisuke Kawahara, Sadao Kurohashi. Dependency Parse Reranking with Rich Subtree Features

Volume 22, Issue 6

1003 -- 1012Vipul Arora, Laxmidhar Behera. Musical Source Clustering and Identification in Polyphonic Audio
1013 -- 1022Rajeev C. Nongpiur. Design of Minimax Broadband Beamformers that are Robust to Microphone Gain, Phase, and Position Errors
1023 -- 1036Arun Venkitaraman, Chandra Sekhar Seelamantula. Binaural Signal Processing Motivated Generalized Analytic Signal Construction and AM-FM Demodulation
1037 -- 1046Jürgen T. Geiger, Felix Weninger, Jort F. Gemmeke, Martin Wöllmer, Björn Schuller, Gerhard Rigoll. Memory-Enhanced Neural Networks and NMF for Robust ASR
1047 -- 1055Haiquan Zhao, Yi Yu, Shibin Gao, Xiangping Zeng, Zhengyou He. Memory Proportionate APA with Individual Activation Factors for Acoustic Echo Cancellation
1056 -- 1068Mehrdad J. Gangeh, Pouria Fewzee, Ali Ghodsi, Mohamed S. Kamel, Fakhri Karray. Multiview Supervised Dictionary Learning in Speech Emotion Recognition
1069 -- 1081Jae Hun Choi, Joon-Hyuk Chang. Dual-Microphone Voice Activity Detection Technique Based on Two-Step Power Level Difference Ratio
1082 -- 1095Xavier Alameda-Pineda, Radu Horaud. A Geometric Approach to Sound Source Localization from Time-Delay Estimates
1096 -- 1108Klaus Reindl, Stefan Meier, Hendrik Barfuss, Walter Kellermann. Minimum Mutual Information-Based Linearly Constrained Broadband Signal Extraction

Volume 22, Issue 5

871 -- 880Weibin Zhang, Pascale Fung. Discriminatively Trained Sparse Inverse Covariance Matrices for Speech Recognition
881 -- 896Hung-yi Lee, Sz-Rung Shiang, Ching-feng Yeh, Yun-Nung Chen, Yu Huang, Sheng-yi Kong, Lin-Shan Lee. Spoken Knowledge Organization by Semantic Structuring and a Prototype Course Lecture System for Personalized Learning
897 -- 909L. Zão, R. Coelho, Patrick Flandrin. Speech Enhancement with EMD and Hurst-Based Mode Selection
910 -- 920Daniele Giacobello, Mads Græsbøll Christensen, Tobias Lindstrøm Jensen, Manohar N. Murthi, Søren Holdt Jensen, Marc Moonen. Stable 1-Norm Error Minimization Based Linear Predictors for Speech Modeling
921 -- 934Yesenia Lacouture-Parodi, Emanuel A. P. Habets, Jingdong Chen, Jacob Benesty. Multichannel Noise Reduction in the Karhunen-Loève Expansion Domain
935 -- 943Seyed Omid Sadjadi, John H. L. Hansen. Blind Spectral Weighting for Robust Speaker Identification under Reverberation Mismatch
944 -- 953Gautam Varma Mantena, Sivanand Achanta, Kishore Prahallad. Query-by-Example Spoken Term Detection using Frequency Domain Linear Prediction and Non-Segmental Dynamic Time Warping
954 -- 964Christopher Osterwise, Steven L. Grant. On Over-Determined Frequency Domain BSS
965 -- 976Daniel P. Jarrett, Maja Taseska, Emanuel A. P. Habets, Patrick A. Naylor. Noise Reduction in the Spherical Harmonic Domain Using a Tradeoff Beamformer and Narrowband DOA Estimates
979 -- 993Verena Rieser, Oliver Lemon, Simon Keizer. Natural Language Generation as Incremental Planning Under Uncertainty: Adaptive Information Presentation for Statistical Dialogue Systems
993 -- 994Jordan Cheer, Stephen J. Elliott. Comments on "Complete Parallel Narrowband Active Noise Control Systems"

Volume 22, Issue 4

745 -- 777Jinyu Li, Li Deng, Yifan Gong, Reinhold Haeb-Umbach. An Overview of Noise-Robust Automatic Speech Recognition
778 -- 784Ruhi Sarikaya, Geoffrey E. Hinton, Anoop Deoras. Application of Deep Belief Networks for Natural Language Understanding
785 -- 799Romain Serizel, Marc Moonen, Bas van Dijk, Jan Wouters. Low-rank Approximation Based Multichannel Wiener Filter Algorithms for Noise Reduction with Application in Cochlear Implants
800 -- 815Marco Crocco, Andrea Trucco. Design of Superdirective Planar Arrays With Sparse Aperiodic Layouts for Processing Broadband Signals via 3-D Beamforming
816 -- 825José R. Zapata, Matthew E. P. Davies, Emilia Gómez. Multi-Feature Beat Tracking
826 -- 835Arun Narayanan, DeLiang Wang. Investigation of Speech Separation as a Front-End for Noise Robust Speech Recognition
836 -- 845Xiaojia Zhao, Yuxuan Wang, DeLiang Wang. Robust Speaker Identification in Noisy and Reverberant Conditions
846 -- 857Sandro Cumani, Oldrich Plchot, Pietro Laface. On the use of i-vector posterior distributions in Probabilistic Linear Discriminant Analysis
858 -- 862Chung-Hsien Wu, Han-Ping Shen, Yan-Ting Yang. Chinese-English Phone Set Construction for Code-Switching ASR Using Acoustic and DNN-Extracted Articulatory Features

Volume 22, Issue 3

585 -- 595Chung-Hsien Wu, Yi-Chin Huang, Chung-Han Lee, Jun-Cheng Guo. Synthesis of Spontaneous Speech With Syllable Contraction Using State-Based Context-Dependent Voice Transformation
596 -- 607Manu Airaksinen, Tuomo Raitio, Brad H. Story, Paavo Alku. Quasi Closed Phase Glottal Inverse Filtering Analysis With Weighted Linear Prediction
608 -- 619Jae-Mo Yang, Hong-Goo Kang. Online Speech Dereverberation Algorithm Based on Adaptive Multichannel Linear Prediction
620 -- 633Afsaneh Asaei, Mohammad Golbabaee, Hervé Bourlard, Volkan Cevher. Structured Sparsity Models for Reverberant Speech Separation
634 -- 646Rajan S. Rashobh, Andy W. H. Khong, Di Liu. Multichannel Equalization in the KLT and Frequency Domains With Application to Speech Dereverberation
647 -- 658Prasanga N. Samarasinghe, Thushara D. Abhayapala, Mark A. Poletti. Wavefield Analysis Over Large Areas Using Distributed Higher Order Microphones
659 -- 671Wen-Li Wei, Chung-Hsien Wu, Jen-Chun Lin, Han Li. Exploiting Psychological Factors for Interaction Style Recognition in Spoken Conversation
672 -- 681Stanislaw Andrzej Raczynski, Emmanuel Vincent. Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
682 -- 696Dalei Wu, Wei-Ping Zhu, M. N. S. Swamy. The Theory of Compressive Sensing Matching Pursuit Considering Time-domain Noise with Application to Speech Enhancement
697 -- 710Tejaswi Nanjundaswamy, Kenneth Rose. Cascaded Long Term Prediction for Enhanced Compression of Polyphonic Audio Signals
711 -- 726Kartik Audhkhasi, Andreas M. Zavou, Panayiotis G. Georgiou, Shrikanth S. Narayanan. Theoretical Analysis of Diversity in an Ensemble of Automatic Speech Recognition Systems
727 -- 739Joonas Nikunen, Tuomas Virtanen. Direction of Arrival Based Spatial Covariance Model for Blind Sound Source Separation

Volume 22, Issue 2

293 -- 302Dehong Gao, Wenjie Li, Xiaoyan Cai, Renxian Zhang, Ouyang You. Sequential Summarization: A Full View of Twitter Trending Topics
303 -- 313P. W. J. van Hengel, Johannes D. Krijnders. A Comparison of Spectro-Temporal Representations of Audio Signals
314 -- 324Imed Zitouni, Yassine Benajiba. Aligned-Parallel-Corpora Based Semi-Supervised Learning for Arabic Mention Detection
325 -- 334Emilio Molina, Ana M. Barbancho, Lorenzo J. Tardón, Isabel Barbancho. Dissonance Reduction In Polyphonic Audio Using Harmonic Reorganization
335 -- 346Daniel Pak-Kong Lun, Tak-Wai Shen, K. C. Ho. A Novel Expectation-Maximization Framework for Speech Enhancement in Non-Stationary Noise Environments
347 -- 353Stefano Cosentino, Tiago H. Falk, David McAlpine, Torsten Marquardt. Cochlear Implant Filterbank Design and Optimization: A Simulation Study
354 -- 367Mehrez Souden, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani. Location Feature Integration for Clustering-Based Speech Separation in Distributed Microphone Arrays
368 -- 380Heikki Kallasjoki, Jort F. Gemmeke, Kalle J. Palomäki. Estimating Uncertainty to Improve Exemplar-Based Feature Enhancement for Noise Robust Speech Recognition
381 -- 391Taufiq Hasan, John H. L. Hansen. Maximum Likelihood Acoustic Factor Analysis Models for Robust Speaker Verification in Noise
392 -- 402Ofer Schwartz, Sharon Gannot. Speaker Tracking Using Recursive EM Algorithms
403 -- 416Yu Tsao, Shigeki Matsuda, Chiori Hori, Hideki Kashioka, Chin-Hui Lee. A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling
417 -- 429Pui-Yu Hui, Helen Meng. Latent Semantic Analysis for Multimodal User Input With Speech and Gestures
430 -- 440Jesper Jensen, Cees H. Taal. Speech Intelligibility Prediction Based on Mutual Information
441 -- 452Andrea Primavera, Stefania Cecchi, Junfeng Li, Francesco Piazza. Objective and Subjective Investigation on a Novel Method for Digital Reverberator Parameters Estimation
453 -- 464Matt Speed, Damian T. Murphy, David M. Howard. Modeling the Vocal Tract Transfer Function Using a 3D Digital Waveguide Mesh
465 -- 476Hüseyin Hacihabiboglu. Theoretical Analysis of Open Spherical Microphone Arrays for Acoustic Intensity Measurements
477 -- 492Taemin Cho, Juan Pablo Bello. On the Relative Importance of Individual Components of Chord Recognition Systems
493 -- 504Takuma Otsuka, Katsuhiko Ishiguro, Hiroshi Sawada, Hiroshi G. Okuno. Bayesian Nonparametrics for Microphone Array Processing
505 -- 517Jianjun He, Ee-Leng Tan, Woon-Seng Gan. Linear Estimation Based Primary-Ambient Extraction for Stereo Audio Signals
518 -- 530Sira Gonzalez, Mike Brookes. PEFAC - A Pitch Estimation Algorithm Robust to High Levels of Noise
531 -- 541Min Zhang 0005, Xiangyu Duan, Wenliang Chen. Bayesian Constituent Context Model for Grammar Induction
542 -- 555Dah-Chung Chang, Fei-Tao Chu. Feedforward Active Noise Control With a New Variable Tap-Length and Step-Size Filtered-X LMS Algorithm
556 -- 575Matt McVicar, Raúl Santos-Rodriguez, Yizhao Ni, Tijl De Bie. Automatic Chord Estimation from Audio: A Review of the State of the Art

Volume 22, Issue 12

1687 -- 0Li Deng. Farewell editorial: keeping up the momentum of innovations
1688 -- 1700Sree Harsha Yella, Hervé Bourlard. Overlapping speech detection using long-term conversational features for speaker diarization in meeting room conversations
1701 -- 1712Ravi K. Chivukula, Yuriy A. Reznik, Yanyan Hu, Venkat Devarajan, Mythreya Jayendra-Lakshman. Fast algorithms for low-delay TDAC filterbanks in MPEG-4 AAC-ELD
1713 -- 1725Shaofei Xue, Ossama Abdel Hamid, Hui Jiang 0001, Li-Rong Dai, Qingfeng Liu. Fast adaptation of deep neural network based on discriminant codes for speech recognition
1726 -- 1737Matthew E. P. Davies, Philippe Hamel, Kazuyoshi Yoshii, Masataka Goto. AutoMashUpper: automatic creation of multi-song music mashups
1738 -- 1749Chao Weng, David L. Thomson, Patrick Haffner, Biing-Hwang Juang. Latent semantic rational kernels for topic spotting on conversational speech
1750 -- 1764Neil Wachowski, Mahmood R. Azimi-Sadjadi. Detection and classification of nonstationary transient signals using sparse approximations and Bayesian networks
1765 -- 1776Graham Percival, George Tzanetakis. Streamlined tempo estimation based on autocorrelation and cross-correlation with pulses
1777 -- 1791Annea Barkefors, Mikael Sternad, Lars-Johan Brännmark. Design and analysis of linear quadratic Gaussian feedforward controllers for active noise control
1792 -- 1802Maximo Cobos, Juan José Pérez Solano, Santiago Felici-Castell, Jaume Segura, Juan M. Navarro. Cumulative-sum-based localization of sound events in low-cost wireless acoustic sensor networks
1803 -- 1814Vladimir Tourbabin, Boaz Rafaely. Theoretical framework for the optimization of microphone array configuration for humanoid robot audition
1815 -- 1824Yuriy V. Zakharov, Vitor H. Nascimento. Sliding-window RLS low-cost implementation of proportionate affine projection algorithms
1825 -- 1832Stefano D'Angelo, Vesa Välimäki. Generalized Moog ladder filter: part I-linear analysis and parameterization
1833 -- 1848Na Yang, He Ba, Weiyang Cai, Ilker Demirkol, Wendi Heinzelman. BaNa: a noise resilient fundamental frequency detection algorithm for speech and music
1849 -- 1858Yuxuan Wang, Arun Narayanan, DeLiang Wang. On training targets for supervised speech separation
1859 -- 1872Ling-Hui Chen, Zhen-Hua Ling, Li-juan Liu, Li-Rong Dai. Voice conversion using deep neural networks with layer-wise generative training
1873 -- 1883Stefano D'Angelo, Vesa Välimäki. Generalized Moog ladder filter: part II-explicit nonlinear model through a novel delay-free loop implementation method
1884 -- 1893Zafar Rafii, Zhiyao Duan, Bryan Pardo. Combining rhythm-based and pitch-based methods for background and melody separation
1894 -- 1904Jussi Rämö, Vesa Välimäki, Balázs Bank. High-precision parallel graphic equalizer
1905 -- 1917Yannis Panagakis, Constantine Kotropoulos, Gonzalo R. Arce. Music genre classification via joint sparse low-rank representation of audio features
1918 -- 1930Akira Maezawa, Katsutoshi Itoyama, Kazuyoshi Yoshii, Hiroshi G. Okuno. Nonparametric Bayesian dereverberation of power spectrograms based on infinite-order autoregressive processes
1931 -- 1940Martin Krawczyk, Timo Gerkmann. STFT phase reconstruction in voiced speech for an improved single-channel speech enhancement
1941 -- 1950Vahid Khanagha, Khalid Daoudi, Hussein M. Yahia. Detection of glottal closure instants based on the microcanonical multiscale formalism
1951 -- 1964A. Venturini, L. Zão, R. Coelho. On speech features fusion, α-integration Gaussian modeling and multi-style training for noise robust speaker classification
1965 -- 1977Peter Foster, Matthias Mauch, Simon Dixon. Sequential complexity as a descriptor for musical similarity
1978 -- 1992Gang Liu, John H. L. Hansen. An investigation into back-end advancements for speaker recognition in multi-session and noisy enrollment scenarios
1993 -- 2002Jitong Chen, Yuxuan Wang, DeLiang Wang. A feature study for classification-based speech separation at low signal-to-noise ratios
2003 -- 2011Jelle Van Mourik, Damian Murphy. Explicit higher-order FDTD schemes for 3D room acoustic simulation
2012 -- 2024Pei Chee Yong, Sven Nordholm, Hai Huyen Dam. Effective binaural multi-channel processing algorithm for improved environmental presence
2025 -- 2033Austin Chen, Mark A. Hasegawa-Johnson. Mixed stereo audio classification using a stereo-input mixed-to-panned level feature
2034 -- 2047Gongping Huang, Jacob Benesty, Tao Long, Jingdong Chen. A family of maximum SNR filters for noise reduction
2048 -- 2058Su Yan, Xiaojun Wan. SRRank: leveraging semantic roles for extractive multi-document summarization
2059 -- 2073Hideyuki Tachibana, Nobutaka Ono, Hirokazu Kameoka, Shigeki Sagayama. Harmonic/percussive sound separation based on anisotropic smoothness of spectrograms
2074 -- 2086Jose Manuel Gil-Cacho, Toon van Waterschoot, Marc Moonen, Søren Holdt Jensen. A frequency-domain adaptive filter (FDAF) prediction error method (PEM) framework for double-talk-robust acoustic echo cancellation
2087 -- 2100Qi Wang, W. L. Woo, S. S. Dlay. Informed single-channel speech separation using HMM-GMM user-generated exemplar source
2101 -- 2111Daniel Erro, Tudor-Catalin Zorila, Yannis Stylianou. Enhancing the intelligibility of statistically generated synthetic speech by means of noise-independent modifications
2112 -- 2121Yi Jiang, DeLiang Wang, Runsheng Liu, Zhenming Feng. Binaural classification for reverberant speech segregation using deep neural networks
2122 -- 2132Li Su, Hsin-Ming Lin, Yi-Hsuan Yang. Sparse modeling of magnitude and phase-derived spectra for playing technique classification
2133 -- 2145Vinod Veera Reddy, Andy W. H. Khong, Boon Poh Ng. Unambiguous speech DOA estimation under spatial aliasing conditions
2146 -- 2157Amir Mohammadi, Seyyed Saeed Sarfjoo, Cenk Demiroglu. Eigenvoice speaker adaptation with minimal data for statistical speech synthesis systems using a MAP approach and nearest-neighbors
2158 -- 2168Kun Han, DeLiang Wang. Neural network based pitch tracking in very noisy speech
2169 -- 2181Yongsheng Mu, Peifeng Ji, Wei Ji, Ming Wu, Jun Yang. Modeling and compensation for the distortion of parametric loudspeakers using a one-dimension Volterra filter
2182 -- 2196Oliver Thiergart, Maja Taseska, Emanuel A. P. Habets. An informed parametric spatial filter based on instantaneous direction-of-arrival estimates
2197 -- 2206João Felipe Santos, Tiago H. Falk. Updating the SRMR-CI metric for improved intelligibility prediction for cochlear implant users
2207 -- 2217Seon-Man Kim, Hong Kook Kim. Direction-of-arrival based SNR estimation for dual-microphone speech enhancement
2218 -- 2232Takuma Otsuka, Katsuhiko Ishiguro, Takuya Yoshioka, Hiroshi Sawada, Hiroshi G. Okuno. Multichannel sound source dereverberation and separation for arbitrary number of sources based on Bayesian nonparametrics
2233 -- 2243Johannes Traa, Paris Smaragdis. Multichannel source separation and tracking with RANSAC and directional statistics
2244 -- 2255Weifeng Li, Longbiao Wang, Yicong Zhou, John Dines, Mathew Magimai-Doss, Hervé Bourlard, Qingmin Liao. Feature mapping of multiple beamformed sources for robust overlapping speech recognition using a microphone array
2256 -- 0Yi FanChiang, Cheng-Wen Wei, Yi-Le Meng, Yu-Wen Lin, Shyh-Jye Jou, Tian-Sheuan Chang. Correction to "Low complexity formant estimation adaptive feedback cancellation for hearing aids using pitch based processing"

Volume 22, Issue 11

1581 -- 1589Jian Xu, Zhi-Jie Yan, Qiang Huo. An Unsupervised Adaptation Approach to Leveraging Feedback Loop Data by Using i-Vector for Data Clustering and Selection
1590 -- 1600Sandro Cumani, Pietro Laface. Large-Scale Training of Pairwise Support Vector Machines for Speaker Recognition
1601 -- 1611Jun Du, Qiang Huo. An Improved VTS Feature Compensation using Mixture Models of Distortion and IVN Training for Noisy Speech Recognition
1612 -- 1623Masahito Togami, Yohei Kawaguchi. Simultaneous Optimization of Acoustic Echo Reduction, Speech Dereverberation, and Noise Reduction against Mutual Interference
1624 -- 1635Jorge Lorente, Miguel Ferrer, Maria de Diego, Alberto González. GPU Implementation of Multichannel Adaptive Algorithms for Local Active Noise Control
1636 -- 1647Thomas Helie. Simulation of Fractional-Order Low-Pass Filters
1648 -- 1659Bruno Defraene, Toon van Waterschoot, Moritz Diehl, Marc Moonen. Embedded-Optimization-Based Loudspeaker Precompensation Using a Hammerstein Loudspeaker Model
1660 -- 1669Guangsen Wang, Khe Chai Sim. Regression-Based Context-Dependent Modeling of Deep Neural Networks for Speech Recognition
1670 -- 1680Roland Badeau, Mark D. Plumbley. Multichannel High-Resolution NMF for Modeling Convolutive Mixtures of Non-Stationary Signals in the Time-Frequency Domain

Volume 22, Issue 10

1455 -- 1466Liheng Zhao, Jacob Benesty, Jingdong Chen. Design of Robust Differential Microphone Arrays
1467 -- 1482Pooja Jain, Ram Bilas Pachori. Event-Based Method for Instantaneous Fundamental Frequency Estimation from Voiced Speech Based on Eigenvalue Decomposition of the Hankel Matrix
1483 -- 1493Yonatan Vaizman, Brian McFee, Gert R. G. Lanckriet. Codebook-Based Audio Feature Representation for Music Information Retrieval
1494 -- 1505O. Nadiri, B. Rafaely. Localization of Multiple Speakers under High Reverberation using a Spherical Microphone Array and the Direct-Path Dominance Test
1506 -- 1521Zhizheng Wu, Tuomas Virtanen, Engsiong Chng, Haizhou Li. Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion
1522 -- 1532Dumidu S. Talagala, Wen Zhang 0002, Thushara D. Abhayapala. Efficient Multi-Channel Adaptive Room Compensation for Spatial Soundfield Reproduction Using a Modal Decomposition
1533 -- 1545Ossama Abdel Hamid, Abdel-rahman Mohamed, Hui Jiang 0001, Li Deng, Gerald Penn, Dong Yu. Convolutional Neural Networks for Speech Recognition
1546 -- 1557Shoichi Koyama, Ken'ichi Furuya, Yusuke Hiwasaki, Yoichi Haneda, Yôiti Suzuki. Wave Field Reconstruction Filtering in Cylindrical Harmonic Domain for With-Height Recording and Reproduction
1558 -- 1570Chia-Ping Chen, Yi-Chin Huang, Chung-Hsien Wu, Kuan-De Lee. Polyglot Speech Synthesis Based on Cross-Lingual Frame Selection Using Auditory and Articulatory Features

Volume 22, Issue 1

5 -- 0Li Deng, Steve Renals, Marcello Federico, Mari Ostendorf. Editorial: Expanding the Technical Reach of our Transactions
6 -- 16Jalal Taghia, Rainer Martin. Objective Intelligibility Measures Based on Mutual Information for Speech Subjected to Speech Enhancement Processing
17 -- 27Liang Lu, Arnab Ghoshal, Steve Renals. Cross-Lingual Subspace Gaussian Mixture Models for Low-Resource Speech Recognition
28 -- 40Milica Gasic, Steve Young. Gaussian Processes for POMDP-Based Dialogue Manager Optimization
41 -- 53Imen Marrakchi-Mezghani, Gaël Mahé, Sonia Djaziri Larbi, Meriem Jaïdane, Monia Turki-Hadj Alouane. Nonlinear Audio Systems Identification Through Audio Input Gaussianization
54 -- 66Joao B. Crespo, Richard C. Hendriks. Multizone Speech Reinforcement
67 -- 79Chao Pan, Jingdong Chen, Jacob Benesty. Performance Study of the MVDR Beamformer as a Function of the Source Incidence Angle
80 -- 94Hung-yi Lee, Lin-Shan Lee. Improved Semantic Retrieval of Spoken Content by Document/Query Expansion with Random Walk Over Acoustic Similarity Graphs
95 -- 109Volker Leutnant, Alexander Krueger, Reinhold Haeb-Umbach. A New Observation Model in the Logarithmic Mel Power Spectral Domain for the Automatic Recognition of Noisy Reverberant Speech
110 -- 124Nancy F. Chen, Sharon W. Tam, Wade Shen, Joseph P. Campbell. Characterizing Phonetic Transformations and Acoustic Differences Across English Dialects
125 -- 137Dejan Markovic, Konrad Kowalczyk, Fabio Antonacci, Christian Hofmann, Augusto Sarti, Walter Kellermann. Estimation of Acoustic Reflection Coefficients Through Pseudospectrum Matching
138 -- 150Zhiyao Duan, Jinyu Han, Bryan Pardo. Multi-pitch Streaming of Harmonic Sound Mixtures
151 -- 160Shilin Liu, Khe Chai Sim. Temporally Varying Weight Regression: A Semi-Parametric Trajectory Model for Automatic Speech Recognition
161 -- 171Vikrant Singh Tomar, Richard C. Rose. A Family of Discriminative Manifold Learning Algorithms and Their Application to Speech Recognition
172 -- 183Hironori Doi, Tomoki Toda, Keigo Nakamura, Hiroshi Saruwatari, Kiyohiro Shikano. Alaryngeal Speech Enhancement Based on One-to-Many Eigenvoice Conversion
184 -- 192Ebru Arisoy, Stanley F. Chen, Bhuvana Ramabhadran, Abhinav Sethy. Converting Neural Network Language Models into Back-off Language Models for Efficient Decoding in Automatic Speech Recognition
193 -- 204Craig T. Jin, Nicolas Epain, Abhaya Parthy. Design, Optimization and Evaluation of a Dual-Radius Spherical Microphone Array
205 -- 216Rémi Mignot, Gilles Chardon, Laurent Daudet. Low Frequency Interpolation of Room Impulse Responses Using Compressed Sensing
217 -- 227Mohammed Senoussaoui, Patrick Kenny, Themos Stafylakis, Pierre Dumouchel. A Study of the Cosine Distance-Based Mean Shift for Telephone Speech Diarization
228 -- 237Hideyuki Tachibana, Nobutaka Ono, Shigeki Sagayama. Singing Voice Enhancement in Monaural Music Signals Based on Two-stage Harmonic/Percussive Sound Separation on Multiple Resolution Spectrograms
238 -- 247Noam R. Shabtai, Boaz Rafaely. Generalized Spherical Array Beamforming for Binaural Speech Reproduction
248 -- 259Sandro Cumani, Pietro Laface. Factorized Sub-Space Estimation for Fast and Memory Effective I-vector Extraction
260 -- 273Yuan Zeng, Richard C. Hendriks. Distributed Delay and Sum Beamformer for Speech Enhancement via Randomized Gossip
274 -- 286Zhenghua Li, Min Zhang, Wanxiang Che, Ting Liu, Wenliang Chen. Joint Optimization for Chinese POS Tagging and Dependency Parsing