IEEE Transactions on Audio, Speech & Language Processing

researchr

You are not signed in
Sign in
Sign up

1345	--	1354	Bruno Masiero, Michael Vorländer. A Framework for the Calculation of Dynamic Crosstalk Cancellation Filters
1355	--	1365	Alexander Schasse, Rainer Martin. Estimation of Subband Speech Correlations for Noise Reduction via MVDR Processing
1366	--	1378	Michal Novotny, Jan Rusz, Roman Cmejla, Evzen Ruzicka. Automatic Evaluation of Articulatory Disorders in Parkinson's Disease
1379	--	1390	Felicia Lim, Wancheng Zhang, Emanuel A. P. Habets, Patrick A. Naylor. Robust Multichannel Dereverberation using Relaxed Multichannel Least Squares
1391	--	1402	Sina Hamidi Ghalehjegh, Richard C. Rose. Linear Regression Based Acoustic Adaptation for the Subspace Gaussian Mixture Model
1403	--	1412	Jonathan Botts, Lauri Savioja. Spectral and Pseudospectral Properties of Finite Difference Models Used in Audio and Room Acoustics
1413	--	1423	Yong Xiang, Iynkaran Natgunanathan, Song Guo, Wanlei Zhou, Saeid Nahavandi. Patchwork-Based Audio Watermarking Method Robust to De-synchronization Attacks
1424	--	1433	Ian Vince McLoughlin. Super-Audible Voice Activity Detection
1434	--	1448	Atiyeh Alinaghi, Philip J. B. Jackson, Qingju Liu, Wenwu Wang. Joint Mixing Vector and Binaural Model Based Stereo Source Separation

1225	--	1235	Zhibao Li, Ka Fai Cedric Yiu, Sven Nordholm. On the Indoor Beamformer Design With Reverberation
1236	--	1247	Matthew B. Hawes, Wei Liu. Sparse Array Design for Wideband Beamforming With Reduced Complexity in Tapped Delay-Lines
1248	--	1259	Yi FanChiang, Cheng-Wen Wei, Yi-Le Meng, Yu-Wen Lin, Shyh-Jye Jou, Tian-Sheuan Chang. Low Complexity Formant Estimation Adaptive Feedback Cancellation for Hearing Aids Using Pitch Based Processing
1260	--	1273	Simon Conan, Olivier Derrien, Mitsuko Aramaki, Sølvi Ystad, Richard Kronland-Martinet. A Synthesis Model With Intuitive Control Capabilities for Rolling Sounds
1274	--	1284	Christian Schüldt, Peter Händel. Decay Rate Estimators and Their Performance for Blind Reverberation Time Estimation
1285	--	1295	Sriram Ganapathy, Sri Harish Reddy Mallidi, Hynek Hermansky. Robust Feature Extraction Using Modulation Filtering of Autoregressive Models
1296	--	1305	Bo Li, Khe Chai Sim. A Spectral Masking Approach to Noise-Robust Speech Recognition Using Deep Neural Networks
1306	--	1319	Emre Yilmaz, Jort Florent Gemmeke, Hugo Van Hamme. Noise Robust Exemplar Matching Using Sparse Representations of Speech
1320	--	1335	Dominic Schmid, Gerald Enzner, Sarmad Malik, Dorothea Kolossa, Rainer Martin. Variational Bayesian Inference for Multichannel Dereverberation and Noise Reduction

1117	--	1129	Mohamad Hasan Bahari, Najim Dehak, Hugo Van Hamme, Lukas Burget, Ahmed M. Ali, Jim Glass. Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition
1130	--	1138	Guangzhao Bao, Yangfei Xu, Zhongfu Ye. Learning a Discriminative Dictionary for Single-Channel Speech Separation
1139	--	1147	Ian J. Kelly, Francis M. Boland. Detecting Arrivals in Room Impulse Responses With Dynamic Time Warping
1148	--	1157	Markus Guldenschuh, Raymond A. de Callafon. Detection of Secondary-Path Irregularities in Active Noise Control Headphones
1158	--	1171	Sin-Horng Chen, Chiao-Hua Hsieh, Chen-Yu Chiang, Hsi-Chun Hsiao, Yih-Ru Wang, Yuan-Fu Liao, Hsiu-Min Yu. Modeling of Speaking Rate Influences on Mandarin Speech Prosody and Its Application to Speaking Rate-controlled TTS
1172	--	1183	Danilo Comminiello, Michele Scarpiniti, Luis Antonio Azpicueta-Ruiz, Jerónimo Arenas-García, Aurelio Uncini. Nonlinear Acoustic Echo Cancellation Based on Sparse Functional Link Representations
1184	--	1194	Wen Zhang 0002, Thushara D. Abhayapala. Three Dimensional Sound Field Reproduction using Multiple Circular Loudspeaker Arrays: Functional Analysis Guided Approach
1195	--	1207	Maja Taseska, Emanuel A. P. Habets. Informed Spatial Filtering for Sound Extraction Using Distributed Microphone Arrays
1208	--	1218	Mo Shen, Daisuke Kawahara, Sadao Kurohashi. Dependency Parse Reranking with Rich Subtree Features

1003	--	1012	Vipul Arora, Laxmidhar Behera. Musical Source Clustering and Identification in Polyphonic Audio
1013	--	1022	Rajeev C. Nongpiur. Design of Minimax Broadband Beamformers that are Robust to Microphone Gain, Phase, and Position Errors
1023	--	1036	Arun Venkitaraman, Chandra Sekhar Seelamantula. Binaural Signal Processing Motivated Generalized Analytic Signal Construction and AM-FM Demodulation
1037	--	1046	Jürgen T. Geiger, Felix Weninger, Jort F. Gemmeke, Martin Wöllmer, Björn Schuller, Gerhard Rigoll. Memory-Enhanced Neural Networks and NMF for Robust ASR
1047	--	1055	Haiquan Zhao, Yi Yu, Shibin Gao, Xiangping Zeng, Zhengyou He. Memory Proportionate APA with Individual Activation Factors for Acoustic Echo Cancellation
1056	--	1068	Mehrdad J. Gangeh, Pouria Fewzee, Ali Ghodsi, Mohamed S. Kamel, Fakhri Karray. Multiview Supervised Dictionary Learning in Speech Emotion Recognition
1069	--	1081	Jae Hun Choi, Joon-Hyuk Chang. Dual-Microphone Voice Activity Detection Technique Based on Two-Step Power Level Difference Ratio
1082	--	1095	Xavier Alameda-Pineda, Radu Horaud. A Geometric Approach to Sound Source Localization from Time-Delay Estimates
1096	--	1108	Klaus Reindl, Stefan Meier, Hendrik Barfuss, Walter Kellermann. Minimum Mutual Information-Based Linearly Constrained Broadband Signal Extraction

871	--	880	Weibin Zhang, Pascale Fung. Discriminatively Trained Sparse Inverse Covariance Matrices for Speech Recognition
881	--	896	Hung-yi Lee, Sz-Rung Shiang, Ching-feng Yeh, Yun-Nung Chen, Yu Huang, Sheng-yi Kong, Lin-Shan Lee. Spoken Knowledge Organization by Semantic Structuring and a Prototype Course Lecture System for Personalized Learning
897	--	909	L. Zão, R. Coelho, Patrick Flandrin. Speech Enhancement with EMD and Hurst-Based Mode Selection
910	--	920	Daniele Giacobello, Mads Græsbøll Christensen, Tobias Lindstrøm Jensen, Manohar N. Murthi, Søren Holdt Jensen, Marc Moonen. Stable 1-Norm Error Minimization Based Linear Predictors for Speech Modeling
921	--	934	Yesenia Lacouture-Parodi, Emanuel A. P. Habets, Jingdong Chen, Jacob Benesty. Multichannel Noise Reduction in the Karhunen-Loève Expansion Domain
935	--	943	Seyed Omid Sadjadi, John H. L. Hansen. Blind Spectral Weighting for Robust Speaker Identification under Reverberation Mismatch
944	--	953	Gautam Varma Mantena, Sivanand Achanta, Kishore Prahallad. Query-by-Example Spoken Term Detection using Frequency Domain Linear Prediction and Non-Segmental Dynamic Time Warping
954	--	964	Christopher Osterwise, Steven L. Grant. On Over-Determined Frequency Domain BSS
965	--	976	Daniel P. Jarrett, Maja Taseska, Emanuel A. P. Habets, Patrick A. Naylor. Noise Reduction in the Spherical Harmonic Domain Using a Tradeoff Beamformer and Narrowband DOA Estimates
979	--	993	Verena Rieser, Oliver Lemon, Simon Keizer. Natural Language Generation as Incremental Planning Under Uncertainty: Adaptive Information Presentation for Statistical Dialogue Systems
993	--	994	Jordan Cheer, Stephen J. Elliott. Comments on "Complete Parallel Narrowband Active Noise Control Systems"

745	--	777	Jinyu Li, Li Deng, Yifan Gong, Reinhold Haeb-Umbach. An Overview of Noise-Robust Automatic Speech Recognition
778	--	784	Ruhi Sarikaya, Geoffrey E. Hinton, Anoop Deoras. Application of Deep Belief Networks for Natural Language Understanding
785	--	799	Romain Serizel, Marc Moonen, Bas van Dijk, Jan Wouters. Low-rank Approximation Based Multichannel Wiener Filter Algorithms for Noise Reduction with Application in Cochlear Implants
800	--	815	Marco Crocco, Andrea Trucco. Design of Superdirective Planar Arrays With Sparse Aperiodic Layouts for Processing Broadband Signals via 3-D Beamforming
816	--	825	José R. Zapata, Matthew E. P. Davies, Emilia Gómez. Multi-Feature Beat Tracking
826	--	835	Arun Narayanan, DeLiang Wang. Investigation of Speech Separation as a Front-End for Noise Robust Speech Recognition
836	--	845	Xiaojia Zhao, Yuxuan Wang, DeLiang Wang. Robust Speaker Identification in Noisy and Reverberant Conditions
846	--	857	Sandro Cumani, Oldrich Plchot, Pietro Laface. On the use of i-vector posterior distributions in Probabilistic Linear Discriminant Analysis
858	--	862	Chung-Hsien Wu, Han-Ping Shen, Yan-Ting Yang. Chinese-English Phone Set Construction for Code-Switching ASR Using Acoustic and DNN-Extracted Articulatory Features

585	--	595	Chung-Hsien Wu, Yi-Chin Huang, Chung-Han Lee, Jun-Cheng Guo. Synthesis of Spontaneous Speech With Syllable Contraction Using State-Based Context-Dependent Voice Transformation
596	--	607	Manu Airaksinen, Tuomo Raitio, Brad H. Story, Paavo Alku. Quasi Closed Phase Glottal Inverse Filtering Analysis With Weighted Linear Prediction
608	--	619	Jae-Mo Yang, Hong-Goo Kang. Online Speech Dereverberation Algorithm Based on Adaptive Multichannel Linear Prediction
620	--	633	Afsaneh Asaei, Mohammad Golbabaee, Hervé Bourlard, Volkan Cevher. Structured Sparsity Models for Reverberant Speech Separation
634	--	646	Rajan S. Rashobh, Andy W. H. Khong, Di Liu. Multichannel Equalization in the KLT and Frequency Domains With Application to Speech Dereverberation
647	--	658	Prasanga N. Samarasinghe, Thushara D. Abhayapala, Mark A. Poletti. Wavefield Analysis Over Large Areas Using Distributed Higher Order Microphones
659	--	671	Wen-Li Wei, Chung-Hsien Wu, Jen-Chun Lin, Han Li. Exploiting Psychological Factors for Interaction Style Recognition in Spoken Conversation
672	--	681	Stanislaw Andrzej Raczynski, Emmanuel Vincent. Genre-Based Music Language Modeling with Latent Hierarchical Pitman-Yor Process Allocation
682	--	696	Dalei Wu, Wei-Ping Zhu, M. N. S. Swamy. The Theory of Compressive Sensing Matching Pursuit Considering Time-domain Noise with Application to Speech Enhancement
697	--	710	Tejaswi Nanjundaswamy, Kenneth Rose. Cascaded Long Term Prediction for Enhanced Compression of Polyphonic Audio Signals
711	--	726	Kartik Audhkhasi, Andreas M. Zavou, Panayiotis G. Georgiou, Shrikanth S. Narayanan. Theoretical Analysis of Diversity in an Ensemble of Automatic Speech Recognition Systems
727	--	739	Joonas Nikunen, Tuomas Virtanen. Direction of Arrival Based Spatial Covariance Model for Blind Sound Source Separation

293	--	302	Dehong Gao, Wenjie Li, Xiaoyan Cai, Renxian Zhang, Ouyang You. Sequential Summarization: A Full View of Twitter Trending Topics
303	--	313	P. W. J. van Hengel, Johannes D. Krijnders. A Comparison of Spectro-Temporal Representations of Audio Signals
314	--	324	Imed Zitouni, Yassine Benajiba. Aligned-Parallel-Corpora Based Semi-Supervised Learning for Arabic Mention Detection
325	--	334	Emilio Molina, Ana M. Barbancho, Lorenzo J. Tardón, Isabel Barbancho. Dissonance Reduction In Polyphonic Audio Using Harmonic Reorganization
335	--	346	Daniel Pak-Kong Lun, Tak-Wai Shen, K. C. Ho. A Novel Expectation-Maximization Framework for Speech Enhancement in Non-Stationary Noise Environments
347	--	353	Stefano Cosentino, Tiago H. Falk, David McAlpine, Torsten Marquardt. Cochlear Implant Filterbank Design and Optimization: A Simulation Study
354	--	367	Mehrez Souden, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani. Location Feature Integration for Clustering-Based Speech Separation in Distributed Microphone Arrays
368	--	380	Heikki Kallasjoki, Jort F. Gemmeke, Kalle J. Palomäki. Estimating Uncertainty to Improve Exemplar-Based Feature Enhancement for Noise Robust Speech Recognition
381	--	391	Taufiq Hasan, John H. L. Hansen. Maximum Likelihood Acoustic Factor Analysis Models for Robust Speaker Verification in Noise
392	--	402	Ofer Schwartz, Sharon Gannot. Speaker Tracking Using Recursive EM Algorithms
403	--	416	Yu Tsao, Shigeki Matsuda, Chiori Hori, Hideki Kashioka, Chin-Hui Lee. A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling
417	--	429	Pui-Yu Hui, Helen Meng. Latent Semantic Analysis for Multimodal User Input With Speech and Gestures
430	--	440	Jesper Jensen, Cees H. Taal. Speech Intelligibility Prediction Based on Mutual Information
441	--	452	Andrea Primavera, Stefania Cecchi, Junfeng Li, Francesco Piazza. Objective and Subjective Investigation on a Novel Method for Digital Reverberator Parameters Estimation
453	--	464	Matt Speed, Damian T. Murphy, David M. Howard. Modeling the Vocal Tract Transfer Function Using a 3D Digital Waveguide Mesh
465	--	476	Hüseyin Hacihabiboglu. Theoretical Analysis of Open Spherical Microphone Arrays for Acoustic Intensity Measurements
477	--	492	Taemin Cho, Juan Pablo Bello. On the Relative Importance of Individual Components of Chord Recognition Systems
493	--	504	Takuma Otsuka, Katsuhiko Ishiguro, Hiroshi Sawada, Hiroshi G. Okuno. Bayesian Nonparametrics for Microphone Array Processing
505	--	517	Jianjun He, Ee-Leng Tan, Woon-Seng Gan. Linear Estimation Based Primary-Ambient Extraction for Stereo Audio Signals
518	--	530	Sira Gonzalez, Mike Brookes. PEFAC - A Pitch Estimation Algorithm Robust to High Levels of Noise
531	--	541	Min Zhang 0005, Xiangyu Duan, Wenliang Chen. Bayesian Constituent Context Model for Grammar Induction
542	--	555	Dah-Chung Chang, Fei-Tao Chu. Feedforward Active Noise Control With a New Variable Tap-Length and Step-Size Filtered-X LMS Algorithm
556	--	575	Matt McVicar, Raúl Santos-Rodriguez, Yizhao Ni, Tijl De Bie. Automatic Chord Estimation from Audio: A Review of the State of the Art

1687	--	0	Li Deng. Farewell editorial: keeping up the momentum of innovations
1688	--	1700	Sree Harsha Yella, Hervé Bourlard. Overlapping speech detection using long-term conversational features for speaker diarization in meeting room conversations
1701	--	1712	Ravi K. Chivukula, Yuriy A. Reznik, Yanyan Hu, Venkat Devarajan, Mythreya Jayendra-Lakshman. Fast algorithms for low-delay TDAC filterbanks in MPEG-4 AAC-ELD
1713	--	1725	Shaofei Xue, Ossama Abdel Hamid, Hui Jiang 0001, Li-Rong Dai, Qingfeng Liu. Fast adaptation of deep neural network based on discriminant codes for speech recognition
1726	--	1737	Matthew E. P. Davies, Philippe Hamel, Kazuyoshi Yoshii, Masataka Goto. AutoMashUpper: automatic creation of multi-song music mashups
1738	--	1749	Chao Weng, David L. Thomson, Patrick Haffner, Biing-Hwang Juang. Latent semantic rational kernels for topic spotting on conversational speech
1750	--	1764	Neil Wachowski, Mahmood R. Azimi-Sadjadi. Detection and classification of nonstationary transient signals using sparse approximations and Bayesian networks
1765	--	1776	Graham Percival, George Tzanetakis. Streamlined tempo estimation based on autocorrelation and cross-correlation with pulses
1777	--	1791	Annea Barkefors, Mikael Sternad, Lars-Johan Brännmark. Design and analysis of linear quadratic Gaussian feedforward controllers for active noise control
1792	--	1802	Maximo Cobos, Juan José Pérez Solano, Santiago Felici-Castell, Jaume Segura, Juan M. Navarro. Cumulative-sum-based localization of sound events in low-cost wireless acoustic sensor networks
1803	--	1814	Vladimir Tourbabin, Boaz Rafaely. Theoretical framework for the optimization of microphone array configuration for humanoid robot audition
1815	--	1824	Yuriy V. Zakharov, Vitor H. Nascimento. Sliding-window RLS low-cost implementation of proportionate affine projection algorithms
1825	--	1832	Stefano D'Angelo, Vesa Välimäki. Generalized Moog ladder filter: part I-linear analysis and parameterization
1833	--	1848	Na Yang, He Ba, Weiyang Cai, Ilker Demirkol, Wendi Heinzelman. BaNa: a noise resilient fundamental frequency detection algorithm for speech and music
1849	--	1858	Yuxuan Wang, Arun Narayanan, DeLiang Wang. On training targets for supervised speech separation
1859	--	1872	Ling-Hui Chen, Zhen-Hua Ling, Li-juan Liu, Li-Rong Dai. Voice conversion using deep neural networks with layer-wise generative training
1873	--	1883	Stefano D'Angelo, Vesa Välimäki. Generalized Moog ladder filter: part II-explicit nonlinear model through a novel delay-free loop implementation method
1884	--	1893	Zafar Rafii, Zhiyao Duan, Bryan Pardo. Combining rhythm-based and pitch-based methods for background and melody separation
1894	--	1904	Jussi Rämö, Vesa Välimäki, Balázs Bank. High-precision parallel graphic equalizer
1905	--	1917	Yannis Panagakis, Constantine Kotropoulos, Gonzalo R. Arce. Music genre classification via joint sparse low-rank representation of audio features
1918	--	1930	Akira Maezawa, Katsutoshi Itoyama, Kazuyoshi Yoshii, Hiroshi G. Okuno. Nonparametric Bayesian dereverberation of power spectrograms based on infinite-order autoregressive processes
1931	--	1940	Martin Krawczyk, Timo Gerkmann. STFT phase reconstruction in voiced speech for an improved single-channel speech enhancement
1941	--	1950	Vahid Khanagha, Khalid Daoudi, Hussein M. Yahia. Detection of glottal closure instants based on the microcanonical multiscale formalism
1951	--	1964	A. Venturini, L. Zão, R. Coelho. On speech features fusion, α-integration Gaussian modeling and multi-style training for noise robust speaker classification
1965	--	1977	Peter Foster, Matthias Mauch, Simon Dixon. Sequential complexity as a descriptor for musical similarity
1978	--	1992	Gang Liu, John H. L. Hansen. An investigation into back-end advancements for speaker recognition in multi-session and noisy enrollment scenarios
1993	--	2002	Jitong Chen, Yuxuan Wang, DeLiang Wang. A feature study for classification-based speech separation at low signal-to-noise ratios
2003	--	2011	Jelle Van Mourik, Damian Murphy. Explicit higher-order FDTD schemes for 3D room acoustic simulation
2012	--	2024	Pei Chee Yong, Sven Nordholm, Hai Huyen Dam. Effective binaural multi-channel processing algorithm for improved environmental presence
2025	--	2033	Austin Chen, Mark A. Hasegawa-Johnson. Mixed stereo audio classification using a stereo-input mixed-to-panned level feature
2034	--	2047	Gongping Huang, Jacob Benesty, Tao Long, Jingdong Chen. A family of maximum SNR filters for noise reduction
2048	--	2058	Su Yan, Xiaojun Wan. SRRank: leveraging semantic roles for extractive multi-document summarization
2059	--	2073	Hideyuki Tachibana, Nobutaka Ono, Hirokazu Kameoka, Shigeki Sagayama. Harmonic/percussive sound separation based on anisotropic smoothness of spectrograms
2074	--	2086	Jose Manuel Gil-Cacho, Toon van Waterschoot, Marc Moonen, Søren Holdt Jensen. A frequency-domain adaptive filter (FDAF) prediction error method (PEM) framework for double-talk-robust acoustic echo cancellation
2087	--	2100	Qi Wang, W. L. Woo, S. S. Dlay. Informed single-channel speech separation using HMM-GMM user-generated exemplar source
2101	--	2111	Daniel Erro, Tudor-Catalin Zorila, Yannis Stylianou. Enhancing the intelligibility of statistically generated synthetic speech by means of noise-independent modifications
2112	--	2121	Yi Jiang, DeLiang Wang, Runsheng Liu, Zhenming Feng. Binaural classification for reverberant speech segregation using deep neural networks
2122	--	2132	Li Su, Hsin-Ming Lin, Yi-Hsuan Yang. Sparse modeling of magnitude and phase-derived spectra for playing technique classification
2133	--	2145	Vinod Veera Reddy, Andy W. H. Khong, Boon Poh Ng. Unambiguous speech DOA estimation under spatial aliasing conditions
2146	--	2157	Amir Mohammadi, Seyyed Saeed Sarfjoo, Cenk Demiroglu. Eigenvoice speaker adaptation with minimal data for statistical speech synthesis systems using a MAP approach and nearest-neighbors
2158	--	2168	Kun Han, DeLiang Wang. Neural network based pitch tracking in very noisy speech
2169	--	2181	Yongsheng Mu, Peifeng Ji, Wei Ji, Ming Wu, Jun Yang. Modeling and compensation for the distortion of parametric loudspeakers using a one-dimension Volterra filter
2182	--	2196	Oliver Thiergart, Maja Taseska, Emanuel A. P. Habets. An informed parametric spatial filter based on instantaneous direction-of-arrival estimates
2197	--	2206	João Felipe Santos, Tiago H. Falk. Updating the SRMR-CI metric for improved intelligibility prediction for cochlear implant users
2207	--	2217	Seon-Man Kim, Hong Kook Kim. Direction-of-arrival based SNR estimation for dual-microphone speech enhancement
2218	--	2232	Takuma Otsuka, Katsuhiko Ishiguro, Takuya Yoshioka, Hiroshi Sawada, Hiroshi G. Okuno. Multichannel sound source dereverberation and separation for arbitrary number of sources based on Bayesian nonparametrics
2233	--	2243	Johannes Traa, Paris Smaragdis. Multichannel source separation and tracking with RANSAC and directional statistics
2244	--	2255	Weifeng Li, Longbiao Wang, Yicong Zhou, John Dines, Mathew Magimai-Doss, Hervé Bourlard, Qingmin Liao. Feature mapping of multiple beamformed sources for robust overlapping speech recognition using a microphone array
2256	--	0	Yi FanChiang, Cheng-Wen Wei, Yi-Le Meng, Yu-Wen Lin, Shyh-Jye Jou, Tian-Sheuan Chang. Correction to "Low complexity formant estimation adaptive feedback cancellation for hearing aids using pitch based processing"

1581	--	1589	Jian Xu, Zhi-Jie Yan, Qiang Huo. An Unsupervised Adaptation Approach to Leveraging Feedback Loop Data by Using i-Vector for Data Clustering and Selection
1590	--	1600	Sandro Cumani, Pietro Laface. Large-Scale Training of Pairwise Support Vector Machines for Speaker Recognition
1601	--	1611	Jun Du, Qiang Huo. An Improved VTS Feature Compensation using Mixture Models of Distortion and IVN Training for Noisy Speech Recognition
1612	--	1623	Masahito Togami, Yohei Kawaguchi. Simultaneous Optimization of Acoustic Echo Reduction, Speech Dereverberation, and Noise Reduction against Mutual Interference
1624	--	1635	Jorge Lorente, Miguel Ferrer, Maria de Diego, Alberto González. GPU Implementation of Multichannel Adaptive Algorithms for Local Active Noise Control
1636	--	1647	Thomas Helie. Simulation of Fractional-Order Low-Pass Filters
1648	--	1659	Bruno Defraene, Toon van Waterschoot, Moritz Diehl, Marc Moonen. Embedded-Optimization-Based Loudspeaker Precompensation Using a Hammerstein Loudspeaker Model
1660	--	1669	Guangsen Wang, Khe Chai Sim. Regression-Based Context-Dependent Modeling of Deep Neural Networks for Speech Recognition
1670	--	1680	Roland Badeau, Mark D. Plumbley. Multichannel High-Resolution NMF for Modeling Convolutive Mixtures of Non-Stationary Signals in the Time-Frequency Domain

1455	--	1466	Liheng Zhao, Jacob Benesty, Jingdong Chen. Design of Robust Differential Microphone Arrays
1467	--	1482	Pooja Jain, Ram Bilas Pachori. Event-Based Method for Instantaneous Fundamental Frequency Estimation from Voiced Speech Based on Eigenvalue Decomposition of the Hankel Matrix
1483	--	1493	Yonatan Vaizman, Brian McFee, Gert R. G. Lanckriet. Codebook-Based Audio Feature Representation for Music Information Retrieval
1494	--	1505	O. Nadiri, B. Rafaely. Localization of Multiple Speakers under High Reverberation using a Spherical Microphone Array and the Direct-Path Dominance Test
1506	--	1521	Zhizheng Wu, Tuomas Virtanen, Engsiong Chng, Haizhou Li. Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion
1522	--	1532	Dumidu S. Talagala, Wen Zhang 0002, Thushara D. Abhayapala. Efficient Multi-Channel Adaptive Room Compensation for Spatial Soundfield Reproduction Using a Modal Decomposition
1533	--	1545	Ossama Abdel Hamid, Abdel-rahman Mohamed, Hui Jiang 0001, Li Deng, Gerald Penn, Dong Yu. Convolutional Neural Networks for Speech Recognition
1546	--	1557	Shoichi Koyama, Ken'ichi Furuya, Yusuke Hiwasaki, Yoichi Haneda, Yôiti Suzuki. Wave Field Reconstruction Filtering in Cylindrical Harmonic Domain for With-Height Recording and Reproduction
1558	--	1570	Chia-Ping Chen, Yi-Chin Huang, Chung-Hsien Wu, Kuan-De Lee. Polyglot Speech Synthesis Based on Cross-Lingual Frame Selection Using Auditory and Articulatory Features

5	--	0	Li Deng, Steve Renals, Marcello Federico, Mari Ostendorf. Editorial: Expanding the Technical Reach of our Transactions
6	--	16	Jalal Taghia, Rainer Martin. Objective Intelligibility Measures Based on Mutual Information for Speech Subjected to Speech Enhancement Processing
17	--	27	Liang Lu, Arnab Ghoshal, Steve Renals. Cross-Lingual Subspace Gaussian Mixture Models for Low-Resource Speech Recognition
28	--	40	Milica Gasic, Steve Young. Gaussian Processes for POMDP-Based Dialogue Manager Optimization
41	--	53	Imen Marrakchi-Mezghani, Gaël Mahé, Sonia Djaziri Larbi, Meriem Jaïdane, Monia Turki-Hadj Alouane. Nonlinear Audio Systems Identification Through Audio Input Gaussianization
54	--	66	Joao B. Crespo, Richard C. Hendriks. Multizone Speech Reinforcement
67	--	79	Chao Pan, Jingdong Chen, Jacob Benesty. Performance Study of the MVDR Beamformer as a Function of the Source Incidence Angle
80	--	94	Hung-yi Lee, Lin-Shan Lee. Improved Semantic Retrieval of Spoken Content by Document/Query Expansion with Random Walk Over Acoustic Similarity Graphs
95	--	109	Volker Leutnant, Alexander Krueger, Reinhold Haeb-Umbach. A New Observation Model in the Logarithmic Mel Power Spectral Domain for the Automatic Recognition of Noisy Reverberant Speech
110	--	124	Nancy F. Chen, Sharon W. Tam, Wade Shen, Joseph P. Campbell. Characterizing Phonetic Transformations and Acoustic Differences Across English Dialects
125	--	137	Dejan Markovic, Konrad Kowalczyk, Fabio Antonacci, Christian Hofmann, Augusto Sarti, Walter Kellermann. Estimation of Acoustic Reflection Coefficients Through Pseudospectrum Matching
138	--	150	Zhiyao Duan, Jinyu Han, Bryan Pardo. Multi-pitch Streaming of Harmonic Sound Mixtures
151	--	160	Shilin Liu, Khe Chai Sim. Temporally Varying Weight Regression: A Semi-Parametric Trajectory Model for Automatic Speech Recognition
161	--	171	Vikrant Singh Tomar, Richard C. Rose. A Family of Discriminative Manifold Learning Algorithms and Their Application to Speech Recognition
172	--	183	Hironori Doi, Tomoki Toda, Keigo Nakamura, Hiroshi Saruwatari, Kiyohiro Shikano. Alaryngeal Speech Enhancement Based on One-to-Many Eigenvoice Conversion
184	--	192	Ebru Arisoy, Stanley F. Chen, Bhuvana Ramabhadran, Abhinav Sethy. Converting Neural Network Language Models into Back-off Language Models for Efficient Decoding in Automatic Speech Recognition
193	--	204	Craig T. Jin, Nicolas Epain, Abhaya Parthy. Design, Optimization and Evaluation of a Dual-Radius Spherical Microphone Array
205	--	216	Rémi Mignot, Gilles Chardon, Laurent Daudet. Low Frequency Interpolation of Room Impulse Responses Using Compressed Sensing
217	--	227	Mohammed Senoussaoui, Patrick Kenny, Themos Stafylakis, Pierre Dumouchel. A Study of the Cosine Distance-Based Mean Shift for Telephone Speech Diarization
228	--	237	Hideyuki Tachibana, Nobutaka Ono, Shigeki Sagayama. Singing Voice Enhancement in Monaural Music Signals Based on Two-stage Harmonic/Percussive Sound Separation on Multiple Resolution Spectrograms
238	--	247	Noam R. Shabtai, Boaz Rafaely. Generalized Spherical Array Beamforming for Binaural Speech Reproduction
248	--	259	Sandro Cumani, Pietro Laface. Factorized Sub-Space Estimation for Fast and Memory Effective I-vector Extraction
260	--	273	Yuan Zeng, Richard C. Hendriks. Distributed Delay and Sum Beamformer for Speech Enhancement via Randomized Gossip
274	--	286	Zhenghua Li, Min Zhang, Wanxiang Che, Ting Liu, Wenliang Chen. Joint Optimization for Chinese POS Tagging and Dependency Parsing

External Links

Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 22, Issue 9

Volume 22, Issue 8

Volume 22, Issue 7

Volume 22, Issue 6

Volume 22, Issue 5

Volume 22, Issue 4

Volume 22, Issue 3

Volume 22, Issue 2

Volume 22, Issue 12

Volume 22, Issue 11

Volume 22, Issue 10

Volume 22, Issue 1

External Links

Journal: IEEE Transactions on Audio, Speech &amp; Language Processing

Volume 22, Issue 9

Volume 22, Issue 8

Volume 22, Issue 7

Volume 22, Issue 6

Volume 22, Issue 5

Volume 22, Issue 4

Volume 22, Issue 3

Volume 22, Issue 2

Volume 22, Issue 12

Volume 22, Issue 11

Volume 22, Issue 10

Volume 22, Issue 1

Journal: IEEE Transactions on Audio, Speech & Language Processing