Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 20, Issue 9

2397 -- 2408Nikolaos Mitianoudis. A Generalized Directional Laplacian Distribution : Estimation, Mixture Models and Audio Source Separation
2409 -- 2419Janne Pylkkönen, Mikko Kurimo. Analysis of Extended Baum-Welch and Constrained Optimization for Discriminative Training of HMMs
2420 -- 2432Alex Southern, Damian T. Murphy, Lauri Savioja. Spatial Encoding of Finite Difference Time Domain Acoustic Models for Auralization
2433 -- 2447Marco Crocco, Andrea Trucco. Stochastic and Analytic Optimization of Sparse Aperiodic Arrays and Broadband Beamformers With Robust Superdirective Patterns
2448 -- 2460Masashi Okada, Takao Onoye, Wataru Kobayashi. A Ray Tracing Simulation of Sound Diffraction Based on the Analytic Secondary Source Model
2461 -- 2469Ryouichi Nishimura. Audio Watermarking Using Spatial Masking and Ambisonics
2470 -- 2481Flávio R. Avila, Luiz W. P. Biscainho. Bayesian Restoration of Audio Signals Degraded by Impulsive Noise Modeled as Individual Pulses
2482 -- 2491Woojay Jeon, Changxue Ma, Dusan Macho. Statistical Utterance Comparison for Speaker Clustering Using Factor Analysis
2492 -- 2504Justin Jian Zhang, Pascale Fung. Automatic Parliamentary Meeting Minute Generation Using Rhetorical Structure Modeling
2505 -- 2517Tomoki Toda, Mikihiro Nakagiri, Kiyohiro Shikano. Statistical Voice Conversion Techniques for Body-Conducted Unvoiced Speech Enhancement
2518 -- 2527Arun Narayanan, DeLiang Wang. A CASA-Based System for Long-Term SNR Estimation
2528 -- 2538Ronen Talmon, Israel Cohen, Sharon Gannot, Ronald R. Coifman. Supervised Graph-Based Processing for Sequential Transient Interference Suppression
2539 -- 2548Andre Holzapfel, Matthew E. P. Davies, José R. Zapata, João Lobato Oliveira, Fabien Gouyon. Selective Sampling for Beat Tracking Evaluation
2549 -- 2563Meng Guo, Søren Holdt Jensen, Jesper Jensen. Novel Acoustic Feedback Cancellation Approaches in Hearing Aid Applications Using Probe Noise and Probe Noise Enhancement
2564 -- 2574Jens Ahrens, Sascha Spors. A Modal Analysis of Spatial Discretization of Spherical Loudspeaker Distributions Used for Sound Field Synthesis
2575 -- 2585V. Tourbabin, Morag Agmon, Boaz Rafaely, Joseph Tabrikian. Optimal Real-Weighted Beamforming With Application to Linear and Spherical Arrays
2586 -- 2601Pejman Mowlaee, Rahim Saeidi, Mads Græsbøll Christensen, Zheng-Hua Tan, Tomi Kinnunen, Pasi Fränti, Søren Holdt Jensen. A Joint Approach for Single-Channel Speaker Identification and Speech Separation
2602 -- 2612Berlin Chen, Kuan-Yu Chen, Pei-Ning Chen, Yi-Wen Chen. Spoken Document Retrieval With Unsupervised Query Modeling Techniques
2613 -- 2617Kruthiventi S. S. Srinivas, Kishore Prahallad. An FIR Implementation of Zero Frequency Filtering of Speech Signals

Volume 20, Issue 8

2181 -- 2190Leonardo O. Nunes, Flávio R. Avila, Alan Freihof Tygel, Luiz W. P. Biscainho, Bowon Lee, Amir Said, Ronald W. Schafer. A Parametric Objective Quality Assessment Tool for Speech Signals Degraded by Acoustic Echo
2191 -- 2206Yong Zhao 0008, Biing-Hwang Juang. Nonlinear Compensation Using the Gauss-Newton Method for Noise-Robust Speech Recognition
2207 -- 2218Brian McFee, Luke Barrington, Gert R. G. Lanckriet. Learning Content Similarity for Music Recommendation
2219 -- 2231Hannu Pulakka, Ulpu Remes, Santeri Yrttiaho, Kalle J. Palomäki, Mikko Kurimo, Paavo Alku. Bandwidth Extension of Telephone Speech to Low Frequencies Using Sinusoidal Synthesis and a Gaussian Mixture Model
2232 -- 2239Iynkaran Natgunanathan, Yong Xiang, Yue Rong, Wanlei Zhou, Song Guo. Robust Patchwork-Based Embedding and Decoding Scheme for Digital Audio Watermarking
2240 -- 2251Yotaro Kubo, Shinji Watanabe, Takaaki Hori, Atsushi Nakamura. Structural Classification Methods Based on Weighted Finite-State Transducers for Automatic Speech Recognition
2252 -- 2264Xiaodong Cui, Jian Xue, Xin Chen, Peder A. Olsen, Pierre L. Dognin, Upendra V. Chaudhari, John R. Hershey, Bowen Zhou. Hidden Markov Acoustic Modeling With Bootstrap and Restructuring for Low-Resourced Languages
2265 -- 2279Amit Das, John H. L. Hansen. Phoneme Selective Speech Enhancement Using Parametric Estimators and the Mixture Maximum Model: A Unifying Approach
2280 -- 2290Phillip L. De Leon, Michael Pucher, Junichi Yamagishi, Inma Hernáez, Ibon Saratxaga. Evaluation of Speaker Verification Security and Detection of HMM-Based Synthetic Speech
2291 -- 2300Wei-Ho Tsai, Hsin-Chieh Lee. Singer Identification Based on Spoken Data in Voice Characterization
2301 -- 2312Daniel Felps, Christian Geng, Ricardo Gutierrez-Osuna. Foreign Accent Conversion Through Concatenative Synthesis in the Articulatory Domain
2313 -- 2328Gustavo Reis, Francisco Fernández de Vega, Aníbal Ferreira. Automatic Transcription of Polyphonic Piano Music Using Genetic Algorithms, Adaptive Spectral Envelope Modeling, and Dynamic Noise Level Estimation
2329 -- 2340Soroosh Mariooryad, Carlos Busso. Generating Human-Like Behaviors Using Joint, Speech-Driven Models for Conversational Agents
2341 -- 2351Hasim Sak, Murat Saraclar, Tunga Gungor. Morpholexical and Discriminative Language Models for Turkish Automatic Speech Recognition
2352 -- 2364Yongwon Jeong. Adaptation of Hidden Markov Models Using Model-as-Matrix Representation
2365 -- 2377Seyedmahdad Mirsamadi, Shabnam Ghaffarzadegan, Hamid Sheikhzadeh, Seyed Mohammad Ahadi, Amir Hossein Rezaie. Efficient Frequency Domain Implementation of Noncausal Multichannel Blind Deconvolution for Convolutive Mixtures of Speech
2378 -- 2387Barry-John Theobald, Iain Matthews. Relating Objective and Subjective Performance Measures for AAM-Based Visual Speech Synthesis
2388 -- 2392Terence Betlehem, Christopher Withers. Sound Field Reproduction With Energy Constraint on Loudspeaker Weights

Volume 20, Issue 7

1913 -- 1922Theodoros Giannakopoulos, Sergios Petridis. Fisher Linear Semi-Discriminant Analysis for Speaker Diarization
1923 -- 1935Xiaodong Cui, Jing Huang, Jen-Tzung Chien. Multi-View and Multi-Objective Semi-Supervised Learning for HMM-Based Automatic Speech Recognition
1936 -- 1947Jacob L. Newman, Stephen J. Cox. Language Identification Using Visual Features
1948 -- 1963Jesper Rindom Jensen, Jacob Benesty, Mads Græsbøll Christensen, Søren Holdt Jensen. Enhancement of Single-Channel Periodic Signals in the Time-Domain
1964 -- 1975Marco Compagnoni, Paolo Bestagini, Fabio Antonacci, Augusto Sarti, Stefano Tubaro. Localization of Acoustic Sources Through the Fitting of Propagation Cones Using Multiple Independent Arrays
1976 -- 1989Jung-Woo Choi, Yang-Hann Kim. Integral Approach for Reproduction of Virtual Sound Source Surrounded by Loudspeaker Array
1990 -- 2001Tomi Kinnunen, Rahim Saeidi, Filip Sedlak, Kong-Aik Lee, Johan Sandberg, Maria Hansson-Sandsten, Haizhou Li. Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification
2002 -- 2015Wen-Lin Zhang, Weiqiang Zhang, Bi-Cheng Li, Dan Qu, Michael T. Johnson. Bayesian Speaker Adaptation Based on a New Hierarchical Probabilistic Model
2016 -- 2030Tobias May, Steven van de Par, Armin Kohlrausch. A Binaural Scene Analyzer for Joint Localization and Recognition of Speakers in the Presence of Interfering Noise Sources and Reverberation
2031 -- 2044Armando Muscariello, Guillaume Gravier, Frédéric Bimbot. Unsupervised Motif Acquisition in Speech via Seeded Discovery and Template Matching Combination
2045 -- 2058César González Ferreras, David Escudero Mancebo, Carlos Vivaracho-Pascual, Valentín Cardeñoso-Payo. Improving Automatic Classification of Prosodic Events by Pairwise Coupling
2059 -- 2064Maximo Cobos, José J. López. Maximum a Posteriori Binary Mask Estimation for Underdetermined Source Separation Using Smoothed Posteriors
2065 -- 2079Sarmad Malik, Gerald Enzner. State-Space Frequency-Domain Adaptive Filtering for Nonlinear Acoustic Echo Cancellation
2080 -- 2094Ryoichi Miyazaki, Hiroshi Saruwatari, Takayuki Inoue, Yu Takahashi, Kiyohiro Shikano, Kazunobu Kondo. Musical-Noise-Free Speech Enhancement Based on Optimized Iterative Spectral Subtraction
2095 -- 2110Hung-yi Lee, Chia-Ping Chen, Lin-Shan Lee. Integrating Recognition and Retrieval With Relevance Feedback for Spoken Term Detection
2111 -- 2122Mohamed I. Alkanhal, Mohammed A. Al-Badrashiny, Mansour M. Alghamdi, Abdulaziz O. Al-Qabbany. Automatic Stochastic Arabic Spelling Correction With Emphasis on Space Insertions and Deletions
2123 -- 2133Stephen J. Elliott, Jordan Cheer, Jung-Woo Choi, Youngtae Kim. Robustness and Regularization of Personal Audio Systems
2134 -- 2148Lakshmi Saheer, John Dines, Philip N. Garner. Vocal Tract Length Normalization for Statistical Parametric Speech Synthesis
2149 -- 2158Yongqiang Wang, M. J. F. Gales. Speaker and Noise Factorization for Robust Speech Recognition

Volume 20, Issue 6

1669 -- 1684Sin-Horng Chen, Jyh-Her Yang, Chen-Yu Chiang, Ming-Chieh Liu, Yih-Ru Wang. A New Prosody-Assisted Mandarin ASR System
1685 -- 1697Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen. A Zone-of-Quiet Based Approach to Integrated Active Noise Control and Noise Reduction for Speech Enhancement in Hearing Aids
1698 -- 1712Christian D. Sigg, Tomas Dikk, Joachim M. Buhmann. Speech Enhancement Using Generative Dictionary Learning
1713 -- 1724Heiga Zen, Norbert Braunschweiler, Sabine Buchholz, Mark J. F. Gales, Kate Knill, Sacha Krstulovic, Javier Latorre. Statistical Parametric Speech Synthesis Based on Speaker and Language Factorization
1725 -- 1733Christian Schüldt, Fredric Lindström, Ingvar Claesson. A Delay-Based Double-Talk Detector
1734 -- 1745Alastair J. Manders, David M. Simpson 0001, Steven L. Bell. Objective Prediction of the Sound Quality of Music Processed by an Adaptive Feedback Canceller
1746 -- 1758Shoichi Koyama, Ken'ichi Furuya, Yusuke Hiwasaki, Yoichi Haneda. Reproducing Virtual Sound Sources in Front of a Loudspeaker Array Using Inverse Wave Propagator
1759 -- 1770Justin Salamon, Emilia Gómez. Melody Extraction From Polyphonic Music Signals Using Pitch Contour Characteristics
1771 -- 1783Yizhao Ni, Matt McVicar, Raúl Santos-Rodriguez, Tijl De Bie. An End-to-End Machine Learning System for Harmonic Analysis of Music
1784 -- 1794Daisuke Saito, Shinji Watanabe, Atsushi Nakamura, Nobuaki Minematsu. Statistical Voice Conversion Based on Noisy Channel Model
1795 -- 1807Daniel Angus, Andrew E. Smith, Janet Wiles. Human Communication as Coupled Time Series: Quantifying Multi-Participant Recurrence
1808 -- 1817Claire Masterson, Gavin Kearney, Marcin Gorzel, Francis M. Boland. HRIR Order Reduction Using Approximate Factorization
1818 -- 1828Jan Vanek, Jan Trmal, Josef V. Psutka, Josef Psutka. Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors
1829 -- 1842Jan Ole Jungmann, Radoslaw Mazur, Markus Kallinger, Tiemin Mei, Alfred Mertins. Combined Acoustic MIMO Channel Crosstalk Cancellation and Room Impulse Response Reshaping
1843 -- 1856Tianyu T. Wang, Thomas F. Quatieri. Two-Dimensional Speech-Signal Modeling
1857 -- 1868Isabel Barbancho, Lorenzo J. Tardón, Simone Sammartino, Ana M. Barbancho. Inharmonicity-Based Method for the Automatic Generation of Guitar Tablature
1869 -- 1883Amit Das, John H. L. Hansen. Constrained Iterative Speech Enhancement Using Phonetic Classes
1884 -- 1893Abbas Keshavarz, Saeed Mosayyebpour, Mehrzad Biguesh, T. Aaron Gulliver, Morteza Esmaeili. Speech-Model Based Accurate Blind Reverberation Time Estimation Using an LPC Filter
1894 -- 1903Anil Kumar Vuppala, Jainath Yadav, Saswat Chakrabarti, K. Sreenivasa Rao. Vowel Onset Point Detection for Low Bit Rate Coded Speech

Volume 20, Issue 5

1421 -- 1448Vesa Välimäki, Julian D. Parker, Lauri Savioja, Julius O. Smith, Jonathan S. Abel. Fifty Years of Artificial Reverberation
1449 -- 1460Flavio P. Ribeiro, Dinei A. F. Florêncio, Demba E. Ba, Cha Zhang. Geometrically Constrained Room Modeling With Compact Microphone Arrays
1461 -- 1472Wenliang Chen, Jun'ichi Kazama, Min Zhang, Yoshimasa Tsuruoka, Yujie Zhang, Yiou Wang, Kentaro Torisawa, Haizhou Li. Bitext Dependency Parsing With Auto-Generated Bilingual Treebank
1473 -- 1481K. Lakhdhar, R. Lefebvre. Context-Based Adaptive Arithmetic Encoding of EAVQ Indices
1482 -- 1491Chao-Ling Hsu, DeLiang Wang, Jyh-Shing Roger Jang, Ke Hu. A Tandem Algorithm for Singing Pitch Extraction and Voice Separation From Music Accompaniment
1492 -- 1502Zhen-Hua Ling, Li-Rong Dai. Minimum Kullback-Leibler Divergence Parameter Generation for HMM-Based Speech Synthesis
1503 -- 1512John Woodruff, DeLiang Wang. Binaural Localization of Multiple Sources in Reverberant and Noisy Environments
1513 -- 1525Welly Naptali, Masatoshi Tsuchiya, Seiichi Nakagawa. Topic-Dependent-Class-Based $n$-Gram Language Model
1526 -- 1541Jesper Rindom Jensen, Jacob Benesty, Mads Græsbøll Christensen, Søren Holdt Jensen. Non-Causal Time-Domain Filters for Single-Channel Noise Reduction
1542 -- 1552Kamil Adilolu, Robert Anniés, Elio Wahlen, Hendrik Purwins, Klaus Obermayer. A Graphical Representation and Dissimilarity Measure for Basic Everyday Sound Events
1553 -- 1564Cees H. Taal, Richard C. Hendriks, Richard Heusdens. A Low-Complexity Spectro-Temporal Distortion Measure for Audio Processing Applications
1565 -- 1572Huawei Chen, Wee Ser, Jianjiang Zhou. Robust Nearfield Wideband Beamformer Design Using Worst Case Mean Performance Optimization With Passband Response Variance Constraint
1573 -- 1584D. Rama Sanand, Srinivasan Umesh. VTLN Using Analytically Determined Linear-Transformation on Conventional MFCC
1585 -- 1596Sandro Cumani, Pietro Laface. Analysis of Large-Scale SVM Training Algorithms for Language and Speaker Recognition
1597 -- 1607Xiaoyan Cai, Wenjie Li. Mutually Reinforced Manifold-Ranking Based Relevance Propagation Model for Query-Focused Multi-Document Summarization
1608 -- 1616Xiaojia Zhao, Yang Shao, DeLiang Wang. CASA-Based Robust Speaker Identification
1617 -- 1632Saeed Mosayyebpour, Hamid Sheikhzadeh, T. Aaron Gulliver, Morteza Esmaeili. Single-Microphone LP Residual Skewness-Based Inverse Filtering of the Room Impulse Response
1633 -- 1643Upendra V. Chaudhari, Michael Picheny. Matching Criteria for Vocabulary-Independent Search
1644 -- 1657Daniele Giacobello, Mads Græsbøll Christensen, Manohar N. Murthi, Søren Holdt Jensen, Marc Moonen. Sparse Linear Prediction and Its Applications to Speech Processing
1658 -- 1663Stefan Bilbao. Optimized FDTD Schemes for 3-D Acoustic Wave Propagation

Volume 20, Issue 4

1085 -- 1095S. Nakagawa, L. Wang, S. Ohtsuka. Speaker Identification and Verification by Combining MFCC and Phase Information
1096 -- 1108Riccardo Miotto, Gert R. G. Lanckriet. A Generative Context Model for Semantic Music Annotation and Retrieval
1109 -- 1117C. C. Lin, R. T.-H. Tsai. A Generative Data Augmentation Model for Enhancing Chinese Dialect Pronunciation Prediction
1118 -- 1133Alexey Ozerov, Emmanuel Vincent, Frédéric Bimbot. A General Flexible Framework for the Handling of Prior Information in Audio Source Separation
1134 -- 1144Jia-Min Ren, Jyh-Shing Roger Jang. Discovering Time-Constrained Sequential Patterns for Music Genre Classification
1145 -- 1157Virginia Estellers, Mihai Gurban, Jean-Philippe Thiran. On Dynamic Stream Weighting for Audio-Visual Speech Recognition
1158 -- 1166Navin Chatlani, John J. Soraghan. EMD-Based Filtering (EMDF) of Low-Frequency Noise for Speech Enhancement
1167 -- 1176Haiyan Shu, Haibin Huang, Susanto Rahardja. Analysis of Bit-Plane Probability for Generalized Gaussian Distribution and its Application in Audio Coding
1177 -- 1188Tobias Rosenkranz, Henning Puder. Improving Robustness of Codebook-Based Noise Estimation Approaches With Delta Codebooks
1189 -- 1195Ines Hafizovic, Carl-Inge Colombo Nilsen, Sverre Holm. Transformation Between Uniform Linear and Spherical Microphone Arrays With Symmetric Responses
1196 -- 1206Xiaohong Yang, Yufang Yang. Prosodic Realization of Rhetorical Structure in Chinese Discourse
1207 -- 1216David T. Yeh. Automated Physical Modeling of Nonlinear Audio Circuits for Real-Time Audio Effects - Part II: BJT and Vacuum Tube Examples
1217 -- 1232Manish Narwaria, Weisi Lin, Ian Vince McLoughlin, Sabu Emmanuel, Liang-Tien Chia. Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression
1233 -- 1243Wei-Ho Tsai, Hsin-Chieh Lee. Automatic Evaluation of Karaoke Singing Based on Pitch, Volume, and Rhythm Features
1244 -- 1255Takanobu Oba, Takaaki Hori, Atsushi Nakamura, Akinori Ito. Round-Robin Duel Discriminative Language Models
1256 -- 1269Yiteng Arden Huang, Jacob Benesty. A Multi-Frame Approach to the Frequency-Domain Single-Channel Noise Reduction Problem
1270 -- 1281Miroslav Zivanovic, Johan Schoukens. Single and Piecewise Polynomials for Modeling of Pitched Sounds
1282 -- 1296Yaakov Bucris, Israel Cohen, Miriam A. Doron. Bayesian Focusing for Coherent Wideband Beamforming
1297 -- 1312Hélène Papadopoulos, Geoffroy Peeters. Local Key Estimation From an Audio Signal Relying on Harmonic and Metrical Structures
1313 -- 1323Elizabeth Godoy, Olivier Rosec, Thierry Chonavel. Voice Conversion Using Dynamic Frequency Warping With Amplitude Scaling, for Parallel or Nonparallel Corpora
1324 -- 1336Ruofei Chen, Cheung-fat Chan, Hing-Cheung So. Model-Based Speech Enhancement With Improved Spectral Envelope Estimation via Dynamics Tracking
1337 -- 1346Qun Feng Tan, Shrikanth S. Narayanan. Novel Variations of Group Sparse Regularization Techniques With Applications to Noise Robust Automatic Speech Recognition
1347 -- 1361Rubén Solera-Ureña, Ana I. García-Moral, Carmen Peláez-Moreno, Manel Martínez-Ramón, Fernando Díaz-de-María. Real-Time Robust Automatic Speech Recognition Using Compact Support Vector Machines
1362 -- 1371Amin Fazel, Shantanu Chakrabartty. Sparse Auditory Reproducing Kernel (SPARK) Features for Noise-Robust Speech Recognition
1372 -- 1382Jorge I. Marin-Hurtado, Devangi N. Parikh, David V. Anderson. Perceptually Inspired Noise-Reduction Method for Binaural Hearing Aids
1383 -- 1393Timo Gerkmann, Richard C. Hendriks. Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay
1394 -- 1399Haiquan Zhao, Xiangping Zeng, Xiaoqiang Zhang, Zhengyou He, Tian-rui Li, Weidong Jin. Adaptive Extended Pipelined Second-Order Volterra Filter for Nonlinear Active Noise Controller
1400 -- 1408Damián Marelli, Mitsuko Aramaki, Richard Kronland-Martinet, Charles Verron. An Efficient Time-Frequency Method for Synthesizing Noisy Sounds With Short Transients and Narrow Spectral Components
1409 -- 1415Maurice F. Fallon, Simon J. Godsill. Acoustic Source Localization and Tracking of a Time-Varying Number of Speakers

Volume 20, Issue 3

717 -- 730Kazuyoshi Yoshii, Masataka Goto. A Nonparametric Bayesian Multipitch Analyzer Based on Infinite Latent Harmonic Allocation
731 -- 741Siddika Parlak, Murat Saraclar. Performance Analysis and Improvement of Turkish Broadcast News Retrieval
742 -- 754Haohai Sun, Shefeng Yan, U. Peter Svensson. Optimal Higher Order Ambisonics Encoding With Predefined Constraints
755 -- 766Mitchell McLaren, David A. van Leeuwen. Source-Normalized LDA for Robust Speaker Recognition Using i-Vectors From Multiple Speech Sources
767 -- 779Elias K. Kokkinis, Joshua D. Reiss, John Mourjopoulos. A Wiener Filter Approach to Microphone Leakage Reduction in Close-Microphone Applications
780 -- 793Qiang Fu, Yong Zhao 0008, Biing-Hwang Juang. Automatic Speech Recognition Based on Non-Uniform Error Criteria
794 -- 805Heiga Zen, Mark J. F. Gales, Yoshihiko Nankaku, Keiichi Tokuda. Product of Experts for Statistical Parametric Speech Synthesis
806 -- 817Elina Helander, Hanna Silén, Tuomas Virtanen, Moncef Gabbouj. Voice Conversion Using Dynamic Kernel Partial Least Squares Regression
818 -- 827Ning Ma, Jon Barker, Heidi Christensen, Phil Green. Combining Speech Fragment Decoding and Adaptive Noise Floor Modeling
828 -- 843Liang-Che Sun, Lin-Shan Lee. Modulation Spectrum Equalization for Improved Robust Speech Recognition
844 -- 853Matija Marolt. Automatic Transcription of Bell Chiming Recordings
854 -- 867Emanuël Anco Peter Habets, Jacob Benesty, Patrick A. Naylor. A Speech Distortion and Interference Rejection Constraint Beamformer
868 -- 874Yousheng Chen, Qin Gong. A Normalized Beamforming Algorithm for Broadband Speech Using a Continuous Interleaved Sampling Strategy
875 -- 887Sabato Marco Siniscalchi, Dau-Cheng Lyu, Torbjørn Svendsen, Chin-Hui Lee. Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data
888 -- 899Xiang Lin, Andy W. H. Khong, Patrick A. Naylor. A Forced Spectral Diversity Algorithm for Speech Dereverberation in the Presence of Near-Common Zeros
900 -- 914Yu-Hsiang Bosco Chiu, Bhiksha Raj, Richard M. Stern. Learning-Based Auditory Encoding for Robust Speech Recognition
915 -- 921Ana M. Barbancho, Anssi Klapuri, Lorenzo J. Tardón, Isabel Barbancho. Automatic Transcription of Guitar Chords and Fingering From Audio
922 -- 932Amir Adler, Valentin Emiya, Maria G. Jafari, Michael Elad, Rémi Gribonval, Mark D. Plumbley. Audio Inpainting
933 -- 944Wei Chu, Abeer Alwan. SAFE: A Statistical Approach to F0 Estimation Under Clean and Noisy Conditions
945 -- 953Ashish Panda, Thambipillai Srikanthan. Psychoacoustic Model Compensation for Robust Speaker Verification in Environmental Noise
947 -- 960Emanuel A. P. Habets, Jacob Benesty. A Perspective on Frequency-Domain Beamformers in Room Acoustics
968 -- 981Thomas Drugman, Thierry Dutoit. The Deterministic Plus Stochastic Model of the Residual Signal and Its Applications
982 -- 993Shing-Chow Chan, Y. Chu. Performance Analysis and Design of FxLMS Algorithm in Broadband ANC System With Online Secondary-Path Modeling
994 -- 1006Thomas Drugman, Mark R. P. Thomas, Jon Gudnason, Patrick A. Naylor, Thierry Dutoit. Detection of Glottal Closure Instants From Speech Signals: A Quantitative Review
1007 -- 1021Alfonso Perez Carrillo, Jordi Bonada, Esteban Maestre, Enric Guaus, Merlijn Blaauw. Performance Control Driven Violin Timbre Model Based on Neural Networks
1022 -- 1031Ravi K. Chivukula, Yuriy A. Reznik, Venkat Devarajan, Mythreya Jayendra-Lakshman. Fast Algorithms for Low-Delay SBR Filterbanks in MPEG-4 AAC-ELD
1032 -- 1042Xianyu Zhao, Yuan Dong. Variational Bayesian Joint Factor Analysis Models for Speaker Verification
1043 -- 1055Ashutosh Pandey, V. John Mathews. Adaptive Gain Processing With Offending Frequency Suppression for Digital Hearing Aids
1056 -- 1068Tamar Shoham, David Malah, Slava Shechtman. Quality Preserving Compression of a Concatenative Text-To-Speech Acoustic Database
1069 -- 1073Vladimir Despotovic, Norbert Goertz, Zoran Peric. Nonlinear Long-Term Prediction of Speech Based on Truncated Volterra Series
1074 -- 1080Siow Yong Low, Svetha Venkatesh, Sven Nordholm. A Spectral Slit Approach to Doubletalk Detection

Volume 20, Issue 2

356 -- 370Xavier Anguera Miró, Simon Bozonnet, Nicholas W. D. Evans, Corinne Fredouille, Gerald Friedland, Oriol Vinyals. Speaker Diarization: A Review of Recent Research
371 -- 381Gerald Friedland, Adam Janin, David Imseng, Xavier Anguera Miró, Luke R. Gottlieb, Marijn Huijbregts, Mary Tai Knox, Oriol Vinyals. The ICSI RT-09 Speaker Diarization System
382 -- 392Nicholas W. D. Evans, Simon Bozonnet, Dong Wang, Corinne Fredouille, Raphaël Troncy. A Comparative Study of Bottom-Up and Top-Down Approaches to Speaker Diarization
393 -- 403Marijn Huijbregts, David A. van Leeuwen, Chuck Wooters. Speaker Diarization Error Analysis Using Oracle Components
404 -- 413Marijn Huijbregts, David A. van Leeuwen. Large-Scale Speaker Diarization for Long Recordings and Small Collections
414 -- 425Oshry Ben-Harush, Itshak Lapidot, Hugo Guterman. Initialization of Iterative-Based Speaker Diarization Systems for Telephone Conversations
426 -- 435José Manuel Pardo, Roberto Barra-Chicote, Rubén San Segundo, Ricardo de Córdoba, Beatriz Martínez-González. Speaker Diarization Features: The UPM Contribution to the RT09 Evaluation
436 -- 446Martin Zelenák, Carlos Segura, Jordi Luque, Javier Hernando. Simultaneous Speech Detection With Spatial Features for Speaker Diarization
447 -- 460Katsuhiko Ishiguro, Takeshi Yamada, Shoko Araki, Tomohiro Nakatani, Hiroshi Sawada. Probabilistic Speaker Diarization With Bag-of-Words Representations of Speaker Angle Information
461 -- 473Tin Lay Nwe, Hanwu Sun, Bin Ma, Haizhou Li. Speaker Clustering and Cluster Purification Methods for RT07 and RT09 Evaluation Meeting Data
474 -- 485Fernando Batista, Helena Moniz, Isabel Trancoso, Nuno J. Mamede. Bilingual Experiments on Automatic Recovery of Capitalization and Punctuation of Automatic Speech Transcripts
486 -- 498Thomas Hain, Lukás Burget, John Dines, Philip N. Garner, Frantisek Grézl, Asmaa El Hannani, Marijn Huijbregts, Martin Karafiát, Mike Lincoln, Vincent Wan. Transcribing Meetings With the AMIDA Systems
499 -- 513Takaaki Hori, Shoko Araki, Takuya Yoshioka, Masakiyo Fujimoto, Shinji Watanabe, Takanobu Oba, Atsunori Ogawa, Kazuhiro Otsuka, Dan Mikami, Keisuke Kinoshita, Tomohiro Nakatani, Atsushi Nakamura, Junji Yamato. Low-Latency Real-Time Meeting Recognition and Understanding Using Distant Microphones and Omni-Directional Camera
514 -- 525Joan Serrà, Holger Kantz, Xavier Serra, Ralph G. Andrzejak. Predictability of Music Descriptor Time Series and its Application to Cover Song Detection
526 -- 539Marco Dinarelli, Alessandro Moschitti, Giuseppe Riccardi. Discriminative Reranking for Spoken Language Understanding
540 -- 550Ebru Arisoy, Murat Saraclar, Brian Roark, Izhak Shafran. Discriminative Language Modeling With Linguistic and Statistically Derived Features
551 -- 564Björn Hoffmeister, Georg Heigold, David Rybach, Ralf Schlüter, Hermann Ney. WFST Enabled Solutions to ASR Problems: Beyond HMM Decoding
565 -- 574Alberto Sanchís, Alfons Juan, Enrique Vidal. A Word-Based Naïve Bayes Classifier for Confidence Estimation in Speech Recognition
575 -- 584Wen Zhang 0002, Mengqiu Zhang, Rodney A. Kennedy, Thushara D. Abhayapala. On High-Resolution Head-Related Transfer Function Measurements: An Efficient Sampling Scheme
585 -- 598Sungrack Yun, Chang D. Yoo. Loss-Scaled Large-Margin Gaussian Mixture Models for Speech Emotion Classification
599 -- 609Nima Yousefian, Philipos C. Loizou. A Dual-Microphone Speech Enhancement Algorithm Based on the Coherence Function
610 -- 619Laura E. Boucheron, Phillip L. De Leon, Steven Sandoval. Low Bit-Rate Speech Coding Through Quantization of Mel-Frequency Cepstral Coefficients
620 -- 631Nam Soo Kim, Tae Gyoon Kang, Shin Jae Kang, Chang Woo Han, Doo Hwa Hong. Speech Feature Mapping Based on Switching Linear Dynamic System
632 -- 645Yi-Cheng Pan, Hung-yi Lee, Lin-Shan Lee. Interactive Spoken Document Retrieval With Suggested Key Terms Ranked by a Markov Decision Process
646 -- 660Jake Gunther. Learning Echo Paths During Continuous Double-Talk Using Semi-Blind Source Separation
661 -- 675Meng Yu, Wenye Ma, Jack Xin, Stanley Osher. 1 Regularized Convex Speech Enhancement Model and Fast Computation by the Split Bregman Method
676 -- 689Hüseyin Hacihabiboglu, Zoran Cvetkovic. Multichannel Dereverberation Theorems and Robustness Issues
690 -- 698Laura Romoli, Stefania Cecchi, Paolo Peretti, Francesco Piazza. A Mixed Decorrelation Approach for Stereo Acoustic Echo Cancellation Based on the Estimation of the Fundamental Frequency
699 -- 704Jacob Benesty, Mehrez Souden, Yiteng Huang. A Perspective on Differential Microphone Arrays in the Context of Noise Reduction
705 -- 708Frédéric Mustière, Martin Bouchard, Miodrag Bolic. All-Pole Modeling of Discrete Spectral Powers: A Unified Approach
709 -- 0Takayuki Arai, Nao Hodoshima, Keiichi Yasu. Errata to "Using Steady-State Suppression to Improve Speech Intelligibility in Reverberant Environments for Elderly Listeners"

Volume 20, Issue 10

2625 -- 0Mari Ostendorf. A Message from the Vice President of Publications on New Developments in Signal Processing Society Publications
2626 -- 2636Sundar Harshavardhan, Chandra Sekhar Seelamantula, Thippur V. Sreenivas. A Mixture Model Approach for Formant Tracking and the Robustness of Student's-t Distribution
2637 -- 2647Steven Hargreaves, Anssi Klapuri, Mark Sandler. Structural Segmentation of Multitrack Audio
2648 -- 2656Matthew Gibson, Thomas Hain. Correctness-Adjusted Unsupervised Discriminative Acoustic Model Adaptation
2657 -- 2671Bruno Defraene, Toon van Waterschoot, Hans Joachim Ferreau, Moritz Diehl, Marc Moonen. Real-Time Perception-Based Clipping of Audio Signals Using Convex Optimization
2672 -- 2682G. Ananthakrishnan, Olov Engwall, Daniel Neiberg. Exploring the Predictability of Non-Unique Acoustic-to-Articulatory Mappings
2683 -- 2695Fabio Antonacci, Jason Filos, Mark R. P. Thomas, Emanuël Anco Peter Habets, Augusto Sarti, Patrick A. Naylor, Stefano Tubaro. Inference of Room Geometry From Acoustic Impulse Responses
2696 -- 2706João Lobato Oliveira, Matthew E. P. Davies, Fabien Gouyon, Luís Paulo Reis. Beat Tracking for Multiple Applications: A Multi-Agent System Architecture With State Recovery
2707 -- 2720Takuya Yoshioka, Tomohiro Nakatani. Generalization of Multi-Channel Linear Prediction Methods for Blind MIMO Impulse Response Shortening

Volume 20, Issue 1

1 -- 0Helen Meng. Farewell Editorial
2 -- 3Li Deng. Inaugural Editorial: Riding the Tidal Wave of Human-Centric Information Processing - Innovate, Outreach, Collaborate, Connect, Expand, and Win
4 -- 6Dong Yu, Geoffrey E. Hinton, Nelson Morgan, Jen-Tzung Chien, Shigeki Sagayama. Introduction to the Special Section on Deep Learning for Speech and Language Processing
7 -- 13Nelson Morgan. Deep and Wide: Multiple Layers in Automatic Speech Recognition
14 -- 22Abdel-rahman Mohamed, George E. Dahl, Geoffrey E. Hinton. Acoustic Modeling Using Deep Belief Networks
23 -- 29Garimella S. V. S. Sivaram, Hynek Hermansky. Sparse Multilayer Perceptron for Phoneme Recognition
30 -- 42George E. Dahl, Dong Yu, Li Deng, Alex Acero. Context-Dependent Pre-Trained Deep Neural Networks for Large-Vocabulary Speech Recognition
43 -- 54George Saon, Jen-Tzung Chien. Bayesian Sensing Hidden Markov Models
55 -- 66Jen-Tzung Chien, Chuang-Hua Chueh. Topic-Based Hierarchical Segmentation
67 -- 81I. Yücel Özbek, Mark Hasegawa-Johnson, Mübeccel Demirekler. On Improving Dynamic State Space Approaches to Articulatory Inversion With MAP-Based Parameter Estimation
82 -- 91Mark R. P. Thomas, Jon Gudnason, Patrick A. Naylor. Estimation of Glottal Closing and Opening Instants in Voiced Speech Using the YAGA Algorithm
92 -- 102Jesper Jensen, Richard C. Hendriks. Spectral Magnitude Minimum Mean-Square Error Estimation Using Binary and Continuous Gain Functions
103 -- 107Hen-Geul Yeh, Carlos Rangel Ruiz. Fixed-Point Implementation of Cascaded Forward-Backward Adaptive Predictors
108 -- 121Tobias May, Steven van de Par, Armin Kohlrausch. Noise-Robust Speaker Recognition Combining Missing Data Techniques and Universal Background Modeling
122 -- 135Alberto Carini, Stefania Cecchi, Francesco Piazza, Ivan Omiciuolo, Giovanni L. Sicuranza. Multiple Position Room Response Equalization in Frequency Domain
136 -- 146Iman S. Mossavat, Petko N. Petkov, W. Bastiaan Kleijn, Oliver Amft. A Hierarchical Bayesian Approach to Modeling Heterogeneity in Speech Quality Assessment
147 -- 161Thomas Ulrich Christiansen, Steven Greenberg. Perceptual Confusions Among Consonants, Revisited - Cross-Spectral Integration of Phonetic-Feature Information and Consonant Recognition
162 -- 174Enzo De Sena, Hüseyin Hacihabiboglu, Zoran Cvetkovic. On the Design and Implementation of Higher Order Differential Microphones
175 -- 189Ted S. Wada, Biing-Hwang Juang. Enhancement of Residual Echo for Robust Acoustic Echo Cancellation
190 -- 199Adam M. Stark, Mark D. Plumbley. Performance Following: Real-Time Prediction of Musical Sequences Without a Score
200 -- 210Matthias Mauch, Hiromasa Fujihara, Masataka Goto. Integrating Additional Chord Information Into HMM-Based Lyrics-to-Audio Alignment
211 -- 222Berlin Chen, Shih-Hsiang Lin. A Risk-Aware Modeling Framework for Speech Summarization
223 -- 233Richard C. Hendriks, Timo Gerkmann. Noise Correlation Matrix Estimation for Multi-Microphone Speech Enhancement
234 -- 245Giovanni L. Sicuranza, Alberto Carini. On the BIBO Stability Condition of Adaptive Recursive FLANN Filters With Application to Nonlinear Active Noise Control
246 -- 260Francesco Nesta, Maurizio Omologo. Generalized State Coherence Transform for Multidimensional TDOA Estimation of Multiple Sources
261 -- 275Yasmín Montenegro M., José Carlos M. Bermudez. Transient Mean-Square Analysis of Prediction Error Method-Based Adaptive Feedback Cancellation in Hearing Aids
276 -- 289Lei Xie, Lilei Zheng, Zihan Liu, Yanning Zhang. Laplacian Eigenmaps for Automatic Story Segmentation of Broadcast News
290 -- 301Norberto Degara, Enrique Argones-Rúa, Antonio Pena, Soledad Torres-Guijarro, Matthew E. P. Davies, Mark D. Plumbley. Reliability-Informed Beat Tracking of Musical Signals
302 -- 313Jen-Tzung Chien, Hsin-Lung Hsieh. Convex Divergence ICA for Blind Source Separation
314 -- 321Han-gil Moon. A Low-Complexity Design for an MP3 Multi-Channel Audio Decoding System
322 -- 335Celia Shahnaz, Wei-Ping Zhu, M. Omair Ahmad. Pitch Estimation Based on a Harmonic Sinusoidal Autocorrelation Model and a Time-Domain Matching Scheme
336 -- 341Claudio Garretón, Néstor Becerra Yoma. Telephone Channel Compensation in Speaker Verification Using a Polynomial Approximation in the Log-Filter-Bank Energy Domain
342 -- 348Vishweshwara Rao, Pradeep Gaddipati, Preeti Rao. Signal-Driven Window-Length Adaptation for Sinusoid Detection in Polyphonic Music