Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 21, Issue 9

1777 -- 1790Zhanyu Ma, Arne Leijon, W. Bastiaan Kleijn. Vector quantization of LSF parameters with a mixture of dirichlet distributions
1791 -- 1804Liang Lu, K. K. Chin, Arnab Ghoshal, Stephen Renals. Joint Uncertainty Decoding for Noise Robust Subspace Gaussian Mixture Models
1805 -- 1817Dimitrios Giannoulis, Anssi Klapuri. Musical Instrument Recognition in Polyphonic Audio Using Missing Feature Approach
1818 -- 1829Freddy William, Abhijeet Sangwan, John H. L. Hansen. Automatic Accent Assessment Using Phonetic Mismatch and Human Perception
1830 -- 1840Stanislaw Andrzej Raczynski, Emmanuel Vincent, Shigeki Sagayama. Dynamic Bayesian Networks for Symbolic Polyphonic Pitch Modeling
1841 -- 1853Raymond W. M. Ng, Tan Lee, Cheung Chi Leung, Bin Ma, Haizhou Li. Spoken Language Recognition With Prosodic Features
1854 -- 1866Benoit Fuentes, Roland Badeau, Gaël Richard. Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription
1867 -- 1878Jose Manuel Gil-Cacho, Marco Signoretto, Toon van Waterschoot, Marc Moonen, Søren Holdt Jensen. Nonlinear Acoustic Echo Cancellation Based on a Sliding-Window Leaky Kernel Affine Projection Algorithm
1879 -- 1890Ina Kodrasi, Stefan Goetze, Simon Doclo. Regularization for Partial Multichannel Equalization for Speech Dereverberation
1891 -- 1899Roman Scharrer, Michael Vorländer. Sound Field Classification in Small Microphone Arrays Using Spatial Coherences
1900 -- 1912Muhammad Salman Khan, Syed M. Naqvi, Ata ur-Rehman, Wenwu Wang, Jonathon A. Chambers. Video-Aided Model-Based Source Separation in Real Reverberant Rooms
1913 -- 1928Mehrez Souden, Shoko Araki, Keisuke Kinoshita, Tomohiro Nakatani, Hiroshi Sawada. A Multichannel MMSE-Based Framework for Speech Source Separation and Noise Reduction
1929 -- 1939Matías Zanartu, Julio C. Ho, Daryush D. Mehta, Robert E. Hillman, George R. Wodicka. Subglottal Impedance-Based Inverse Filtering of Voiced Sounds Using Neck Surface Acceleration
1940 -- 1952Alex Southern, Samuel Siltanen, Damian T. Murphy, Lauri Savioja. Room Impulse Response Synthesis and Validation Using a Hybrid Acoustic Model
1953 -- 1965Futoshi Asano, Hideki Asoh, Kazuhiro Nakadai. Sound Source Localization Using Joint Bayesian Estimation With a Hierarchical Noise Model
1966 -- 1978Neil Wachowski, Mahmood R. Azimi-Sadjadi. Characterization of Multiple Transient Acoustical Sources From Time-Transform Representations
1979 -- 1986Cheng-Yuan Chang, Sen M. Kuo. Complete Parallel Narrowband Active Noise Control Systems

Volume 21, Issue 8

1539 -- 1549Constantin Paleologu, Jacob Benesty, Silviu Ciochina. Study of the General Kalman Filter for Echo Cancellation
1550 -- 1559Anaik Olivero, Bruno Torrésani, Richard Kronland-Martinet. A Class of Algorithms for Time-Frequency Multiplier Estimation
1560 -- 1572Olaf Schleusing, Tomi Kinnunen, Brad H. Story, Jean-Marc Vesin. Joint Source-Filter Optimization for Accurate Vocal Tract Estimation Using Differential Evolution
1573 -- 1585Dumidu S. Talagala, Wen Zhang 0002, Thushara D. Abhayapala. Broadband DOA Estimation Using Sensor Arrays on Complex-Shaped Rigid Bodies
1586 -- 1597Jiajun Zhang, Feifei Zhai, Chengqing Zong. Syntax-Based Translation With Bilingually Lexicalized Synchronous Tree Substitution Grammars
1598 -- 1611Chang Woo Han, Shin Jae Kang, Nam Soo Kim. Reverberation and Noise Robust Feature Compensation Based on IMM
1612 -- 1621Anoop Deoras, Gökhan Tür, Ruhi Sarikaya, Dilek Z. Hakkani-Tür. Joint Discriminative Decoding of Words and Semantic Tags for Spoken Language Understanding
1622 -- 1631Ville Hautamäki, Tomi Kinnunen, Filip Sedlak, Kong-Aik Lee, Bin Ma, Haizhou Li. Sparse Classifier Fusion for Speaker Verification
1632 -- 1639Sarthak Khanal, Harvey F. Silverman, Rahul R. Shakya. A Free-Source Method (FrSM) for Calibrating a Large-Aperture Microphone Array
1640 -- 1652Volker Leutnant, Alexander Krueger, Reinhold Haeb-Umbach. Bayesian Feature Enhancement for Reverberation and Noise Robust Speech Recognition
1653 -- 1665Enzo De Sena, Hüseyin Hacihabiboglu, Zoran Cvetkovic. Analysis and Design of Multichannel Systems for Perceptual Sound Field Reconstruction
1666 -- 1675Marcelo F. Caetano, Xavier Rodet. Musical Instrument Sound Morphing Guided by Perceptually Motivated Features
1676 -- 1688Bin Cheng, Christian Ritz, Ian S. Burnett, Xiguang Zheng. A General Compression Approach to Multi-Channel Three-Dimensional Audio
1689 -- 1698Weifeng Li, Longbiao Wang, Yicong Zhou, Hervé Bourlard, Qingmin Liao. Robust Log-Energy Estimation and its Dynamic Change Enhancement for In-car Speech Recognition
1699 -- 1712Alexey Ozerov, Antoine Liutkus, Roland Badeau, Gaël Richard. Coding-Based Informed Source Separation: Nonnegative Tensor Factorization Approach
1713 -- 1726David Imseng, Hervé Bourlard, John Dines, Philip N. Garner, Mathew Magimai-Doss. Applying Multi- and Cross-Lingual Stochastic Phone Space Transformations to Non-Native Speech Recognition
1727 -- 1741Eleftheria Georganti, Tobias May, Steven van de Par, John Mourjopoulos. Sound Source Distance Estimation in Rooms based on Statistical Properties of Binaural Signals
1742 -- 1754Michael Wohlmayr, Franz Pernkopf. Model-Based Multiple Pitch Tracking Using Factorial HMMs: Model Adaptation and Inference
1755 -- 1759Ron M. Hecht, Elad Noor, Gil Dobry, Yaniv Zigel, Aharon Bar-Hillel, Naftali Tishby. Effective Model Representation by Information Bottleneck Principle
1760 -- 1765Luis Weruaga, Leonid Dimitrov. The Spectral Nature of Maximum Likelihood Noise Compensated Linear Prediction

Volume 21, Issue 7

1317 -- 1329Charles Verron, P.-A. Gauthier, Jennifer Langlois, Catherine Guastavino. Spectral and Spatial Multichannel Analysis/Synthesis of Interior Aircraft Sounds
1330 -- 1342Chun-an Chan, Lin-Shan Lee. Model-Based Unsupervised Spoken Term Detection with Spoken Queries
1343 -- 1354Jingdong Chen, Jacob Benesty. On the Time-Domain Widely Linear LCMV Filter for Noise Reduction With a Stereo System
1355 -- 1368Ji Ming, Ramji Srinivasan, Danny Crookes, Ayeh Jafari. CLOSE - A Data-Driven Approach to Speech Separation
1369 -- 1380Masahito Togami, Yohei Kawaguchi, Ryu Takeda, Yasunari Obuchi, Nobuo Nukaga. Optimized Speech Dereverberation From Probabilistic Perspective for Time Varying Acoustic Transfer Function
1381 -- 1390Yuxuan Wang, DeLiang Wang. Towards Scaling Up Classification-Based Speech Separation
1391 -- 1402Simon Arberet, Pierre Vandergheynst, Rafael E. Carrillo, Jean-Philippe Thiran, Yves Wiaux. Sparse Reverberant Audio Source Separation via Reweighted Analysis
1403 -- 1414Shuhua Zhang, Weibei Dou, Huazhong Yang. MDCT Sinusoidal Analysis for Audio Signals Analysis and Processing
1415 -- 1423Balaji Vasan Srinivasan, Yuancheng Luo, Daniel Garcia-Romero, Dmitry N. Zotkin, Ramani Duraiswami. A Symmetric Kernel Partial Least Squares Framework for Speaker Recognition
1424 -- 1433X. Cai, W. Li. Ranking Through Clustering: An Integrated Approach to Multi-Document Summarization
1434 -- 1444Stanislaw Gorlow, J. D. Reiss. Model-Based Inversion of Dynamic Range Compression
1445 -- 1457Matthew McCallum, Bernard J. Guillemin. Stochastic-Deterministic MMSE STFT Speech Enhancement With General A Priori Information
1458 -- 1468Ali Hassan, Robert I. Damper, Mahesan Niranjan. On Acoustic Emotion Recognition: Compensating for Covariate Shift
1469 -- 1480F. Liu, Y. Liu. Towards Abstractive Speech Summarization: Exploring Unsupervised and Supervised Approaches for Spoken Utterance Compression
1481 -- 1488Vesa Välimäki, Heidi-Maria Lehtonen, Marko Takanen. A Perceptual Study on Velvet Noise and Its Variants at Different Pulse Densities
1489 -- 1501Olivier Derrien, Roland Badeau, Gaël Richard. Parametric Audio Coding With Exponentially Damped Sinusoids
1502 -- 1512Danilo Comminiello, Michele Scarpiniti, Luis Antonio Azpicueta-Ruiz, Jerónimo Arenas-García, Aurelio Uncini. Functional Link Adaptive Filters for Nonlinear Acoustic Echo Cancellation
1513 -- 1523Shmulik Markovich Golan, Sharon Gannot, Israel Cohen. Performance of the SDW-MWF With Randomly Located Microphones in a Reverberant Enclosure
1524 -- 1533Stefan Bilbao. Modeling of Complex Geometries and Boundary Conditions in Finite Difference/Finite Volume Time Domain Room Acoustics Simulation

Volume 21, Issue 6

1123 -- 1133Emanuel A. P. Habets, Jacob Benesty. Multi-Microphone Noise Reduction Based on Orthogonal Noise Signal Decompositions
1134 -- 1144Ping Xu, Pascale Fung. Cross-Lingual Language Modeling for Low-Resource Speech Recognition
1145 -- 1157Dongwen Ying, YongHong Yan. Noise Estimation Using a Constrained Sequential Hidden Markov Model in the Log-Spectral Domain
1158 -- 1169Ciprian Chelba, Peng Xu, Fernando Pereira, Thomas Richardson. Large Scale Distributed Acoustic Modeling With Back-Off ℕ-Grams
1170 -- 1179John Kane, Christer Gobl. Wavelet Maxima Dispersion for Breathy to Tense Voice Discrimination
1180 -- 1189Bin Zhang 0009, Alex Marin, Brian Hutchinson, Mari Ostendorf. Learning Phrase Patterns for Text Classification
1190 -- 1200Ilker Bayram, Mustafa E. Kamasak. A Simple Prior for Audio Signals
1201 -- 1216Lars-Johan Brännmark, Adrian Bahne, Anders Ahlén. Compensation of Loudspeaker-Room Responses in a Robust MIMO Control Framework
1228 -- 1239Ki-Seung Lee. Position-Dependent Crosstalk Cancellation Using Space Partitioning
1240 -- 1250Yusuke Hioka, Ken'ichi Furuya, Kazunori Kobayashi, Kenta Niwa, Yoichi Haneda. Underdetermined Sound Source Separation Using Power Spectrum Density Estimated by Combination of Directivity Gain
1251 -- 1260Benjamin Lecouteux, Georges Linares, Yannick Estève, Guillaume Gravier. Dynamic Combination of Automatic Speech Recognition Systems by Driven Decoding
1261 -- 1271Saman Mousazadeh, Israel Cohen. Voice Activity Detection in Presence of Transient Noise Using Spectral Clustering
1272 -- 1284Hung-yi Lee, Lin-Shan Lee. Enhanced Spoken Term Detection Using Support Vector Machines and Weighted Pseudo Examples
1285 -- 1294Tom Ko, Brian Mak. Eigentriphones for Context-Dependent Acoustic Modeling
1295 -- 1307David Rybach, Hermann Ney, Ralf Schlüter. Lexical Prefix Tree and WFST: A Comparison of Two Dynamic Search Concepts for LVCSR

Volume 21, Issue 5

899 -- 906Guangzhao Bao, Zhongfu Ye, Xu Xu, Yingyue Zhou. A Compressed Sensing Approach to Blind Separation of Speech Mixture Based on a Two-Layer Sparsity Model
907 -- 922Shing-Chow Chan, Y. J. Chu, Z. G. Zhang, Kai Man Tsui. A New Variable Regularized QR Decomposition-Based Recursive Least M-Estimate Algorithm - Performance Analysis and Acoustic Applications
923 -- 933Jesper Rindom Jensen, Mads Græsbøll Christensen, Søren Holdt Jensen. Nonlinear Least Squares Methods for Joint DOA and Pitch Estimation
934 -- 944Sandro Cumani, Pietro Laface. Memory and Computation Trade-Offs for Efficient I-Vector Extraction
945 -- 958Emanuel A. P. Habets, Jacob Benesty. A Two-Stage Beamforming Approach for Noise Reduction and Dereverberation
959 -- 970Marco Liuni, Axel Röbel, Ewa Matusiak, Marco Romito, Xavier Rodet. Automatic Adaptation of the Time-Frequency Resolution for Sound Analysis and Re-Synthesis
971 -- 982Hiroshi Sawada, Hirokazu Kameoka, Shoko Araki, Naonori Ueda. Multichannel Extensions of Non-Negative Matrix Factorization With Complex-Valued Data
983 -- 997Robert M. Nickel, Ramón Fernandez Astudillo, Dorothea Kolossa, Rainer Martin. Corpus-Based Speech Enhancement With Uncertainty Modeling and Cepstral Smoothing
998 -- 1011Nasser Mohammadiha, Arne Leijon. Nonnegative HMM for Babble Noise Derived From Speech HMM: Application to Speech Enhancement
1012 -- 1022Wei Rao, Man-Wai Mak. Boosting the Performance of I-Vector Based Speaker Verification via Utterance Partitioning
1023 -- 1034Ramón Fernandez Astudillo, Reinhold Orglmeister. Computing MMSE Estimates and Residual Uncertainty Directly in the Feature Domain of ASR using STFT Domain Speech Distortion Models
1035 -- 1045Petko N. Petkov, Gustav Eje Henter, W. Bastiaan Kleijn. Maximizing Phoneme Recognition Accuracy for Enhanced Speech Intelligibility in Noise
1046 -- 1059Maciej Niedzwiecki, Marcin Ciolek. Elimination of Impulsive Disturbances From Archive Audio Signals Using Bidirectional Processing
1060 -- 1089Li Deng, Xiao Li. Machine Learning Paradigms for Speech Recognition: An Overview
1090 -- 1101Coskun Mermer, Murat Saraclar, Ruhi Sarikaya. Improving Statistical Machine Translation Using Bayesian Word Alignment and Gibbs Sampling
1102 -- 1112Xiaodan Zhu, Colin Cherry, Gerald Penn. A Graph-Partitioning Framework for Aligning Hierarchical Topic Structures to Presentations
1113 -- 1118Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen. Binaural Integrated Active Noise Control and Noise Reduction in Hearing Aids

Volume 21, Issue 4

685 -- 696Shoichi Koyama, Ken'ichi Furuya, Yusuke Hiwasaki, Yoichi Haneda. Analytical Approach to Wave Field Reconstruction Filtering in Spatio-Temporal Frequency Domain
697 -- 710Xiao-lei Zhang, Ji Wu. Deep Belief Networks Based Voice Activity Detection
711 -- 724Emmanuel Ravelli, Vinay Melkote, Tejaswi Nanjundaswamy, Kenneth Rose. Joint Optimization of Base and Enhancement Layers in Scalable Audio Coding
725 -- 736Cemil Demir, Murat Saraclar, Ali Taylan Cemgil. Single-Channel Speech-Music Separation for Robust ASR With Mixture Models
737 -- 748Athanasia Zlatintsi, Petros Maragos. Multiscale Fractal Analysis of Musical Instrument Signals With Application to Recognition
749 -- 761Shakeel Ahmed, Muhammad Tahir Akhtar, Xi Zhang. Robust Auxiliary-Noise-Power Scheduling in Active Noise Control Systems With Online Secondary Path Modeling
762 -- 774Yakun Hu, Dapeng Wu, Antonio Nucci. Fuzzy-Clustering-Based Decision Tree Approach for Large Population Speaker Identification
775 -- 785Nicki Holighaus, Monika Dörfler, Gino Angelo M. Velasco, Thomas Grill. A Framework for Invertible, Real-Time Constant-Q Transforms
786 -- 797Sabato Marco Siniscalchi, Torbjørn Svendsen, Chin-Hui Lee. A Bottom-Up Modular Search Approach to Large Vocabulary Continuous Speech Recognition
798 -- 805JiHoon Park, Kwang-Ki Kim, Minsoo Hahn. Vocal Removal From Multiobject Audio Using Harmonic Information for Karaoke Service
806 -- 815John Woodruff, DeLiang Wang. Binaural Detection, Localization, and Segregation in Reverberant Environments Based on Joint Pitch and Azimuth Cues
816 -- 827Carlos Vaquero, Alfonso Ortega, Antonio Miguel, Eduardo Lleida. Quality Assessment for Speaker Diarization and Its Application in Speaker Characterization
828 -- 841Alireza Masnadi-Shirazi, Bhaskar D. Rao. An ICA-SCT-PHD Filter Approach for Tracking and Separation of Unknown Time-Varying Number of Sources
842 -- 853Taufiq Hasan, John H. L. Hansen. Acoustic Factor Analysis for Robust Speaker Verification
854 -- 867Gayadhar Pradhan, S. R. Mahadeva Prasanna. Speaker Verification by Vowel and Nonvowel Like Segmentation
868 -- 878Shing-Chow Chan, Y. J. Chu, Z. G. Zhang. A New Variable Regularized Transform Domain NLMS Adaptive Filtering Algorithm - Acoustic Applications and Performance Analysis
879 -- 888Nicolas Ellaham, Christian Giguere, Wail Gueaieb. Evaluation of the Phase-Inversion Signal Separation Method When Using Nonlinear Hearing Aids

Volume 21, Issue 3

463 -- 475Hongsen He, Lifu Wu, Jing Lu, Xiaojun Qiu, Jingdong Chen. Time Difference of Arrival Estimation Exploiting Multichannel Spatio-Temporal Prediction
476 -- 487Shan Liang, Wenju Liu, Wei Jiang. A New Bayesian Method Incorporating With Local Correlation for IBM Estimation
488 -- 497Stephan Tassart. Band-Limited Impulse Train Generation Using Sampled Infinite Impulse Responses of Analog Filters
498 -- 507Xin Chen, Yunxin Zhao. Building Acoustic Model Ensembles by Data Sampling With Enhanced Trainings and Features
508 -- 519Simone Spagnol, Michele Geronazzo, Federico Avanzini. On the Relation Between Pinna Reflection Patterns and Head-Related Transfer Function Features
520 -- 530Vipul Arora, Laxmidhar Behera. On-Line Melody Extraction From Polyphonic Audio Using Harmonic Cluster Tracking
531 -- 543Meinard Müller, Nanzhu Jiang, Peter Grosche. A Robust Fitness Measure for Capturing Repetitions in Music Recordings With Applications to Audio Thumbnailing
544 -- 555Shi-Xiong Zhang, Mark J. F. Gales. Structured SVMs for Automatic Speech Recognition
556 -- 566Daniel Erro, Eva Navas, Inma Hernáez. Parametric Voice Conversion Based on Bilinear Frequency Warping Plus Amplitude Scaling
567 -- 578Shuhua Zhang, Laurent Girin. Fast and Accurate Direct MDCT to DFT Conversion With Arbitrary Window Functions
579 -- 586Ladan Baghai-Ravary. The Inherent Temporal Precision of Phoneme Transitions
587 -- 597Matt Shannon, Heiga Zen, William Byrne. Autoregressive Models for Statistical Parametric Speech Synthesis
598 -- 610Jesper Kjær Nielsen, Mads Græsbøll Christensen, Søren Holdt Jensen. Default Bayesian Estimation of the Fundamental Frequency
611 -- 623Ziqiang Shi, Jiqing Han, Tieran Zheng, Ji Li. Identification of Objectionable Audio Segments Based on Pseudo and Heterogeneous Mixture Models
624 -- 635José A. González, Antonio M. Peinado, Ning Ma, Angel M. Gomez, Jon Barker. MMSE-Based Missing-Feature Reconstruction With Temporal Modeling for Robust Speech Recognition
636 -- 648Bassam Jabaian, Laurent Besacier, Fabrice Lefevre. Comparison and Combination of Lightly Supervised Approaches for Language Portability of a Spoken Language Understanding System
649 -- 658Renxian Zhang, Wenjie Li, Dehong Gao, Ouyang You. Automatic Twitter Topic Summarization With Speech Acts
659 -- 668Weibin Zhang, Pascale Fung. Sparse Inverse Covariance Matrices for Low Resource Speech Recognition
669 -- 674Jonathan Botts, José Escolano, Ning Xiang. Design of IIR Filters With Bayesian Model Selection and Parameter Estimation
675 -- 680Kais Khaldi, Abdel-Ouahab Boudraa. Audio Watermarking Via EMD

Volume 21, Issue 2

223 -- 233Alexandre Trilla, Francesc Alías. Sentence-Based Sentiment Analysis for Expressive Text-to-Speech
234 -- 246Paolo Annibale, Jason Filos, Patrick A. Naylor, Rudolf Rabenstein. TDOA-Based Speed of Sound Estimation for Air Temperature and Room Geometry Inference
247 -- 259Jung-Woo Choi, Yang-Hann Kim. Sound Field Reproduction of a Virtual Source Inside a Loudspeaker Array With Minimal External Radiation
260 -- 269Wei Wu, Mari Ostendorf. Graph-Based Query Strategies for Active Learning
270 -- 279Yuxuan Wang, Kun Han, DeLiang Wang. Exploring Monaural Features for Classification-Based Speech Segregation
280 -- 290Yao Qian, Frank K. Soong, Zhi-Jie Yan. A Unified Trajectory Tiling Approach to High Quality Speech Rendering
291 -- 300Erinç Dikici, Murat Semerci, Murat Saraclar, Ethem Alpaydin. Classification and Ranking Approaches to Discriminative Language Modeling for ASR
301 -- 312Boyan Huang, Yegui Xiao, Jinwei Sun, Guo Wei. A Variable Step-Size FXLMS Algorithm for Narrowband Active Noise Control
313 -- 321Stefano D'Angelo, Jyri Pakarinen, Vesa Välimäki. New Family of Wave-Digital Triode Models
322 -- 335Saeed Mosayyebpour, Morteza Esmaeili, T. Aaron Gulliver. Single-Microphone Early and Late Reverberation Suppression in Noisy Speech
336 -- 342Chengshi Zheng, Hao Liu, Renhua Peng, Xiaodong Li. A Statistical Analysis of Two-Channel Post-Filter Estimators in Isotropic Noise Fields
343 -- 356Shmulik Markovich Golan, Sharon Gannot, Israel Cohen. Distributed Multiple Constraints Generalized Sidelobe Canceler for Fully Connected Wireless Acoustic Sensor Networks
357 -- 366Ian McGraw, Ibrahim Badr, James R. Glass. Learning Lexicons From Speech Using a Pronunciation Mixture Model
367 -- 377Jonathan Dennis, Tran Huy Dat, Engsiong Chng. Image Feature Representation of the Subband Power Distribution for Robust Sound Event Classification
378 -- 387Nasim Radmanesh, Ian S. Burnett. Generation of Isolated Wideband Sound Fields Using a Combined Two-stage Lasso-LS Algorithm
388 -- 396Dong Yu, Li Deng, Frank Seide. The Deep Tensor Neural Network With Applications to Large Vocabulary Speech Recognition
397 -- 406Manas A. Pathak, Bhiksha Raj. Privacy-Preserving Speaker Verification and Identification Using Gaussian Mixture Models
407 -- 415Abigail A. Kressner, David V. Anderson, Christopher J. Rozell. Evaluating the Generalization of the Hearing Aid Speech Quality Index (HASQI)
416 -- 426Sridhar Krishna Nemala, Kailash Patil, Mounya Elhilali. A Multistream Feature Framework Based on Bandpass Modulation Filtering for Robust Speech Recognition
427 -- 438Chiong-Ching Lai, Sven Nordholm, Yee Hong Leung. Design of Steerable Spherical Broadband Beamformers With Flexible Sensor Configurations
439 -- 443Antonio Canclini, Fabio Antonacci, Augusto Sarti, Stefano Tubaro. Acoustic Source Localization With Distributed Asynchronous Microphone Networks
444 -- 448Stefano Gaiotto. A Tuning-Less Approach in Secondary Path Modeling in Active Noise Control Systems
449 -- 455Matt Speed, Damian T. Murphy, David M. Howard. Three-Dimensional Digital Waveguide Mesh Simulation of Cylindrical Vocal Tract Analogs

Volume 21, Issue 12

2471 -- 2480A. P. Prathosh, T. V. Ananthapadmanabha, A. G. Ramakrishnan. Epoch Extraction Based on Integrated Linear Prediction Residual Using Plosion Index
2481 -- 2492Tomas Dekens, Werner Verhelst. Body Conducted Speech Enhancement by Equalization and Signal Fusion
2493 -- 2505Dejan Markovic, Fabio Antonacci, Augusto Sarti, Stefano Tubaro. Soundfield Imaging in the Ray Space
2506 -- 2515Partha Lal, Simon King. Cross-Lingual Automatic Speech Recognition Using Tandem Features
2516 -- 2531Tomohiro Nakatani, Shoko Araki, Takuya Yoshioka, Marc Delcroix, Masakiyo Fujimoto. Dominance Based Integration of Spatial and Spectral Features for Speech Enhancement
2532 -- 2540Yotam Peled, Boaz Rafaely. Linearly-Constrained Minimum-Variance Method for Spherical Microphone Arrays Based on Plane-Wave Decomposition of the Sound Field
2541 -- 2553Jean-Louis Durrieu, Jean-Philippe Thiran. Source/Filter Factorial Hidden Markov Model, With Application to Pitch and Formant Tracking
2554 -- 2569Katherine Ellis, Emanuele Coviello, Antoni B. Chan, Gert R. G. Lanckriet. A Bag of Systems Representation for Music Auto-Tagging
2570 -- 2582Aroor Dinesh Dileep, C. Chandra Sekhar. HMM Based Intermediate Matching Kernel for Classification of Sequential Patterns of Speech Using Support Vector Machines
2583 -- 2594Oliver Thiergart, Giovanni Del Galdo, Maja Taseska, Emanuel A. P. Habets. Geometry-Based Spatial Sound Acquisition Using Distributed Microphone Arrays
2595 -- 2606Jesper Rindom Jensen, Jacob Benesty, Mads Græsbøll Christensen, Jingdong Chen. A Class of Optimal Rectangular Filtering Matrices for Single-Channel Signal Enhancement in the Time Domain
2607 -- 2615Yizhao Ni, Matt McVicar, Raúl Santos-Rodríguez, Tijl De Bie. Understanding Effects of Subjectivity in Measuring Chord Estimation Accuracy
2616 -- 2626Georg Heigold, Hermann Ney, Ralf Schlüter. Investigations on an EM-Style Optimization Algorithm for Discriminative Training of HMMs
2627 -- 2637Bruno Defraene, Naim Mansour, Steven De Hertogh, Toon van Waterschoot, Moritz Diehl, Marc Moonen. Declipping of Audio Signals Using Perceptual Compressed Sensing

Volume 21, Issue 11

2231 -- 2243Stephen J. Wright, Dimitri Kanevsky, Li Deng, Xiaodong He, Georg Heigold, Haizhou Li. Optimization Algorithms and Applications for Speech and Language Processing
2244 -- 2254Gillian M. Chin, Jorge Nocedal, Peder A. Olsen, Steven J. Rennie. Second Order Methods for Optimizing Convex Matrix Functions and Sparse Covariance Clustering
2255 -- 2266Theodoros Tsiligkaridis, Etienne Marcheret, Vaibhava Goel. A Difference of Convex Functions Approach to Large-Scale Log-Linear Model Estimation
2267 -- 2276Tara N. Sainath, Brian Kingsbury, Hagen Soltau, Bhuvana Ramabhadran. Optimization Techniques to Improve Training Speed of Deep Neural Networks for Large Speech Tasks
2277 -- 2289Tuomas Virtanen, Jort Florent Gemmeke, Bhiksha Raj. Active-Set Newton Algorithm for Overcomplete Non-Negative Representations of Audio
2290 -- 2300Patrick Cardinal, Pierre Dumouchel, Gilles Boulianne. Large Vocabulary Speech Recognition on Parallel Architectures
2301 -- 2312Rémi Mignot, Laurent Daudet, François Ollivier. Room Reverberation Reconstruction: Interpolation of the Early Part Using Compressed Sensing
2313 -- 2323Min Zhang 0005, Wenliang Chen, Xiangyu Duan, Rong Zhang. Improving Graph-Based Dependency Parsing Models With Dependency Language Models
2324 -- 2336Florian Pflug, Tim Fingscheidt. Robust Ultra-Low Latency Soft-Decision Decoding of Linear PCM Audio
2337 -- 2345Koji Seto, Tokunbo Ogunfunmi. Scalable Speech Coding for IP Networks: Beyond iLBC
2346 -- 2355Kenta Niwa, Yusuke Hioka, Ken'ichi Furuya, Yoichi Haneda. Diffused Sensing for Sharp Directive Beamforming
2356 -- 2367Symeon Delikaris-Manias, Ville Pulkki. Cross Pattern Coherence Algorithm for Spatial Filtering Applications Utilizing Microphone Arrays
2368 -- 2378Bai Ying Lei, Ing Yann Soon, Ee-Leng Tan. Robust SVD-Based Audio Watermarking Scheme With Differential Evolution Optimization
2379 -- 2392Nikos Malandrakis, Alexandros Potamianos, Elias Iosif, Shrikanth S. Narayanan. Distributional Semantic Models for Affective Text Analysis
2393 -- 2402Pasi Pertilä, Matti S. Hämäläinen, Mikael Mieskolainen. Passive Temporal Offset Estimation of Multichannel Recordings of an Ad-Hoc Microphone Array
2403 -- 2411Liang Vincent Wang, Woon-Seng Gan, Andy W. H. Khong, Sen M. Kuo. Convergence Analysis of Narrowband Feedback Active Noise Control System With Imperfect Secondary Path Estimation
2412 -- 2424Chi-Man Pun, Xiaochen Yuan. Robust Segments Detector for De-Synchronization Resilient Audio Watermarking
2425 -- 2438Miranti Indar Mandasari, Rahim Saeidi, Mitchell McLaren, David A. van Leeuwen. Quality Measure Functions for Calibration of Speaker Recognition Systems in Various Duration Conditions
2439 -- 2450Fabian Triefenbach, Azarakhsh Jalalvand, Kris Demuynck, Jean-Pierre Martens. Acoustic Modeling With Hierarchical Reservoirs
2451 -- 2464Donghyeon Lee, Minwoo Jeong, Kyungduk Kim, Seonghan Ryu, Gary Geunbae Lee. Unsupervised Spoken Language Understanding for a Multi-Domain Dialog System

Volume 21, Issue 10

1993 -- 2005William Hartmann, Arun Narayanan, Eric Fosler-Lussier, DeLiang Wang. A Direct Masking Approach to Robust ASR
2006 -- 2014Yow-Bang Wang, Shang-wen Li, Lin-Shan Lee. An Experimental Analysis on Integrating Multi-Stream Spectro-Temporal, Cepstral and Pitch Information for Mandarin Speech Recognition
2015 -- 2028Stephen Shum, Najim Dehak, Réda Dehak, James R. Glass. Unsupervised Methods for Speaker Diarization: An Integrated and Iterative Approach
2029 -- 2041Zbynek Koldovský, Jirí Málek, Petr Tichavský, Francesco Nesta. Semi-Blind Noise Extraction Using Partially Known Position of the Target Source
2042 -- 2056Mads Graesboll Christensen. Accurate Estimation of Low Fundamental Frequencies From Real-Valued Measurements
2057 -- 2072Philippe Esling, Carlos Agon. Multiobjective Time Series Matching for Audio Classification and Retrieval
2073 -- 2084Chao Zhang, Yi Liu, Yunqing Xia, Xuan Wang, Chin-Hui Lee. Reliable Accent-Specific Unit Generation With Discriminative Dynamic Gaussian Mixture Selection for Multi-Accent Chinese Speech Recognition
2085 -- 2095Gilles Degottex, Yannis Stylianou. Analysis and Synthesis of Speech Using an Adaptive Full-Band Harmonic Model
2096 -- 2107Bilei Zhu, Wei Li, Ruijiang Li, Xiangyang Xue. Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation
2108 -- 2117Sadao Hiroya. Non-Negative Temporal Decomposition of Speech Parameters by Multiplicative Update Rules
2118 -- 2128Cyril Joder, Slim Essid, Gaël Richard. Learning Optimal Features for Polyphonic Audio-to-Score Alignment
2129 -- 2139Zhen-Hua Ling, Li Deng, Dong Yu. Modeling Spectral Envelopes Using Restricted Boltzmann Machines and Deep Belief Networks for Statistical Parametric Speech Synthesis
2140 -- 2151Nasser Mohammadiha, Paris Smaragdis, Arne Leijon. Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix Factorization
2152 -- 2161Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee. Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems
2162 -- 2171Nikolay D. Gaubitch, Mike Brookes, Patrick A. Naylor. Blind Channel Magnitude Response Estimation in Speech Using Spectrum Classification
2172 -- 2181Masayuki Suzuki, Takuya Yoshioka, Shinji Watanabe, Nobuaki Minematsu, Keikichi Hirose. Feature Enhancement With Joint Use of Consecutive Corrupted and Noise Feature Vectors With Discriminative Region Weighting
2182 -- 2192Takuya Yoshioka, Tomohiro Nakatani. Noise Model Transfer: Novel Approach to Robustness Against Nonstationary Noise
2193 -- 2206Despoina Pavlidi, Anthony Griffin, Matthieu Puigt, Athanasios Mouchtaris. Real-Time Multiple Sound Source Localization and Counting Using a Circular Microphone Array
2207 -- 2220Sefki Kolozali, Mathieu Barthet, György Fazekas, Mark Sandler. Automatic Ontology Generation for Musical Instruments Based on Audio Analysis

Volume 21, Issue 1

1 -- 11Stanislaw Gorlow, Sylvain Marchand. Informed Audio Source Separation Using Linearly Constrained Spatial Filters
12 -- 26Florin Ghido, Ioan Tabus. Sparse Modeling for Lossless Audio Compression
27 -- 36Xiguang Zheng, Christian Ritz, Jiangtao Xi. Encoding Navigable Speech Sources: A Psychoacoustic-Based Analysis-by-Synthesis Approach
37 -- 48Jwu-Sheng Hu, Ming-Tang Lee, Chia-Hsing Yang. ∞ Filter
49 -- 60Yi-Chin Huang, Chung-Hsien Wu, Yu-Ting Chao. Personalized Spectral and Prosody Conversion Using Frame-Based Codeword Distribution and Adaptive CRF
61 -- 70Nilesh Madhu, Ann Spriet, Sofie Jansen, Raphael Koning, Jan Wouters. The Potential for Speech Intelligibility Improvement Using the Ideal Binary Mask and the Ideal Wiener Filter in Single Channel Noise Reduction Systems: Application to Auditory Prostheses
71 -- 82Zafar Rafii, Bryan Pardo. REpeating Pattern Extraction Technique (REPET): A Simple Method for Music/Voice Separation
83 -- 96Sree Hari Krishnan Parthasarathi, Hervé Bourlard, Daniel Gatica-Perez. Wordless Sounds: Robust Speaker Diarization Using Privacy-Preserving Audio Representations
97 -- 107Feng Huang, Tan Lee. Pitch Estimation in Noisy Speech Using Accumulated Peak Spectrum and Sparse Estimation Technique
108 -- 119Néstor Becerra Yoma, Claudio Garretón, Fernando Huenupán, Ignacio Catalan, Jorge Wuth Sepúlveda. On Reducing Harmonic and Sampling Distortion in Vocal Tract Length Normalization
120 -- 129Ke Hu, DeLiang Wang. An Unsupervised Approach to Cochannel Speech Separation
130 -- 142Ronen Talmon, Israel Cohen, Sharon Gannot. Single-Channel Transient Interference Suppression With Diffusion Maps
143 -- 153Nima Yousefian, Philipos C. Loizou. A Dual-Microphone Algorithm That Can Cope With Competing-Talker Scenarios
154 -- 165Miguel Ferrer, Alberto González, Maria de Diego, Gema Pinero. Convex Combination Filtered-X Algorithms for Active Noise Control Systems
166 -- 175Kun Han, DeLiang Wang. Towards Generalizing Classification Based Speech Separation
176 -- 183Nicolas Sturmel, Laurent Daudet. Informed Source Separation Using Iterative Reconstruction
184 -- 194Ziqiang Shi, Jiqing Han, Tieran Zheng, Shiwen Deng. Audio Segment Classification Using Online Learning Based Tensor Representation Feature Discrimination
195 -- 204Hai-son Le, Ilya Oparin, Alexandre Allauzen, Jean-Luc Gauvain, François Yvon. Structured Output Layer Neural Network Language Models for Speech Recognition
205 -- 217Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi. Articulatory Control of HMM-Based Parametric Speech Synthesis Using Feature-Space-Switched Multiple Regression