Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 24, Issue 9

1481 -- 1494Daniel Cruz Cavalieri, Sira E. Palazuelos-Cagigas, Teodiano Freire Bastos Filho, Mário Sarcinelli Filho. Combination of Language Models for Word Prediction: An Exponential Approach
1495 -- 1510Ofer Schwartz, Sharon Gannot, Emanuel A. P. Habets. An Expectation-Maximization Algorithm for Multimicrophone Speech Dereverberation and Noise Reduction With Coherence Matrix Estimation
1511 -- 1523Symeon Delikaris-Manias, Juha Vilkamo, Ville Pulkki. Signal-Dependent Spatial Filtering Based on Weighted-Orthogonal Beamformers in the Spherical Harmonic Domain
1524 -- 1534Sheng Li, Yuya Akita, Tatsuya Kawahara. Semi-Supervised Acoustic Model Training by Discriminative Data Selection From Multiple ASR Systems' Hypotheses
1535 -- 1547Christian Dittmar, Meinard Müller. Reverse Engineering the Amen Break - Score-Informed Separation and Restoration Applied to Drum Recordings
1548 -- 1559Chao Pan, Jingdong Chen, Jacob Benesty. Reduced-Order Robust Superdirective Beamforming With Uniform Linear Microphone Arrays
1560 -- 1572Derry Fitzgerald, Antoine Liutkus, Roland Badeau. Projection-Based Demixing of Spatial Audio
1573 -- 1588Lin Wang, Joshua D. Reiss, Andrea Cavallaro. Over-Determined Source Separation and Localization Using Distributed Microphones
1589 -- 1598Yang Liu, Sujian Li, Furu Wei, Heng Ji. Relation Classification Via Modeling Augmented Dependency Paths
1599 -- 1612Adam Kuklasinski, Simon Doclo, Søren Holdt Jensen, Jesper Jensen. Maximum Likelihood PSD Estimation for Speech Enhancement in Reverberation and Noise
1613 -- 1625Sam Karimian-Azari, Jesper Rindom Jensen, Mads Græsbøll Christensen. Computationally Efficient and Noise Robust DOA and Pitch Estimation
1626 -- 1641Daichi Kitamura, Nobutaka Ono, Hiroshi Sawada, Hirokazu Kameoka, Hiroshi Saruwatari. Determined Blind Source Separation Unifying Independent Vector Analysis and Nonnegative Matrix Factorization
1642 -- 1651Nicolas Obin, Axel Roebel. Similarity Search of Acted Voices for Automatic Voice Casting
1652 -- 1664Aditya Arie Nugraha, Antoine Liutkus, Emmanuel Vincent. Multichannel Audio Source Separation With Deep Neural Networks
1665 -- 1676Stephen H. Shum, David F. Harwath, Najim Dehak, James R. Glass. On the Use of Acoustic Unit Discovery for Language Recognition

Volume 24, Issue 8

1334 -- 1347Henning F. Schepker, Simon Doclo. Least-Squares Estimation of the Common Pole-Zero Filter of Acoustic Feedback Paths in Hearing Aids
1348 -- 1363Hannes Pessentheiner, Martin Hagmüller, Gernot Kubin. Localization and Characterization of Multiple Harmonic Sources
1364 -- 1379Hanieh Khalilian, Ivan V. Bajic, Rodney G. Vaughan. Comparison of Loudspeaker Placement Methods for Sound Field Reproduction
1380 -- 1392Cheng-Yen Yang, Chih-Wei Liu, Shyh-Jye Jou. A Systematic ANSI S1.11 Filter Bank Specification Relaxation and Its Efficient Multirate Architecture for Hearing-Aid Systems
1393 -- 1407Bracha Laufer-Goldshtein, Ronen Talmon, Sharon Gannot. Semi-Supervised Sound Source Localization Based on Manifold Regularization
1408 -- 1423Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, Radu Horaud. A Variational EM Algorithm for the Separation of Time-Varying Convolutive Audio Mixtures
1424 -- 1437Jun Du, Yanhui Tu, Li-Rong Dai, Chin-Hui Lee. A Regression Approach to Single-Channel Speech Separation Via High-Resolution Deep Neural Networks
1438 -- 1449Xunying Liu, Xie Chen, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland. Two Efficient Lattice Rescoring Methods Using Recurrent Neural Network Language Models
1450 -- 1463Pawel Swietojanski, Jinyu Li, Steve Renals. Learning Hidden Unit Contributions for Unsupervised Acoustic Model Adaptation
1464 -- 1472Meng Zhang, Yang Liu, Huanbo Luan, Maosong Sun. Listwise Ranking Functions for Statistical Machine Translation

Volume 24, Issue 7

1164 -- 1174Min Gao, Jing Lu, Xiaojun Qiu. A Simplified Subband ANC Algorithm Without Secondary Path Modeling
1175 -- 1184Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki. Multiple Non-Negative Matrix Factorization for Many-to-Many Voice Conversion
1185 -- 1193Kai Chen, Qiang Huo. Training Deep Bidirectional LSTM Acoustic Model for LVCSR by a Context-Sensitive-Chunk BPTT Approach
1194 -- 1203Themos Stafylakis, Md. Jahangir Alam, Patrick Kenny. Text-Dependent Speaker Recognition With Random Digit Strings
1204 -- 1218K. T. Deepak, S. R. Mahadeva Prasanna. Foreground Speech Segmentation and Enhancement Using Glottal Closure Instants and Mel Cepstral Coefficients
1219 -- 1229Habib Hajimolahoseini, Rassoul Amirfattahi, Saeed Gazor, Hamid Soltanian-Zadeh. Robust Estimation and Tracking of Pitch Period Using an Efficient Bayesian Filter
1230 -- 1241Subhasmita Sahoo, Aurobinda Routray. A Novel Method of Glottal Inverse Filtering
1242 -- 1254Gilles Degottex, Luc Ardaillon, Axel Roebel. Multi-Frame Amplitude Envelope Estimation for Modification of Singing Voice
1255 -- 1265Zhizheng Wu, Simon King. Improving Trajectory Modelling for DNN-Based Speech Synthesis by Using Stacked Bottleneck Features and Minimum Generation Error Training
1266 -- 1279Xabier Jaureguiberry, Emmanuel Vincent, Gaël Richard. Fusion Methods for Speech Enhancement and Audio Source Separation
1280 -- 1290Rajib Lochan Das, Mrityunjoy Chakraborty. 1 Norm Regularization
1291 -- 1304Maja Taseska, Emanuel A. P. Habets. Spotforming: Spatial Filtering With Distributed Arrays for Position-Selective Sound Acquisition
1305 -- 1314Guangyou Zhou, Zhiwen Xie, Tingting He, Jun Zhao 0001, Xiaohua Tony Hu. Learning the Multilingual Translation Representations for Question Retrieval in Community Question Answering via Non-Negative Matrix Factorization
1315 -- 1329Chanwoo Kim, Richard M. Stern. Power-Normalized Cepstral Coefficients (PNCC) for Robust Speech Recognition

Volume 24, Issue 6

994 -- 1005Asli Çelikyilmaz, Ruhi Sarikaya, Minwoo Jeong, Anoop Deoras. An Empirical Investigation of Word Class-Based Features for Natural Language Understanding
1006 -- 1019Duc Hoang Ha Nguyen, Xiong Xiao, Eng Siong Chng, Haizhou Li. Feature Adaptation Using Linear Spectro-Temporal Transform for Robust Speech Recognition
1020 -- 1028Xiaojun Qian, Helen M. Meng, Frank K. Soong. A Two-Pass Framework of Mispronunciation Detection and Diagnosis for Computer-Aided Pronunciation Training
1029 -- 1037Lijiang Chen, Xia Mao, Hong Yan. Text-Independent Phoneme Segmentation Combining EGG and Speech Data
1038 -- 1051Vincent Mohammad Tavakoli, Jesper Rindom Jensen, Mads Græsbøll Christensen, Jacob Benesty. A Framework for Speech Enhancement With Ad Hoc Microphone Arrays
1052 -- 1065Yan-You Chen, Chung-Hsien Wu, Yi-Chin Huang, Shih-Lun Lin, Jhing-Fa Wang. Candidate Expansion and Prosody Adjustment for Natural Speech Synthesis Using a Small Corpus
1066 -- 1078Xueliang Zhang, Hui Zhang, Shuai Nie, Guanglai Gao, Wenju Liu. A Pairwise Algorithm Using the Deep Stacking Network for Speech Separation and Pitch Estimation
1079 -- 1093Lin Wang, Tsz-Kin Hon, Joshua D. Reiss, Andrea Cavallaro. An Iterative Approach to Source Counting and Localization Using Two Distant Microphones
1094 -- 1105Sean O'Leary, Axel Röbel. A Montage Approach to Sound Texture Synthesis
1106 -- 1118Chahid Ouali, Pierre Dumouchel, Vishwa Gupta. Fast Audio Fingerprinting System Using GPU and a Clustering-Based Technique
1119 -- 1128Francisco Raposo, Ricardo Ribeiro 0001, David Martins de Matos. Using Generic Summarization to Improve Music Information Retrieval Tasks
1129 -- 1139Lantian Li, Dong Wang, Chenhao Zhang, Thomas Fang Zheng. Improving Short Utterance Speaker Recognition by Modeling Speech Unit Classes
1140 -- 1154Jalal Taghia, Rainer Martin. A Frequency-Domain Adaptive Line Enhancer With Step-Size Control Based on Mutual Information for Harmonic Noise Reduction

Volume 24, Issue 5

833 -- 845T. J. Tsai, Andreas Stolcke. Robust and Efficient Multiple Alignment of Unsynchronized Meeting Recordings
846 -- 862Simon Receveur, Robin Weib, Tim Fingscheidt. Turbo Automatic Speech Recognition
863 -- 874Ricard Marxer, Hendrik Purwins. Unsupervised Incremental Online Learning and Prediction of Musical Audio Signals
875 -- 889Mohammad Adeli, Jean Rouat, Sean Wood, Stephane Molotchnikoff, Eric Plourde. A Flexible Bio-Inspired Hierarchical Model for Analyzing Musical Timbre
890 -- 900Geliang Zhang, Simon J. Godsill. Fundamental Frequency Estimation in Speech Signals With Variable Rate Particle Filters
901 -- 913Nadine Kroher, Emilia Gómez. Automatic Transcription of Flamenco Singing From Polyphonic Music Recordings
914 -- 926Fiete Winter, Jens Ahrens, Sascha Spors. On Analytic Methods for 2.5-D Local Sound Field Synthesis Using Circular Distributions of Secondary Sources
927 -- 939Siddharth Sigtia, Emmanouil Benetos, Simon Dixon. An End-to-End Neural Network for Polyphonic Piano Music Transcription
940 -- 951Martin Krawczyk-Becker, Timo Gerkmann. Fundamental Frequency Informed Speech Enhancement in a Flexible Statistical Framework
952 -- 966Joseph Szurley, Alexander Bertrand, Bas van Dijk, Marc Moonen. Binaural Noise Cue Preservation in a Binaural Noise Reduction System With a Remote Microphone Signal
967 -- 977Xiao-lei Zhang, DeLiang Wang. A Deep Ensemble Learning Method for Monaural Speech Separation
978 -- 989Haotian Xu, Haotian Ou. Scalable Discovery of Audio Fingerprint Motifs in Broadcast Streams With Determinantal Point Process Based Motif Clustering

Volume 24, Issue 4

612 -- 622Peifeng Li, Guodong Zhou. Joint Argument Inference in Chinese Event Extraction with Argument Consistency and Event Relevance
623 -- 630Jianming Liu, Steven L. Grant. Proportionate Adaptive Filtering for Block-Sparse System Identification
631 -- 644Jesper Rindom Jensen, Jacob Benesty, Mads Græsbøll Christensen. Noise Reduction with Optimal Variable Span Linear Filters
645 -- 658Sidsel Marie Nørholm, Jesper Rindom Jensen, Mads Græsbøll Christensen. Enhancement and Noise Statistics Estimation for Non-Stationary Voiced Speech
659 -- 668Daryush D. Mehta, Jarrad H. Van Stan, Robert E. Hillman. Relationships Between Vocal Function Measures Derived from an Acoustic Microphone and a Subglottal Neck-Surface Accelerometer
669 -- 679Herman Kamper, Aren Jansen, Sharon Goldwater. Unsupervised Word Segmentation and Lexicon Discovery Using Acoustic Word Embeddings
680 -- 693Ina Kodrasi, Simon Doclo. Joint Dereverberation and Noise Reduction Based on Acoustic Multi-Channel Equalization
694 -- 707Hamid Palangi, Li Deng, Yelong Shen, Jianfeng Gao, Xiaodong He, Jianshu Chen, Xinying Song, Rabab K. Ward. Deep Sentence Embedding Using Long Short-Term Memory Networks: Analysis and Application to Information Retrieval
708 -- 718Michael Jeffet, Noam R. Shabtai, Boaz Rafaely. Theory and Perceptual Evaluation of the Binaural Reproduction and Beamforming Tradeoff in the Generalized Spherical Array Beamformer
719 -- 732Pablo Peso Parada, Dushyant Sharma, Jose Lainez, Daniel Barreda, Toon van Waterschoot, Patrick A. Naylor. A Single-Channel Non-Intrusive C50 Estimator Correlated With Speech Recognition Performance
733 -- 744Ming-Hsiang Su, Chung-Hsien Wu, Yu-Ting Zheng. Exploiting Turn-Taking Temporal Evolution for Personality Trait Perception in Dyadic Conversations
745 -- 754Sadaf Abdul-Rauf, Holger Schwenk, Patrik Lambert, Mohammad Nawaz. Empirical Use of Information Retrieval to Build Synthetic Data for SMT Domain Adaptation
755 -- 767Shinnosuke Takamichi, Tomoki Toda, Alan W. Black, Graham Neubig, Sakriani Sakti, Satoshi Nakamura. Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis
768 -- 783Zhizheng Wu, Phillip L. De Leon, Cenk Demiroglu, Ali Khodabakhsh, Simon King, Zhen-Hua Ling, Daisuke Saito, Bryan Stewart, Tomoki Toda, Mirjam Wester, Junichi Yamagishi. Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance
784 -- 795Kristian Timm Andersen, Marc Moonen. Adaptive Time-Frequency Analysis for Noise Reduction in an Audio Filter Bank With Low Delay
796 -- 806Zhong-qiu Wang, DeLiang Wang. A Joint Training Framework for Robust Automatic Speech Recognition
807 -- 822Huy Phan, Lars Hertel, Marco Maaß, Radoslaw Mazur, Alfred Mertins. Learning Representations for Nonspeech Audio Events Through Their Similarities to Speech Patterns

Volume 24, Issue 3

409 -- 421Reinhard Sonnleitner, Gerhard Widmer. Robust Quad-Based Audio Fingerprinting
422 -- 431Li Dong, Furu Wei, Ke Xu, Shixia Liu, Ming Zhou. Adaptive Multi-Compositionality for Recursive Neural Network Models
432 -- 444Zheng Lin, Xiaolong Jin, Xueke Xu, Yuanzhuo Wang, Xueqi Cheng, Weiping Wang, Dan Meng. An Unsupervised Cross-Lingual Topic Model Framework for Sentiment Classification
445 -- 458Anil M. Nagathil, Claus Weihs, Rainer Martin. Spectral Complexity Reduction of Music Signals for Mitigating Effects of Cochlear Hearing Loss
459 -- 468Tian Tan, Yanmin Qian, Kai Yu. Cluster Adaptive Training for Deep Neural Network Based Acoustic Model
469 -- 482Leijon Leijon, Gustav Eje Henter, Martin Dahlquist. Bayesian Analysis of Phoneme Confusion Matrices
483 -- 492Donald S. Williamson, Yuxuan Wang, DeLiang Wang. Complex Ratio Masking for Monaural Speech Separation
493 -- 503Johannes Traa, David Wingate, Noah D. Stein, Paris Smaragdis. Robust Source Localization and Enhancement With a Probabilistic Steered Response Power Model
504 -- 517Sven Ewan Shepstone, Kong-Aik Lee, Haizhou Li, Zheng-Hua Tan, Søren Holdt Jensen. Total Variability Modeling Using Source-Specific Priors
518 -- 529Martin Schneider, Walter Kellermann. Multichannel Acoustic Echo Cancellation in the Wave Domain With Increased Robustness to Nonuniqueness
530 -- 542Ken O'Hanlon, Hidehisa Nagano, Nicolas Keriven, Mark D. Plumbley. Non-Negative Group Sparsity with Subspace Note Modelling for Polyphonic Transcription
543 -- 558Elior Hadad, Simon Doclo, Sharon Gannot. The Binaural LCMV Beamformer and its Performance Analysis
559 -- 570Felipe Grijalva, Luiz Martini, Dinei Florêncio, Siome Goldenstein. A Manifold Learning Approach for Personalizing HRTFs from Anthropometric Features
571 -- 582Lin Wang, Simon Doclo. Correlation Maximization-Based Sampling Rate Offset Estimation for Distributed Microphone Arrays
583 -- 593Nasim Radmanesh, Ian S. Burnett, Bhaskar D. Rao. A Lasso-LS Optimization with a Frequency Variable Dictionary in a Multizone Sound System
594 -- 607Xin Liu, Changchun Bao. Audio Bandwidth Extension Based on Ensemble Echo State Networks with Temporal Evolution

Volume 24, Issue 2

215 -- 225Eugen Rasumow, Martin Hansen, Steven van de Par, Dirk Puschel, Volker Mellert, Simon Doclo, Matthias Blau. Regularization Approaches for Synthesizing HRTF Directivity Patterns
226 -- 235Chao Pan, Jacob Benesty, Jingdong Chen. Design of Directivity Patterns with a Unique Null of Maximum Multiplicity
236 -- 251Jeih-Weih Hung, Hsin-Ju Hsieh, Berlin Chen. Robust Speech Recognition via Enhancing the Complex-Valued Acoustic Spectrum in Modulation Domain
252 -- 264Xiao-lei Zhang, DeLiang Wang. Boosting Contextual Information for Deep Neural Network Based Voice Activity Detection
265 -- 275M. A. Tugtekin Turan, Engin Erzin. Source and Filter Estimation for Throat-Microphone Speech Enhancement
276 -- 289Nasser Mohammadiha, Simon Doclo. Speech Dereverberation Using Non-Negative Convolutive Transfer Function and Spectro-Temporal Modeling
290 -- 299Anil Sharma, Sanjit Kaul. Two-Stage Supervised Learning-Based Method to Detect Screams and Cries in Urban Environments
300 -- 315Xiaoguang Wu, Huawei Chen. Directivity Factors of the First-Order Steerable Differential Array With Microphone Mismatches: Deterministic and Worst-Case Analysis
316 -- 328Andreas I. Koutrouvelis, George P. Kafentzis, Nikolay D. Gaubitch, Richard Heusdens. A Fast Method for High-Resolution Voiced/Unvoiced Detection and Glottal Closure/Opening Instant Estimation of Speech
329 -- 339Tomohiko Nakamura, Eita Nakamura, Shigeki Sagayama. Real-Time Audio-to-Score Alignment of Music Performances Containing Errors and Arbitrary Repeats and Skips
340 -- 353Adrian Bahne, Anders Ahlén. Optimizing the Similarity of Loudspeaker-Room Responses in Multiple Listening Positions
354 -- 365James M. Kates, Kathryn Hoberg Arehart. The Hearing-Aid Audio Quality Index (HAAQI)
366 -- 377Henning F. Schepker, Simon Doclo. A Semidefinite Programming Approach to Min-max Estimation of the Common Part of Acoustic Feedback Paths in Hearing Aids
378 -- 387Bong-Ki Lee, Joon-Hyuk Chang. Packet Loss Concealment Based on Deep Neural Networks for Digital Speech Transmission
388 -- 399Luisa Bentivogli, Nicola Bertoldi, Mauro Cettolo, Marcello Federico, Matteo Negri, Marco Turchi. On the Evaluation of Adaptive Machine Translation for Human Post-Editing

Volume 24, Issue 12

2218 -- 2230Andrea Cogliati, Zhiyao Duan, Brendt Wohlberg. Context-Dependent Piano Music Transcription With Convolutional Sparse Coding
2231 -- 2240Yanmin Qian, Tian Tan, Dong Yu. Neural Network Based Multi-Factor Aware Joint Training for Robust Speech Recognition
2241 -- 2250Lahiru Samarakoon, Khe Chai Sim. Factorized Hidden Layer Adaptation for Deep Neural Network Based Acoustic Modeling
2251 -- 2262Martin Krawczyk-Becker, Timo Gerkmann. On MMSE-Based Estimation of Amplitude and Complex Speech Spectral Coefficients Under Phase-Uncertainty
2263 -- 2276Yanmin Qian, Mengxiao Bi, Tian Tan, Kai Yu. Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition
2277 -- 2287Yi-Chan Wu, Homer H. Chen. Generation of Affective Accompaniment in Accordance With Emotion Flow
2288 -- 2300Mahmood Movassagh, Peter Kabal. Scalable Audio Coding Using Trellis-Based Optimized Joint Entropy Coding and Quantization
2301 -- 2312Milos Cernak, Alexandros Lazaridis, Afsaneh Asaei, Philip N. Garner. Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding
2313 -- 2326David Dov, Ronen Talmon, Israel Cohen. Kernel Method for Voice Activity Detection in the Presence of Transients
2327 -- 2340Jesús Antonio Villalba López, Antonio Miguel, Alfonso Ortega, Eduardo Lleida. Bayesian Networks to Model the Variability of Speaker Verification Scores in Adverse Environments
2341 -- 2353Hardik B. Sailor, Hemant A. Patil. Novel Unsupervised Auditory Filterbank Learning Using Convolutional RBM for Speech Recognition
2354 -- 2367Sidsel Marie Nørholm, Jesper Rindom Jensen, Mads Græsbøll Christensen. Instantaneous Fundamental Frequency Estimation With Optimal Segmentation for Nonstationary Voiced Speech
2368 -- 2376Sheng Zhang, Jiashu Zhang, Hongyu Han. Robust Variable Step-Size Decorrelation Normalized Least-Mean-Square Algorithm and its Application to Acoustic Echo Cancellation
2377 -- 2389Tom Barker, Tuomas Virtanen. Blind Separation of Audio Mixtures Through Nonnegative Tensor Factorization of Modulation Spectrograms
2390 -- 2399Jinxin Liu, XueFeng Chen. Adaptive Compensation of Misequalization in Narrowband Active Noise Equalizer Systems
2400 -- 2413Atsunori Ogawa, Takaaki Hori, Atsushi Nakamura. Estimating Speech Recognition Accuracy Based on Error Type Classification
2414 -- 2424Finnian Kelly, John H. L. Hansen. Score-Aging Calibration for Speaker Verification
2425 -- 2438Bochen Li, Zhiyao Duan. An Approach to Score Following for Piano Performances With the Sustained Effect
2439 -- 2452Niko Moritz, Birger Kollmeier, Jörn Anemüller. Integration of Optimized Modulation Filter Sets Into Deep Neural Networks for Automatic Speech Recognition
2453 -- 2465Simon Leglaive, Roland Badeau, Gaël Richard. Multichannel Audio Source Separation With Probabilistic Reverberation Priors
2466 -- 2480Sakari Tervo. Single Snapshot Detection and Estimation of Reflections From Room Impulse Responses in the Spherical Harmonic Domain
2481 -- 2494Dejan Markovic, Fabio Antonacci, Lucio Bianchi, Stefano Tubaro, Augusto Sarti. Extraction of Acoustic Sources Through the Processing of Sound Field Maps in the Ray Space
2495 -- 2506Pavlos Papadopoulos, Andreas Tsiartas, Shrikanth Narayanan. Long-Term SNR Estimation of Speech Signals in Known and Unknown Channel Conditions
2507 -- 2515Ingo R. Titze, Anil Palaparthi. Sensitivity of Source-Filter Interaction to Specific Vocal Tract Shapes
2516 -- 2530Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot. A Hybrid Approach for Speech Enhancement Using MoG Model and Neural Network Phoneme Classifier
2531 -- 2543Gongping Huang, Jacob Benesty, Jingdong Chen. Superdirective Beamforming Based on the Krylov Matrix

Volume 24, Issue 11

1885 -- 1896Aggelos Gkiokas, Vassilios Katsouros, George Carayannis. Towards Multi-Purpose Spectral Rhythm Features: An Application to Dance Style, Meter and Tempo Estimation
1897 -- 1907Yi-Chin Huang, Chung-Hsien Wu, Si-Ting Weng. Improving Mandarin Prosody Generation Using Alternative Smoothing Techniques
1908 -- 1920Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan, Jesper Jensen. Predicting the Intelligibility of Noisy and Nonlinearly Processed Binaural Speech
1921 -- 1934Qiaoling Zhang, Zhe Chen, Fuliang Yin. Distributed Marginalized Auxiliary Particle Filter for Speaker Tracking in Distributed Microphone Networks
1935 -- 1945Marc Ferras, Srikanth R. Madikeri, Hervé Bourlard. Speaker Diarization and Linking of Meeting Data
1946 -- 1956Yuzong Liu, Katrin Kirchhoff. Graph-Based Semisupervised Learning for Acoustic Modeling in Automatic Speech Recognition
1957 -- 1968Jin Wang, Liang-Chih Yu, K. Robert Lai, Xue-Jie Zhang. Community-Based Weighted Graph Model for Valence-Arousal Prediction of Affective Words
1969 -- 1982Alberto Carini, Stefania Cecchi, Laura Romoli. Robust Room Impulse Response Measurement Using Perfect Sequences for Legendre Nonlinear Filters
1983 -- 1997Sebastian Ewert, Mark B. Sandler. Piano Transcription in the Studio Using an Extensible Alternating Directions Framework
1998 -- 2008Yu-Ren Chien, Hsin-Min Wang, Shyh-Kang Jeng. Alignment of Lyrics With Accompanied Singing Audio Based on Acoustic-Phonetic Vowel Likelihood Modeling
2009 -- 2022Jesper Jensen, Cees H. Taal. An Algorithm for Predicting the Intelligibility of Speech Masked by Modulated Noise Maskers
2023 -- 2031Xiaodong Cui, Vaibhava Goel. Maximum Likelihood Nonlinear Transformations Based on Deep Neural Networks
2032 -- 2045Toru Nakashika, Tetsuya Takiguchi, Yasuhiro Minami. Non-Parallel Training in Voice Conversion Using an Adaptive Restricted Boltzmann Machine
2046 -- 2058I.-Bin Liao, Chen-Yu Chiang, Yih-Ru Wang, Sin-Horng Chen. Speaker Adaptation of SR-HPM for Speaking Rate-Controlled Mandarin TTS
2059 -- 2068Hiroki Ouchi, Kevin Duh, Hiroyuki Shindo, Yuji Matsumoto. Transition-Based Dependency Parsing Exploiting Supertags
2069 -- 2083Tong Xiao, Derek F. Wong, Jingbo Zhu. A Loss-Augmented Approach to Training Syntactic Machine Translation Systems
2084 -- 2095Yukara Ikemiya, Katsutoshi Itoyama, Kazuyoshi Yoshii. Singing Voice Separation and Vocal F0 Estimation Based on Mutual Combination of Robust Principal Component Analysis and Subharmonic Summation
2096 -- 2107Siddharth Sigtia, Adam M. Stark, Sacha Krstulovic, Mark D. Plumbley. Automatic Environmental Sound Recognition: Performance Versus Computational Cost
2108 -- 2121Srinivas Parthasarathy, Roddy Cowie, Carlos Busso. Using Agreement on Direction of Change to Build Rank-Based Emotion Classifiers
2122 -- 2131Jia-Ching Wang, Yuan-Shan Lee, Chang Hong Lin, Shu-fan Wang, Chih-Hao Shih, Chung-Hsien Wu. Compressive Sensing-Based Speech Enhancement
2132 -- 2145Siying Wang, Sebastian Ewert, Simon Dixon. Robust and Efficient Joint Alignment of Multiple Musical Performances
2146 -- 2157Xie Chen, Xunying Liu, Yongqiang Wang, Mark J. F. Gales, Philip C. Woodland. Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition
2158 -- 2170Ping-Keng Jao, Li Su, Yi-Hsuan Yang, Brendt Wohlberg. Monaural Music Source Separation Using Convolutional Sparse Coding
2171 -- 2186Xiaofei Li, Laurent Girin, Radu Horaud, Sharon Gannot. Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization
2187 -- 2199Duc Le, Keli Licata, Carol Persad, Emily Mower Provost. Automatic Assessment of Speech Intelligibility for Individuals With Aphasia
2200 -- 2213Thijs van de Laar, Bert de Vries. A Probabilistic Modeling Approach to Hearing Loss Compensation

Volume 24, Issue 10

1681 -- 1693James Eaton, Nikolay D. Gaubitch, Alastair H. Moore, Patrick A. Naylor. Estimation of Room Acoustic Parameters: The ACE Challenge
1694 -- 1704Takashi Nose. Efficient Implementation of Global Variance Compensation for Parametric Speech Synthesis
1705 -- 1720Shabnam Ghaffarzadegan, Hynek Boril, John H. L. Hansen. Generative Modeling of Pseudo-Whisper for Robust Whispered Speech Recognition
1721 -- 1731Seyedmahdad Mirsamadi, John H. L. Hansen. A Generalized Nonnegative Tensor Factorization Approach for Distant Speech Recognition With Distributed Microphones
1732 -- 1745Laura Fuster, Maria de Diego, Luis Antonio Azpicueta-Ruiz, Miguel Ferrer. Adaptive Filtered-x Algorithms for Room Equalization Based on Block-Based Combination Schemes
1746 -- 1758Kamil Adiloglu, Emmanuel Vincent. Variational Bayesian Inference for Source Separation and Robust Feature Extraction
1759 -- 1772Steffen Kortlang, Giso Grimm, Volker Hohmann, Birger Kollmeier, Stephan Dieter Ewert. Auditory Model-Based Dynamic Compression Controlled by Subband Instantaneous Frequency and Speech Presence Probability Estimates
1773 -- 1784Pawel Swietojanski, Steve Renals. Differentiable Pooling for Unsupervised Acoustic Model Adaptation
1785 -- 1795Kenta Niwa, Yusuke Hioka, Kazunori Kobayashi. Optimal Microphone Array Observation for Clear Recording of Distant Sound Sources
1796 -- 1807Nicolas Epain, Craig T. Jin. Spherical Harmonic Signal Covariance and Sound Field Diffuseness
1808 -- 1818Tudor-Catalin Zorila, Yannis Stylianou, Tatsuma Ishihara, Masami Akamine. Near and Far Field Speech-in-Noise Intelligibility Improvements Based on a Time-Frequency Energy Reallocation Approach
1819 -- 1830Xi Ma, Dong Wang, Javier Tejedor. Similar Word Model for Unfrequent Word Enhancement in Speech Recognition
1831 -- 1841Mohammad Hadi Bokaei, Hossein Sameti, Yang Liu. Summarizing Meeting Transcripts Based on Functional Segmentation
1842 -- 1853Jiajun Zhang, Yu Zhou, Chengqing Zong. Abstractive Cross-Language Summarization via Translation Model Enhanced Predicate Argument Structure Fusing
1854 -- 1864Grégoire Lafay, Mathieu Lagrange, Mathias Rossignol, Emmanouil Benetos, Axel Roebel. A Morphological Model for Simulating Acoustic Scenes and Its Application to Sound Event Detection
1865 -- 1875An Ji, Michael T. Johnson, Jeffrey J. Berry. Parallel Reference Speaker Weighting for Kinematic-Independent Acoustic-to-Articulatory Inversion

Volume 24, Issue 1

5 -- 15Sandrine Brognaux, Thomas Drugman. HMM-Based Speech Segmentation: Improvements of Fully Automatic Approaches
16 -- 28Marie Tahon, Laurence Devillers. Towards a Small Set of Robust Acoustic Features for Emotion Recognition: Challenges
29 -- 41Hamid Behravan, Ville Hautamäki, Sabato Marco Siniscalchi, Tomi Kinnunen, Chin-Hui Lee. i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition
42 -- 53Rahim Saeidi, Paavo Alku, Tom Bäckström. Feature Extraction Using Power-Law Adjusted Linear Prediction With Application to Speaker Recognition Under Severe Vocal Effort Mismatch
54 -- 64Iman Tabatabaei Ardekani, Jari P. Kaipio, Alireza Nasiri, Hamid Sharifzadeh, Waleed H. Abdulla. A Statistical Inverse Problem Approach to Online Secondary Path Modeling in Active Noise Control
65 -- 78Themos Stafylakis, Patrick Kenny, Md. Jahangir Alam, Marcel Kockmann. Speaker and Channel Factors in Text-Dependent Speaker Recognition
79 -- 92Yanzhang He, Peter Baumann, Hao Fang, Brian Hutchinson, Aaron Jaech, Mari Ostendorf, Eric Fosler-Lussier, Janet B. Pierrehumbert. Using Pronunciation-Based Morphological Subword Units to Improve OOV Handling in Keyword Search
93 -- 104Meng Sun, Xiongwei Zhang, Hugo Van Hamme, Thomas Fang Zheng. Unseen Noise Estimation Using Separable Deep Auto Encoder for Speech Enhancement
105 -- 116Luciana Ferrer, Yun Lei, Mitchell McLaren, Nicolas Scheffer. Study of Senone-Based Deep Neural Network Approaches for Spoken Language Recognition
117 -- 129Stefan Ingi Adalbjornsson, Ted Kronvall, Simon Burgess, Kalle Åström, Andreas Jakobsson. Sparse Localization of Harmonic Audio Sources
130 -- 142Man-Wai Mak, Xiaomin Pang, Jen-Tzung Chien. Mixture of PLDA for Noise Robust I-Vector Speaker Verification
143 -- 150Craig A. Anderson, Paul D. Teal, Mark A. Poletti. Spatial Correlation of Radial Gaussian and Uniform Spherical Volume Near-Field Source Distributions
151 -- 160Humberto M. Torres, Jorge A. Gurlekian. Novel Estimation Method for the Superpositional Intonation Model
161 -- 173Stefan Bilbao, Brian Hamilton, Jonathan Botts, Lauri Savioja. Finite Volume Time Domain Room Acoustics Simulation under General Impedance Boundary Conditions
174 -- 184Amir Hossein Harati Nejad Torbati, Joseph Picone. A Doubly Hierarchical Dirichlet Process Hidden Markov Model with a Non-Ergodic Structure
185 -- 195Jen-Tzung Chien, Po-Kai Yang. Bayesian Factorization and Learning for Monaural Source Separation
196 -- 210David Lou Alon, Boaz Rafaely. Beamforming with Optimal Aliasing Cancellation in Spherical Microphone Arrays