Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 18, Issue 8

1889 -- 1901Ozlem Kalinli, Michael L. Seltzer, Jasha Droppo, Alex Acero. Noise Adaptive Training for Robust Automatic Speech Recognition
1902 -- 1912G. N. Lilis, Daniele Angelosante, Georgios B. Giannakis. Sound Field Reproduction using the Lasso
1913 -- 1928Wenyi Zhang, Bhaskar D. Rao. A Two Microphone-Based Approach for Source Localization of Multiple Speech Sources
1929 -- 1940Damián Marelli, Mitsuko Aramaki, Richard Kronland-Martinet, Charles Verron. Time-Frequency Synthesis of Noisy Sounds With Narrow Spectral Components
1941 -- 1954Songfang Huang, Steve Renals. Hierarchical Bayesian Language Models for Conversational Speech Recognition
1955 -- 1967Emmanouil Benetos, Constantine Kotropoulos. Non-Negative Tensor Factorization Applied to Music Genre Classification
1968 -- 1977Emmanouil Benetos, Yannis Stylianou. Auditory Spectrum-Based Pitched Instrument Onset Detection
1978 -- 1993Jian Liu, Yegui Xiao, Jinwei Sun, Li Xu. Analysis of Online Secondary-Path Modeling With Auxiliary Noise Scaled by Residual Noise Signal
1994 -- 2003Chi-Chun Hsia, Chung-Hsien Wu, Jung-Yun Wu. Exploiting Prosody Hierarchy and Dynamic Features for Pitch Modeling and Generation in HMM-Based Speech Synthesis
2004 -- 2014Huijun Ding, Ing Yann Soon, Chai Kiat Yeo. Over-Attenuated Components Regeneration for Speech Enhancement
2015 -- 2027A. Reddy, R. C. Rose. Integration of Statistical Models for Dictation of Document Translations in a Machine-Aided Human Translation Task
2028 -- 2037David Imseng, Gerald Friedland. Tuning-Robust Initialization Methods for Speaker Diarization
2038 -- 2050Jens Ahrens, Sascha Spors. Sound Field Reproduction Using Planar and Linear Arrays of Loudspeakers
2051 -- 2066Gregory Sell, Malcolm Slaney. Solving Demodulation as an Optimization Problem
2067 -- 2079Guoning Hu, DeLiang L. Wang. A Tandem Algorithm for Pitch Estimation and Voiced Speech Segregation
2080 -- 2090Gibak Kim, Philipos C. Loizou. Improving Speech Intelligibility in Noise Using Environment-Optimized Algorithms
2091 -- 2098M. Kleider, Boaz Rafaely, Barak Weiss, Eitan Bachmat. Golden-Ratio Sampling for Scanning Circular Microphone Arrays
2099 -- 2110Seokhwan Jo, Chang D. Yoo. Psychoacoustically Constrained and Distortion Minimized Speech Enhancement
2111 -- 2120Wooil Kim, John H. L. Hansen. Missing-Feature Reconstruction by Leveraging Temporal Spectral Correlation for Robust Speech Recognition in Background Noise Conditions
2121 -- 2133Zhiyao Duan, Bryan Pardo, Changshui Zhang. Multiple Fundamental Frequency Estimation by Modeling Spectral Peaks and Non-Peak Regions
2134 -- 2144Nikoletta Bassiou, Vassiliki Moschou, Constantine Kotropoulos. Speaker Diarization Exploiting the Eigengap Criterion and Cluster Ensembles
2145 -- 2154V. Rao, P. Rao. Vocal Melody Extraction in the Presence of Pitched Accompaniment in Polyphonic Music
2155 -- 2167Brady Laska, Miodrag Bolic, Rafik A. Goubran. Particle Filter Enhancement of Speech Spectral Amplitudes

Volume 18, Issue 7

1673 -- 1675Tomohiro Nakatani, Walter Kellermann, P. Naylor, Masato Miyoshi, Biing-Hwang Juang. Introduction to the Special Issue on Processing Reverberant Speech: Methodologies and Applications
1676 -- 1691Armin Sehr, Roland Maas, Walter Kellermann. Reverberation Model-Based Decoding in the Logmelspec Domain for Robust Distant-Talking Speech Recognition
1692 -- 1707Alexander Krueger, Reinhold Haeb-Umbach. Model-Based Feature Enhancement for Reverberant Speech Recognition
1708 -- 1716R. Gomez, T. Kawahara. Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood
1717 -- 1731Tomohiro Nakatani, Takuya Yoshioka, Keisuke Kinoshita, Masato Miyoshi, Biing-Hwang Juang. Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction
1732 -- 1745Marco Jeub, M. Schafer, Thomas Esch, Peter Vary. Model-Based Dereverberation Preserving Binaural Cues
1746 -- 1765Jan S. Erkelens, Richard Heusdens. Correlation-Based and Model-Based Blind Single-Channel Late-Reverberation Suppression in Noisy Time-Varying Acoustical Environments
1766 -- 1774Tiago H. Falk, Chenxi Zheng, Wai-Yip Chan. A Non-Intrusive Quality and Intelligibility Measure of Reverberant and Dereverberated Speech
1775 -- 1780Takayuki Arai, Nao Hodoshima, K. Yasu. Using Steady-State Suppression to Improve Speech Intelligibility in Reverberant Environments for Elderly Listeners
1781 -- 1792Flavio Ribeiro, Cha Zhang, Dinei A. F. Florêncio, Demba E. Ba. Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization
1793 -- 1805Yan-Chen Lu, M. Cooke. Binaural Estimation of Sound Source Distance via the Direct-to-Reverberant Energy Ratio for Static and Moving Sources
1806 -- 1817F. Talantzis. An Acoustic Source Localization and Tracking Framework Using Particle Filtering and Information Theory
1818 -- 1829M. Kowalski, Emmanuel Vincent, Rémi Gribonval. Beyond the Narrowband Approximation: Wideband Convex Methods for Under-Determined Reverberant Audio Source Separation
1830 -- 1840Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gribonval. Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model
1841 -- 1855Alireza Masnadi-Shirazi, Wenyi Zhang, Bhaskar D. Rao. Glimpsing IVA: A Framework for Overcomplete/Complete/Undercomplete Convolutive Source Separation
1856 -- 1866John Woodruff, DeLiang Wang. Sequential Organization of Speech in Reverberant Environments by Integrating Monaural Grouping and Binaural Localization
1867 -- 1871C. Hummersone, R. Mason, T. Brookes. Dynamic Precedence Effect Modeling for Source Separation in Reverberant Environments
1872 -- 1883Michael I. Mandel, S. Bressler, Barbara G. Shinn-Cunningham, Daniel P. W. Ellis. Evaluating Source Separation Algorithms With Reverberant Speech

Volume 18, Issue 6

1094 -- 1106Hamed Ketabdar, Hervé Bourlard. Enhanced Phone Posteriors for Improving Speech Recognition Systems
1107 -- 1115Sergio Canazza, Giovanni De Poli, Gian Antonio Mian. Restoration of Audio Documents by Means of Extended Kalman Filter
1116 -- 1126Chunghsin Yeh, Axel Röbel, Xavier Rodet. Multiple Fundamental Frequency Estimation and Polyphony Inference of Polyphonic Music Signals
1127 -- 1136Jiucang Hao, Te-Won Lee, Terrence J. Sejnowski. Speech Enhancement Using Gaussian Scale Mixture Models
1137 -- 1146R. Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen. Integrated Active Noise Control and Noise Reduction in Hearing Aids
1147 -- 1157Justin Jian Zhang, Ricky Ho Yin Chan, Pascale Fung. Extractive Speech Summarization Using Shallow Rhetorical Structure Modeling
1158 -- 1169Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li, Chin-Hui Lee. A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition
1170 -- 1181Chung-Hsien Wu, Chao-Hong Liu, M. Harris, Liang-Chih Yu. Sentence Correction Incorporating Relative Position and Parse Template Language Models
1182 -- 1192Robbie Vogt, Sridha Sridharan, Michael Mason. Making Confident Speaker Verification Decisions With Minimal Speech
1193 -- 1207Dimitri Nion, Kleanthis N. Mokios, Nicholas D. Sidiropoulos, Alexandros Potamianos. Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures
1208 -- 1217Vaclav Eksler, Milan Jelinek. Glottal-Shape Codebook to Improve Robustness of CELP Codecs
1218 -- 1227Moo-young Kim, W. Bastiaan Kleijn. Reduction of the Impact of Distortion Outliers and Source Mismatch in Resolution-Constrained Quantization
1228 -- 1242Maurice F. Fallon, Simon J. Godsill. Acoustic Source Localization and Tracking Using Track Before Detect
1243 -- 1257Xiaoqiang Xiao, Robert M. Nickel. Speech Enhancement With Inventory Style Speech Resynthesis
1258 -- 1268Angel M. Gomez, Jose L. Carmona, Antonio M. Peinado, Victoria E. Sánchez. A Multipulse-Based Forward Error Correction Technique for Robust CELP-Coded Speech Transmission Over Erasure Channels
1269 -- 1279Matt Gibson 0002, Thomas Hain. Error Approximation and Minimum Phone Error Acoustic Model Estimation
1280 -- 1289Matthias Mauch, Simon Dixon. Simultaneous Estimation of Chords and Musical Context From Audio
1290 -- 1299Jingen Ni, Feng Li. A Variable Step-Size Matrix Normalized Subband Adaptive Filter
1300 -- 1312Chang Huai You, Kong-Aik Lee, Haizhou Li. GMM-SVM Kernel With a Bhattacharyya-Based Distance for Speaker Recognition
1313 -- 1322Ilknur Durgar El-Kahlout, Kemal Oflazer. Exploiting Morphology and Local Word Reordering in English-to-Turkish Phrase-Based Statistical Machine Translation
1323 -- 1331Jingbo Zhu, Huizhen Wang, Benjamin K. Tsou, Matthew Y. Ma. Active Learning With Sampling by Uncertainty and Density for Data Annotations
1332 -- 1340Chi-Sang Jung, Moo-young Kim, Hong-Goo Kang. Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information
1341 -- 1353Jose L. Carmona, Antonio M. Peinado, José L. Pérez-Córdoba, Angel M. Gomez. MMSE-Based Packet Loss Concealment for CELP-Coded Speech Recognition
1354 -- 1365Kentaro Ishizuka, Shoko Araki, T. Kawahara. Speech Activity Detection for Multi-Party Conversation Analyses Based on Likelihood Ratio Test on Spatial Magnitude
1366 -- 1378Marc Ferras, Cheung Chi Leung, Claude Barras, Jean-Luc Gauvain. Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition
1379 -- 1393Hynek Boril, John H. L. Hansen. Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments
1394 -- 1405Chung-Hsien Wu, Chi-Chun Hsia, Chung-Han Lee, Mai-Chun Lin. Hierarchical Prosody Conversion Using Regression-Based Clustering for Emotional Speech Synthesis
1406 -- 1416Keansub Lee, Daniel P. W. Ellis. Audio-Based Semantic Concept Classification for Consumer Video
1417 -- 1428C. Tantibundhit, Franz Pernkopf, Gernot Kubin. Joint Time-Frequency Segmentation Algorithm for Transient Speech Decomposition and Speech Enhancement
1429 -- 1439Eric A. Lehmann, Anders M. Johansson. Diffuse Reverberation Model for Efficient Image-Source Simulation of Room Impulse Responses
1440 -- 1454Ø. Birkenes, T. Matsui, K. Tanabe, Sabato Marco Siniscalchi, Tor André Myrvoll, M. H. Johnsen. Penalized Logistic Regression With HMM Log-Likelihood Regressors for Speech Recognition
1455 -- 1463Jerome R. Bellegarda. A Dynamic Cost Weighting Framework for Unit Selection Text-to-Speech Synthesis
1464 -- 1475Mathieu Parvaix, Laurent Girin, Jean-Marc Brossier. A Watermarking-Based Method for Informed Source Separation of Audio Signals With a Single Sensor
1476 -- 1485Hirofumi Nakajima, Kazuhiro Nakadai, Yuji Hasegawa, Hiroshi Tsujino. Blind Source Separation With Parameter-Free Adaptive Step-Size Method for Robot Audition
1486 -- 1495M. Akbacak, John H. L. Hansen. Spoken Proper Name Retrieval for Limited Resource Languages Using Multilingual Hybrid Representations
1496 -- 1506Mitchell McLaren, Robbie Vogt, Brendan Baker, Sridha Sridharan. Data-Driven Background Dataset Selection for SVM-Based Speaker Verification
1507 -- 1516Hirokazu Kameoka, Nobutaka Ono, Shigeki Sagayama. Speech Spectrum Modeling for Joint Estimation of Spectral Envelope and Fundamental Frequency
1517 -- 1527Andre Holzapfel, Yannis Stylianou, Ali Cenk Gedik, Baris Bozkurt. Three Dimensions of Pitched Instrument Onset Detection
1528 -- 1538Panikos Heracleous, V.-A. Tran, T. Nagai, Kiyohiro Shikano. Analysis and Recognition of NAM Speech Using HMM Distances and Visual Information
1539 -- 1549Yuya Akita, Tatsuya Kawahara. Statistical Transformation of Language and Pronunciation Models for Spontaneous Speech Recognition
1550 -- 1561Charles Verron, Mitsuko Aramaki, Richard Kronland-Martinet, Grégory Pallone. A 3-D Immersive Synthesizer for Environmental Sounds
1562 -- 1574Yi-Cheng Pan, Lin-Shan Lee. Performance Analysis for Lattice-Based Speech Indexing Approaches Using Words and Subword Units
1575 -- 1587Mehrez Souden, Jacob Benesty, Sofiène Affes. Broadband Source Localization From an Eigenanalysis Perspective
1588 -- 1600Péter Mihajlik, Zoltán Tüske, B. Tarjan, Bottyán Németh, Tibor Fegyó. Improved Recognition of Spontaneous Hungarian Speech - Morphological and Acoustic Modeling Techniques for a Less Resourced Task
1601 -- 1611Gökhan Tür, Andreas Stolcke, L. Voss, Stanley Peters, Dilek Hakkani-Tür, John Dowding, Benoît Favre, Raquel Fernández, Matthew Frampton, Michael W. Frandsen, C. Frederickson, Martin Graciarena, D. Kintzing, K. Leveque, S. Mason, John Niekrasz, Matthew Purver, Korbinian Riedhammer, Elizabeth Shriberg, Jing Tien, Dimitra Vergyri, Fan Yang. The CALO Meeting Assistant System
1612 -- 1623Bengt J. Borgstrom, Abeer Alwan. HMM-Based Reconstruction of Unreliable Spectrographic Data for Noise Robust Speech Recognition
1624 -- 1631Sriram Ganapathy, Petr Motlícek, Hynek Hermansky. Autoregressive Models of Amplitude Modulations in Audio Compression
1632 -- 1642Hyeon-Jin Jeon, Tae-Gyu Chang, Sen M. Kuo. Analysis of Frequency Mismatch in Narrowband Active Noise Control
1643 -- 1654Valentin Emiya, Roland Badeau, Bertrand David. Multipitch Estimation of Piano Sounds Using a New Probabilistic Spectral Smoothness Principle
1655 -- 1666Thushara D. Abhayapala, Aastha Gupta. Spherical Harmonic Analysis of Wavefields Using Multiple Circular Sensor Arrays

Volume 18, Issue 5

909 -- 911Yannis Stylianou, T. Toda, C. H. Wu, Alexander Kain, Olivier Rosec. Introduction to the Special Section on Voice Transformation
912 -- 921Elina Helander, Tuomas Virtanen, Jani Nurminen, Moncef Gabbouj. Voice Conversion Using Partial Least Squares Regression
922 -- 931Daniel Erro, Asunción Moreno, Antonio Bonafonte. Voice Conversion Based on Weighted Frequency Warping
932 -- 943Jianhua Tao, Meng Zhang, Jani Nurminen, Jilei Tian, Xia Wang. Supervisory Data Alignment for Text-Independent Voice Conversion
944 -- 953Daniel Erro, Asunción Moreno, Antonio Bonafonte. INCA Algorithm for Training Voice Conversion Systems From Nonparallel Corpora
954 -- 964Srinivas Desai, Alan W. Black, B. Yegnanarayana, Kishore Prahallad. Spectral Mapping Using Artificial Neural Networks for Voice Conversion
965 -- 973Oytun Türk, Marc Schröder. Evaluation of Expressive Speech Synthesis With Voice Conversion and Copy Resynthesis Techniques
974 -- 983Daniel Erro, Eva Navas, Inmaculada Hernáez, Ibon Saratxaga. Emotion Conversion Based on Prosodic Unit Selection
984 -- 1004Junichi Yamagishi, B. Usabaev, Simon King, O. Watts, J. Dines, Jilei Tian, Yong Guan, Rile Hu, Keiichiro Oura, Yi-Jian Wu, Keiichi Tokuda, Reima Karhila, Mikko Kurimo. Thousands of Voices for HMM-Based Speech Synthesis-Analysis and Application of TTS Systems Built on Various ASR Corpora
1005 -- 1016O. Watts, J. Yamagishi, Simon King, K. Berkling. Synthesis of Child Speech With HMM Adaptation and Voice Conversion
1017 -- 1029Purvis Bedenbaugh, Diana K. Sarko, Heidi L. Roth, Eugene M. Martin. Prosody-Preserving Voice Transformation to Evaluate Brain Representations of Speech Sounds
1030 -- 1040Daniel Felps, Ricardo Gutierrez-Osuna. Developing Objective Measures of Foreign-Accent Conversion
1041 -- 1052Carlos Molina, Néstor Becerra Yoma, Fernando Huenupán, Claudio Garretón, Jorge Wuth. Maximum Entropy-Based Reinforcement Learning Using a Confidence Measure in Speech Recognition for Telephone Speech
1053 -- 1062Xugang Lu, Jianwu Dang. Vowel Production Manifold: Intrinsic Factor Analysis of Vowel Articulation
1063 -- 1065S. Abdallah. Comment on Unified View of Prediction and Repetition Structure in Audio Signals With Application to Interest Point Detection
1065 -- 1068Cong-Thanh Do, Dominique Pastor, André Goalic. On the Recognition of Cochlear Implant-Like Spectrally Reduced Speech With MFCC and HMM-Based ASR
1068 -- 1071Parham Mokhtari, Hironori Takemoto, Ryouichi Nishimura, Hiroaki Kato. Optimum Loss Factor for a Perfectly Matched Layer in Finite-Difference Time-Domain Acoustic Simulation
1072 -- 1077Mehrez Souden, Jingdong Chen, Jacob Benesty, Sofiène Affes. Gaussian Model-Based Multichannel Speech Presence Probability
1077 -- 1082S. Tiomkin, D. Malah, Salva Shechtman. Statistical Text-to-Speech Synthesis Based on Segment-Wise Representation With a Norm Constraint
1082 -- 1086Claudio Garretón, Néstor Becerra Yoma, M. Torres. Channel Robust Feature Transformation Based on Filter-Bank Energy Filtering

Volume 18, Issue 4

713 -- 714Vesa Välimäki, Federico Fontana, Julius O. Smith, Udo Zölzer. Introduction to the Special Issue on Virtual Analog Audio Effects and Musical Instruments
715 -- 727Giovanni De Sanctis, Augusto Sarti. Virtual Analog Modeling in the Wave-Digital Domain
728 -- 737David T. Yeh, Jonathan S. Abel, Julius O. Smith. Automated Physical Modeling of Nonlinear Audio Circuits For Real-Time Audio Effects - Part I: Theoretical Development
738 -- 746Jyri Pakarinen, Matti Karjalainen. Enhanced Wave Digital Triode Model for Real-Time Tube Amplifier Emulation
747 -- 759Thomas Hélie. Volterra Series and State Transformation for Real-Time Simulations of Audio Circuits Including Saturations: Application to the Moog Ladder Filter
760 -- 772Federico Fontana, Marco Civolani. Modeling of the EMS VCS3 Voltage-Controlled Filter as a Nonlinear Filter Network
773 -- 785Juhan Nam, Vesa Välimäki, Jonathan S. Abel, Julius O. Smith. Efficient Antialiasing Oscillator Algorithms Using Low-Order Fractional Delay Filters
786 -- 798Vesa Välimäki, Juhan Nam, Julius O. Smith, Jonathan S. Abel. Alias-Suppressed Oscillators Based on Differentiated Polynomial Waveforms
799 -- 808Stefan Bilbao, Julian Parker. A Virtual Model of Spring Reverberation
809 -- 821Balázs Bank, Stefano Zambon, Federico Fontana. A Modal-Based Real-Time Piano Synthesizer
822 -- 832Gianpaolo Evangelista, Fredrik Eckerholm. Player-Instrument Interaction Models for Digital Waveguide Synthesis of Guitar: Touch and Collisions
833 -- 842Nelson Lee, Julius O. Smith, Vesa Välimäki. Analysis and Synthesis of Coupled Vibrating Strings Using a Hybrid Modal-Waveguide Synthesis Model
843 -- 854Rémi Mignot, Thomas Hélie, Denis Matignon. Digital Waveguide Modeling for Wind Instruments: Building a State-Space Representation Based on the Webster-Lokshin Model
855 -- 871Esteban Maestre, Merlijn Blaauw, Jordi Bonada, Enric Guaus, Alfonso Perez. Statistical Modeling of Bowing Control Applied to Violin Sound Synthesis
872 -- 880Stefan Bilbao. Percussion Synthesis Based on Models of Nonlinear Shell Vibration
881 -- 890Rudolf Rabenstein, Tilman Koch, Christian Popp. Tubular Bells: A Physical and Algorithmic Model
891 -- 902Federico Avanzini, Riccardo Marogna. A Modular Physically Based Approach to the Sound Synthesis of Membrane Percussion Instruments

Volume 18, Issue 3

417 -- 419Bertrand David, Masataka Goto, Laurent Daudet, Paris Smaragdis. Editorial for the Special Issue on Signal Models and Representations of Musical and Environmental Sounds
420 -- 433Vittoria Bruni, Silvia Marconi, Domenico Vitulano. Time-Scale Atoms Chains for Transients Detection in Audio Signals
434 -- 446Emmanuel Ravelli, Gaël Richard, Laurent Daudet. Audio Signal Representations for Indexing in the Transform Domain
447 -- 460Nicolás Ruiz-Reyes, Pedro Vera-Candeas. Adaptive Signal Modeling Based on Sparse Approximations for Scalable Parametric Audio Coding
461 -- 472Bob L. Sturm, John J. Shynk. Sparse Approximation and the Pursuit of Meaningful Signal Models With Interference Adaptation
473 -- 486Julio J. Carabias-Orti, Pedro Vera-Candeas, Francisco J. Cañadas-Quesada, Nicolás Ruiz-Reyes. Music Scene-Adaptive Harmonic Dictionary for Unsupervised Note-Event Detection
487 -- 497Johan Xi Zhang, Mads Græsbøll Christensen, Søren Holdt Jensen, Marc Moonen. A Robust and Computationally Efficient Subspace-Based Fundamental Frequency Estimator
498 -- 508Jeremy Wells, Damian Murphy. A Comparative Evaluation of Techniques for Single-Frame Discrimination of Nonstationary Sinusoids
509 -- 518Mathieu Lagrange, Gary P. Scavone, Philippe Depalle. Analysis/Synthesis of Sounds Generated by Sustained Contact Between Rigid Objects
519 -- 527Paul H. Peeling, Ali Taylan Cemgil, Simon J. Godsill. Generative Spectrogram Factorization Models for Polyphonic Piano Transcription
528 -- 537Emmanuel Vincent, Nancy Bertin, Roland Badeau. Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation
538 -- 549Nancy Bertin, Roland Badeau, Emmanuel Vincent. Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription
550 -- 563Alexey Ozerov, Cédric Févotte. Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
564 -- 575Jean-Louis Durrieu, Gaël Richard, Bertrand David, Cédric Févotte. Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals
576 -- 588Yannis Panagakis, Constantine Kotropoulos, Gonzalo R. Arce. Non-Negative Multilinear Principal Component Analysis of Auditory Temporal Modulations for Music Genre Classification
589 -- 601Onur Dikmen, Ali Taylan Cemgil. Gamma Markov Random Fields for Audio Source Modeling
602 -- 612Luke Barrington, Antoni B. Chan, Gert R. G. Lanckriet. Modeling Music as a Dynamic Texture
613 -- 624Anssi Klapuri, Tuomas Virtanen. Representing Musical Sounds With an Interpolating State Model
625 -- 637Kris West, Stephen Cox. Incorporating Cultural Representations of Features Into Audio Music Similarity Estimation
638 -- 648Hiromasa Fujihara, Masataka Goto, Tetsuro Kitahara, Hiroshi G. Okuno. A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval
649 -- 662Meinard Müller, Sebastian Ewert. Towards Timbre-Invariant Audio Features for Harmony-Based Music
663 -- 674Juan José Burred, Axel Röbel, Thomas Sikora. Dynamic Spectral Envelope Modeling for Timbre Analysis of Musical Instrument Sounds
675 -- 687Geoffroy Peeters, Emmanuel Deruty. Sound Indexing Using Morphological Description
688 -- 707Gordon Wichern, Jiachen Xue, Harvey D. Thornburg, Brandon Mechtley, Andreas Spanias. Segmentation, Indexing, and Retrieval for Environmental and Natural Sounds

Volume 18, Issue 2

213 -- 223Hüseyin Hacihabiboglu, Banu Gunel, Zoran Cvetkovic. Simulation of Directional Microphones in Digital Waveguide Mesh-Based Models of Room Acoustics
224 -- 236Claudius Gläser, Martin Heckmann, Frank Joublin, Christian Goerick. Combining Auditory Preprocessing and Bayesian Estimation for Robust Formant Tracking
237 -- 248Damián Marelli, Péter Balázs. On Pole-Zero Model Estimation Methods Minimizing a Logarithmic Criterion for Speech Analysis
249 -- 259Alfred Mertins, Tiemin Mei, Markus Kallinger. Room Impulse Response Shortening/Reshaping With Infinity- and p -Norm Optimization
260 -- 276Mehrez Souden, Jacob Benesty, Sofiène Affes. On Optimal Frequency-Domain Multichannel Linear Filtering for Noise Reduction
277 -- 285Avram Levi, Harvey F. Silverman. A Robust Method to Extract Talker Azimuth Orientation Using a Large-Aperture Microphone Array
286 -- 295Roberto Napoli, Luigi Piroddi. Nonlinear Active Noise Control With NARX Models
296 -- 309Luis Buera, Antonio Miguel, Oscar Saz, Alfonso Ortega, Eduardo Lleida. Unsupervised Data-Driven Feature Vector Normalization With Acoustic Model Adaptation for Robust Speech Recognition
310 -- 319Chao-Ling Hsu, Jyh-Shing Roger Jang. On the Improvement of Singing Voice Separation for Monaural Recordings Using the MIR-1K Dataset
320 -- 329Ümit Güz, Sébastien Cuendet, Dilek Hakkani-Tür, Gökhan Tür. Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech
330 -- 341Vinay Melkote, Kenneth Rose. Trellis-Based Approaches to Rate-Distortion Optimized Audio Encoding
342 -- 355Bram Cornelis, Simon Doclo, Tim Van den Bogaert, Marc Moonen, Jan Wouters. Theoretical Analysis of Binaural Multimicrophone Noise Reduction Techniques
356 -- 368Wen Jin, Xin Liu, Michael S. Scordilis, Lu Han. Speech Enhancement Using Harmonic Emphasis and Adaptive Comb Filtering
369 -- 381Nathalie Camelin, Frédéric Béchet, Géraldine Damnati, Renato de Mori. Detection and Interpretation of Opinion Expressions in Spoken Surveys
382 -- 394Michael I. Mandel, Ron J. Weiss, Daniel P. W. Ellis. Model-Based Expectation-Maximization Source Separation and Localization
395 -- 406Shinji Watanabe, Atsushi Nakamura. Predictor-Corrector Adaptation by Using Time Evolution System With Macroscopic Time Scale
407 -- 412Alexandros Nanopoulos, Dimitrios Rafailidis, Panagiotis Symeonidis, Yannis Manolopoulos. MusicBox: Personalized Music Recommendation Based on Cubic Analysis of Social Tags

Volume 18, Issue 1

1 -- 0Ali H. Sayed. Free Electronic Access to SP Publications
2 -- 16Dmitry N. Zotkin, Ramani Duraiswami, Nail A. Gumerov. Plane-Wave Decomposition of Acoustical Scenes Via Spherical and Cylindrical Microphone Arrays
17 -- 33Ramdas Kumaresan, Nitesh Panchal. Encoding Bandpass Signals Using Zero/Level Crossings: A Model-Based Approach
34 -- 49Péter Balázs, Bernhard Laback, Gerhard Eckel, Werner A. Deutsch. Time-Frequency Sparsity by Removing Perceptually Irrelevant Components Using a Simple Model of Simultaneous Masking
50 -- 57Antti J. Eronen, Anssi Klapuri. Music Tempo Estimation With k -NN Regression
58 -- 67Jean-Marc Valin, Timothy B. Terriberry, Christopher Montgomery, Gregory Maxwell. A High-Quality Speech and Audio Codec With Less Than 10-ms Delay
68 -- 77Martin Raspaud, Harald Viste, Gianpaolo Evangelista. Binaural Source Localization by Joint Estimation of ILD and ITD
78 -- 89Konrad Kowalczyk, Maarten van Walstijn. Wideband and Isotropic Room Acoustics Simulation Using 2-D Interpolated FDTD Schemes
90 -- 100Tiago H. Falk, Wai-Yip Chan. Modulation Spectral Features for Robust Far-Field Speaker Identification
101 -- 116Vaninirappuputhenpurayil Gopalan Reju, Soo Ngee Koh, Ing Yann Soon. Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
117 -- 125Ian Vince McLoughlin. Vowel Intelligibility in Chinese
141 -- 157Shih-Sian Cheng, Hsin-Min Wang, Hsin-Chia Fu. BIC-Based Speaker Segmentation Using Divide-and-Conquer Strategies With Application to Speaker Diarization
158 -- 170Emanuël Anco Peter Habets, Jacob Benesty, Israel Cohen, Sharon Gannot, Jacek Dmochowski. New Insights Into the MVDR Beamformer in Room Acoustics
171 -- 186Tianyu T. Wang, Thomas F. Quatieri. High-Pitch Formant Estimation by Exploiting Temporal Change of Pitch
187 -- 196Feifan Liu, Yang Liu. Exploring Correlation Between ROUGE and Human Evaluation on Meeting Summaries
197 -- 207Mehryar Mohri, Pedro Moreno, Eugene Weinstein. Efficient and Robust Music Identification With Weighted Finite-State Transducers