IEEE Transactions on Audio, Speech & Language Processing

researchr

You are not signed in
Sign in
Sign up

1889	--	1901	Ozlem Kalinli, Michael L. Seltzer, Jasha Droppo, Alex Acero. Noise Adaptive Training for Robust Automatic Speech Recognition
1902	--	1912	G. N. Lilis, Daniele Angelosante, Georgios B. Giannakis. Sound Field Reproduction using the Lasso
1913	--	1928	Wenyi Zhang, Bhaskar D. Rao. A Two Microphone-Based Approach for Source Localization of Multiple Speech Sources
1929	--	1940	Damián Marelli, Mitsuko Aramaki, Richard Kronland-Martinet, Charles Verron. Time-Frequency Synthesis of Noisy Sounds With Narrow Spectral Components
1941	--	1954	Songfang Huang, Steve Renals. Hierarchical Bayesian Language Models for Conversational Speech Recognition
1955	--	1967	Emmanouil Benetos, Constantine Kotropoulos. Non-Negative Tensor Factorization Applied to Music Genre Classification
1968	--	1977	Emmanouil Benetos, Yannis Stylianou. Auditory Spectrum-Based Pitched Instrument Onset Detection
1978	--	1993	Jian Liu, Yegui Xiao, Jinwei Sun, Li Xu. Analysis of Online Secondary-Path Modeling With Auxiliary Noise Scaled by Residual Noise Signal
1994	--	2003	Chi-Chun Hsia, Chung-Hsien Wu, Jung-Yun Wu. Exploiting Prosody Hierarchy and Dynamic Features for Pitch Modeling and Generation in HMM-Based Speech Synthesis
2004	--	2014	Huijun Ding, Ing Yann Soon, Chai Kiat Yeo. Over-Attenuated Components Regeneration for Speech Enhancement
2015	--	2027	A. Reddy, R. C. Rose. Integration of Statistical Models for Dictation of Document Translations in a Machine-Aided Human Translation Task
2028	--	2037	David Imseng, Gerald Friedland. Tuning-Robust Initialization Methods for Speaker Diarization
2038	--	2050	Jens Ahrens, Sascha Spors. Sound Field Reproduction Using Planar and Linear Arrays of Loudspeakers
2051	--	2066	Gregory Sell, Malcolm Slaney. Solving Demodulation as an Optimization Problem
2067	--	2079	Guoning Hu, DeLiang L. Wang. A Tandem Algorithm for Pitch Estimation and Voiced Speech Segregation
2080	--	2090	Gibak Kim, Philipos C. Loizou. Improving Speech Intelligibility in Noise Using Environment-Optimized Algorithms
2091	--	2098	M. Kleider, Boaz Rafaely, Barak Weiss, Eitan Bachmat. Golden-Ratio Sampling for Scanning Circular Microphone Arrays
2099	--	2110	Seokhwan Jo, Chang D. Yoo. Psychoacoustically Constrained and Distortion Minimized Speech Enhancement
2111	--	2120	Wooil Kim, John H. L. Hansen. Missing-Feature Reconstruction by Leveraging Temporal Spectral Correlation for Robust Speech Recognition in Background Noise Conditions
2121	--	2133	Zhiyao Duan, Bryan Pardo, Changshui Zhang. Multiple Fundamental Frequency Estimation by Modeling Spectral Peaks and Non-Peak Regions
2134	--	2144	Nikoletta Bassiou, Vassiliki Moschou, Constantine Kotropoulos. Speaker Diarization Exploiting the Eigengap Criterion and Cluster Ensembles
2145	--	2154	V. Rao, P. Rao. Vocal Melody Extraction in the Presence of Pitched Accompaniment in Polyphonic Music
2155	--	2167	Brady Laska, Miodrag Bolic, Rafik A. Goubran. Particle Filter Enhancement of Speech Spectral Amplitudes

1673	--	1675	Tomohiro Nakatani, Walter Kellermann, P. Naylor, Masato Miyoshi, Biing-Hwang Juang. Introduction to the Special Issue on Processing Reverberant Speech: Methodologies and Applications
1676	--	1691	Armin Sehr, Roland Maas, Walter Kellermann. Reverberation Model-Based Decoding in the Logmelspec Domain for Robust Distant-Talking Speech Recognition
1692	--	1707	Alexander Krueger, Reinhold Haeb-Umbach. Model-Based Feature Enhancement for Reverberant Speech Recognition
1708	--	1716	R. Gomez, T. Kawahara. Robust Speech Recognition Based on Dereverberation Parameter Optimization Using Acoustic Model Likelihood
1717	--	1731	Tomohiro Nakatani, Takuya Yoshioka, Keisuke Kinoshita, Masato Miyoshi, Biing-Hwang Juang. Speech Dereverberation Based on Variance-Normalized Delayed Linear Prediction
1732	--	1745	Marco Jeub, M. Schafer, Thomas Esch, Peter Vary. Model-Based Dereverberation Preserving Binaural Cues
1746	--	1765	Jan S. Erkelens, Richard Heusdens. Correlation-Based and Model-Based Blind Single-Channel Late-Reverberation Suppression in Noisy Time-Varying Acoustical Environments
1766	--	1774	Tiago H. Falk, Chenxi Zheng, Wai-Yip Chan. A Non-Intrusive Quality and Intelligibility Measure of Reverberant and Dereverberated Speech
1775	--	1780	Takayuki Arai, Nao Hodoshima, K. Yasu. Using Steady-State Suppression to Improve Speech Intelligibility in Reverberant Environments for Elderly Listeners
1781	--	1792	Flavio Ribeiro, Cha Zhang, Dinei A. F. Florêncio, Demba E. Ba. Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization
1793	--	1805	Yan-Chen Lu, M. Cooke. Binaural Estimation of Sound Source Distance via the Direct-to-Reverberant Energy Ratio for Static and Moving Sources
1806	--	1817	F. Talantzis. An Acoustic Source Localization and Tracking Framework Using Particle Filtering and Information Theory
1818	--	1829	M. Kowalski, Emmanuel Vincent, Rémi Gribonval. Beyond the Narrowband Approximation: Wideband Convex Methods for Under-Determined Reverberant Audio Source Separation
1830	--	1840	Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gribonval. Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model
1841	--	1855	Alireza Masnadi-Shirazi, Wenyi Zhang, Bhaskar D. Rao. Glimpsing IVA: A Framework for Overcomplete/Complete/Undercomplete Convolutive Source Separation
1856	--	1866	John Woodruff, DeLiang Wang. Sequential Organization of Speech in Reverberant Environments by Integrating Monaural Grouping and Binaural Localization
1867	--	1871	C. Hummersone, R. Mason, T. Brookes. Dynamic Precedence Effect Modeling for Source Separation in Reverberant Environments
1872	--	1883	Michael I. Mandel, S. Bressler, Barbara G. Shinn-Cunningham, Daniel P. W. Ellis. Evaluating Source Separation Algorithms With Reverberant Speech

1094	--	1106	Hamed Ketabdar, Hervé Bourlard. Enhanced Phone Posteriors for Improving Speech Recognition Systems
1107	--	1115	Sergio Canazza, Giovanni De Poli, Gian Antonio Mian. Restoration of Audio Documents by Means of Extended Kalman Filter
1116	--	1126	Chunghsin Yeh, Axel Röbel, Xavier Rodet. Multiple Fundamental Frequency Estimation and Polyphony Inference of Polyphonic Music Signals
1127	--	1136	Jiucang Hao, Te-Won Lee, Terrence J. Sejnowski. Speech Enhancement Using Gaussian Scale Mixture Models
1137	--	1146	R. Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen. Integrated Active Noise Control and Noise Reduction in Hearing Aids
1147	--	1157	Justin Jian Zhang, Ricky Ho Yin Chan, Pascale Fung. Extractive Speech Summarization Using Shallow Rhetorical Structure Modeling
1158	--	1169	Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li, Chin-Hui Lee. A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition
1170	--	1181	Chung-Hsien Wu, Chao-Hong Liu, M. Harris, Liang-Chih Yu. Sentence Correction Incorporating Relative Position and Parse Template Language Models
1182	--	1192	Robbie Vogt, Sridha Sridharan, Michael Mason. Making Confident Speaker Verification Decisions With Minimal Speech
1193	--	1207	Dimitri Nion, Kleanthis N. Mokios, Nicholas D. Sidiropoulos, Alexandros Potamianos. Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures
1208	--	1217	Vaclav Eksler, Milan Jelinek. Glottal-Shape Codebook to Improve Robustness of CELP Codecs
1218	--	1227	Moo-young Kim, W. Bastiaan Kleijn. Reduction of the Impact of Distortion Outliers and Source Mismatch in Resolution-Constrained Quantization
1228	--	1242	Maurice F. Fallon, Simon J. Godsill. Acoustic Source Localization and Tracking Using Track Before Detect
1243	--	1257	Xiaoqiang Xiao, Robert M. Nickel. Speech Enhancement With Inventory Style Speech Resynthesis
1258	--	1268	Angel M. Gomez, Jose L. Carmona, Antonio M. Peinado, Victoria E. Sánchez. A Multipulse-Based Forward Error Correction Technique for Robust CELP-Coded Speech Transmission Over Erasure Channels
1269	--	1279	Matt Gibson 0002, Thomas Hain. Error Approximation and Minimum Phone Error Acoustic Model Estimation
1280	--	1289	Matthias Mauch, Simon Dixon. Simultaneous Estimation of Chords and Musical Context From Audio
1290	--	1299	Jingen Ni, Feng Li. A Variable Step-Size Matrix Normalized Subband Adaptive Filter
1300	--	1312	Chang Huai You, Kong-Aik Lee, Haizhou Li. GMM-SVM Kernel With a Bhattacharyya-Based Distance for Speaker Recognition
1313	--	1322	Ilknur Durgar El-Kahlout, Kemal Oflazer. Exploiting Morphology and Local Word Reordering in English-to-Turkish Phrase-Based Statistical Machine Translation
1323	--	1331	Jingbo Zhu, Huizhen Wang, Benjamin K. Tsou, Matthew Y. Ma. Active Learning With Sampling by Uncertainty and Density for Data Annotations
1332	--	1340	Chi-Sang Jung, Moo-young Kim, Hong-Goo Kang. Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information
1341	--	1353	Jose L. Carmona, Antonio M. Peinado, José L. Pérez-Córdoba, Angel M. Gomez. MMSE-Based Packet Loss Concealment for CELP-Coded Speech Recognition
1354	--	1365	Kentaro Ishizuka, Shoko Araki, T. Kawahara. Speech Activity Detection for Multi-Party Conversation Analyses Based on Likelihood Ratio Test on Spatial Magnitude
1366	--	1378	Marc Ferras, Cheung Chi Leung, Claude Barras, Jean-Luc Gauvain. Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition
1379	--	1393	Hynek Boril, John H. L. Hansen. Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments
1394	--	1405	Chung-Hsien Wu, Chi-Chun Hsia, Chung-Han Lee, Mai-Chun Lin. Hierarchical Prosody Conversion Using Regression-Based Clustering for Emotional Speech Synthesis
1406	--	1416	Keansub Lee, Daniel P. W. Ellis. Audio-Based Semantic Concept Classification for Consumer Video
1417	--	1428	C. Tantibundhit, Franz Pernkopf, Gernot Kubin. Joint Time-Frequency Segmentation Algorithm for Transient Speech Decomposition and Speech Enhancement
1429	--	1439	Eric A. Lehmann, Anders M. Johansson. Diffuse Reverberation Model for Efficient Image-Source Simulation of Room Impulse Responses
1440	--	1454	Ø. Birkenes, T. Matsui, K. Tanabe, Sabato Marco Siniscalchi, Tor André Myrvoll, M. H. Johnsen. Penalized Logistic Regression With HMM Log-Likelihood Regressors for Speech Recognition
1455	--	1463	Jerome R. Bellegarda. A Dynamic Cost Weighting Framework for Unit Selection Text-to-Speech Synthesis
1464	--	1475	Mathieu Parvaix, Laurent Girin, Jean-Marc Brossier. A Watermarking-Based Method for Informed Source Separation of Audio Signals With a Single Sensor
1476	--	1485	Hirofumi Nakajima, Kazuhiro Nakadai, Yuji Hasegawa, Hiroshi Tsujino. Blind Source Separation With Parameter-Free Adaptive Step-Size Method for Robot Audition
1486	--	1495	M. Akbacak, John H. L. Hansen. Spoken Proper Name Retrieval for Limited Resource Languages Using Multilingual Hybrid Representations
1496	--	1506	Mitchell McLaren, Robbie Vogt, Brendan Baker, Sridha Sridharan. Data-Driven Background Dataset Selection for SVM-Based Speaker Verification
1507	--	1516	Hirokazu Kameoka, Nobutaka Ono, Shigeki Sagayama. Speech Spectrum Modeling for Joint Estimation of Spectral Envelope and Fundamental Frequency
1517	--	1527	Andre Holzapfel, Yannis Stylianou, Ali Cenk Gedik, Baris Bozkurt. Three Dimensions of Pitched Instrument Onset Detection
1528	--	1538	Panikos Heracleous, V.-A. Tran, T. Nagai, Kiyohiro Shikano. Analysis and Recognition of NAM Speech Using HMM Distances and Visual Information
1539	--	1549	Yuya Akita, Tatsuya Kawahara. Statistical Transformation of Language and Pronunciation Models for Spontaneous Speech Recognition
1550	--	1561	Charles Verron, Mitsuko Aramaki, Richard Kronland-Martinet, Grégory Pallone. A 3-D Immersive Synthesizer for Environmental Sounds
1562	--	1574	Yi-Cheng Pan, Lin-Shan Lee. Performance Analysis for Lattice-Based Speech Indexing Approaches Using Words and Subword Units
1575	--	1587	Mehrez Souden, Jacob Benesty, Sofiène Affes. Broadband Source Localization From an Eigenanalysis Perspective
1588	--	1600	Péter Mihajlik, Zoltán Tüske, B. Tarjan, Bottyán Németh, Tibor Fegyó. Improved Recognition of Spontaneous Hungarian Speech - Morphological and Acoustic Modeling Techniques for a Less Resourced Task
1601	--	1611	Gökhan Tür, Andreas Stolcke, L. Voss, Stanley Peters, Dilek Hakkani-Tür, John Dowding, Benoît Favre, Raquel Fernández, Matthew Frampton, Michael W. Frandsen, C. Frederickson, Martin Graciarena, D. Kintzing, K. Leveque, S. Mason, John Niekrasz, Matthew Purver, Korbinian Riedhammer, Elizabeth Shriberg, Jing Tien, Dimitra Vergyri, Fan Yang. The CALO Meeting Assistant System
1612	--	1623	Bengt J. Borgstrom, Abeer Alwan. HMM-Based Reconstruction of Unreliable Spectrographic Data for Noise Robust Speech Recognition
1624	--	1631	Sriram Ganapathy, Petr Motlícek, Hynek Hermansky. Autoregressive Models of Amplitude Modulations in Audio Compression
1632	--	1642	Hyeon-Jin Jeon, Tae-Gyu Chang, Sen M. Kuo. Analysis of Frequency Mismatch in Narrowband Active Noise Control
1643	--	1654	Valentin Emiya, Roland Badeau, Bertrand David. Multipitch Estimation of Piano Sounds Using a New Probabilistic Spectral Smoothness Principle
1655	--	1666	Thushara D. Abhayapala, Aastha Gupta. Spherical Harmonic Analysis of Wavefields Using Multiple Circular Sensor Arrays

909	--	911	Yannis Stylianou, T. Toda, C. H. Wu, Alexander Kain, Olivier Rosec. Introduction to the Special Section on Voice Transformation
912	--	921	Elina Helander, Tuomas Virtanen, Jani Nurminen, Moncef Gabbouj. Voice Conversion Using Partial Least Squares Regression
922	--	931	Daniel Erro, Asunción Moreno, Antonio Bonafonte. Voice Conversion Based on Weighted Frequency Warping
932	--	943	Jianhua Tao, Meng Zhang, Jani Nurminen, Jilei Tian, Xia Wang. Supervisory Data Alignment for Text-Independent Voice Conversion
944	--	953	Daniel Erro, Asunción Moreno, Antonio Bonafonte. INCA Algorithm for Training Voice Conversion Systems From Nonparallel Corpora
954	--	964	Srinivas Desai, Alan W. Black, B. Yegnanarayana, Kishore Prahallad. Spectral Mapping Using Artificial Neural Networks for Voice Conversion
965	--	973	Oytun Türk, Marc Schröder. Evaluation of Expressive Speech Synthesis With Voice Conversion and Copy Resynthesis Techniques
974	--	983	Daniel Erro, Eva Navas, Inmaculada Hernáez, Ibon Saratxaga. Emotion Conversion Based on Prosodic Unit Selection
984	--	1004	Junichi Yamagishi, B. Usabaev, Simon King, O. Watts, J. Dines, Jilei Tian, Yong Guan, Rile Hu, Keiichiro Oura, Yi-Jian Wu, Keiichi Tokuda, Reima Karhila, Mikko Kurimo. Thousands of Voices for HMM-Based Speech Synthesis-Analysis and Application of TTS Systems Built on Various ASR Corpora
1005	--	1016	O. Watts, J. Yamagishi, Simon King, K. Berkling. Synthesis of Child Speech With HMM Adaptation and Voice Conversion
1017	--	1029	Purvis Bedenbaugh, Diana K. Sarko, Heidi L. Roth, Eugene M. Martin. Prosody-Preserving Voice Transformation to Evaluate Brain Representations of Speech Sounds
1030	--	1040	Daniel Felps, Ricardo Gutierrez-Osuna. Developing Objective Measures of Foreign-Accent Conversion
1041	--	1052	Carlos Molina, Néstor Becerra Yoma, Fernando Huenupán, Claudio Garretón, Jorge Wuth. Maximum Entropy-Based Reinforcement Learning Using a Confidence Measure in Speech Recognition for Telephone Speech
1053	--	1062	Xugang Lu, Jianwu Dang. Vowel Production Manifold: Intrinsic Factor Analysis of Vowel Articulation
1063	--	1065	S. Abdallah. Comment on Unified View of Prediction and Repetition Structure in Audio Signals With Application to Interest Point Detection
1065	--	1068	Cong-Thanh Do, Dominique Pastor, André Goalic. On the Recognition of Cochlear Implant-Like Spectrally Reduced Speech With MFCC and HMM-Based ASR
1068	--	1071	Parham Mokhtari, Hironori Takemoto, Ryouichi Nishimura, Hiroaki Kato. Optimum Loss Factor for a Perfectly Matched Layer in Finite-Difference Time-Domain Acoustic Simulation
1072	--	1077	Mehrez Souden, Jingdong Chen, Jacob Benesty, Sofiène Affes. Gaussian Model-Based Multichannel Speech Presence Probability
1077	--	1082	S. Tiomkin, D. Malah, Salva Shechtman. Statistical Text-to-Speech Synthesis Based on Segment-Wise Representation With a Norm Constraint
1082	--	1086	Claudio Garretón, Néstor Becerra Yoma, M. Torres. Channel Robust Feature Transformation Based on Filter-Bank Energy Filtering

713	--	714	Vesa Välimäki, Federico Fontana, Julius O. Smith, Udo Zölzer. Introduction to the Special Issue on Virtual Analog Audio Effects and Musical Instruments
715	--	727	Giovanni De Sanctis, Augusto Sarti. Virtual Analog Modeling in the Wave-Digital Domain
728	--	737	David T. Yeh, Jonathan S. Abel, Julius O. Smith. Automated Physical Modeling of Nonlinear Audio Circuits For Real-Time Audio Effects - Part I: Theoretical Development
738	--	746	Jyri Pakarinen, Matti Karjalainen. Enhanced Wave Digital Triode Model for Real-Time Tube Amplifier Emulation
747	--	759	Thomas Hélie. Volterra Series and State Transformation for Real-Time Simulations of Audio Circuits Including Saturations: Application to the Moog Ladder Filter
760	--	772	Federico Fontana, Marco Civolani. Modeling of the EMS VCS3 Voltage-Controlled Filter as a Nonlinear Filter Network
773	--	785	Juhan Nam, Vesa Välimäki, Jonathan S. Abel, Julius O. Smith. Efficient Antialiasing Oscillator Algorithms Using Low-Order Fractional Delay Filters
786	--	798	Vesa Välimäki, Juhan Nam, Julius O. Smith, Jonathan S. Abel. Alias-Suppressed Oscillators Based on Differentiated Polynomial Waveforms
799	--	808	Stefan Bilbao, Julian Parker. A Virtual Model of Spring Reverberation
809	--	821	Balázs Bank, Stefano Zambon, Federico Fontana. A Modal-Based Real-Time Piano Synthesizer
822	--	832	Gianpaolo Evangelista, Fredrik Eckerholm. Player-Instrument Interaction Models for Digital Waveguide Synthesis of Guitar: Touch and Collisions
833	--	842	Nelson Lee, Julius O. Smith, Vesa Välimäki. Analysis and Synthesis of Coupled Vibrating Strings Using a Hybrid Modal-Waveguide Synthesis Model
843	--	854	Rémi Mignot, Thomas Hélie, Denis Matignon. Digital Waveguide Modeling for Wind Instruments: Building a State-Space Representation Based on the Webster-Lokshin Model
855	--	871	Esteban Maestre, Merlijn Blaauw, Jordi Bonada, Enric Guaus, Alfonso Perez. Statistical Modeling of Bowing Control Applied to Violin Sound Synthesis
872	--	880	Stefan Bilbao. Percussion Synthesis Based on Models of Nonlinear Shell Vibration
881	--	890	Rudolf Rabenstein, Tilman Koch, Christian Popp. Tubular Bells: A Physical and Algorithmic Model
891	--	902	Federico Avanzini, Riccardo Marogna. A Modular Physically Based Approach to the Sound Synthesis of Membrane Percussion Instruments

417	--	419	Bertrand David, Masataka Goto, Laurent Daudet, Paris Smaragdis. Editorial for the Special Issue on Signal Models and Representations of Musical and Environmental Sounds
420	--	433	Vittoria Bruni, Silvia Marconi, Domenico Vitulano. Time-Scale Atoms Chains for Transients Detection in Audio Signals
434	--	446	Emmanuel Ravelli, Gaël Richard, Laurent Daudet. Audio Signal Representations for Indexing in the Transform Domain
447	--	460	Nicolás Ruiz-Reyes, Pedro Vera-Candeas. Adaptive Signal Modeling Based on Sparse Approximations for Scalable Parametric Audio Coding
461	--	472	Bob L. Sturm, John J. Shynk. Sparse Approximation and the Pursuit of Meaningful Signal Models With Interference Adaptation
473	--	486	Julio J. Carabias-Orti, Pedro Vera-Candeas, Francisco J. Cañadas-Quesada, Nicolás Ruiz-Reyes. Music Scene-Adaptive Harmonic Dictionary for Unsupervised Note-Event Detection
487	--	497	Johan Xi Zhang, Mads Græsbøll Christensen, Søren Holdt Jensen, Marc Moonen. A Robust and Computationally Efficient Subspace-Based Fundamental Frequency Estimator
498	--	508	Jeremy Wells, Damian Murphy. A Comparative Evaluation of Techniques for Single-Frame Discrimination of Nonstationary Sinusoids
509	--	518	Mathieu Lagrange, Gary P. Scavone, Philippe Depalle. Analysis/Synthesis of Sounds Generated by Sustained Contact Between Rigid Objects
519	--	527	Paul H. Peeling, Ali Taylan Cemgil, Simon J. Godsill. Generative Spectrogram Factorization Models for Polyphonic Piano Transcription
528	--	537	Emmanuel Vincent, Nancy Bertin, Roland Badeau. Adaptive Harmonic Spectral Decomposition for Multiple Pitch Estimation
538	--	549	Nancy Bertin, Roland Badeau, Emmanuel Vincent. Enforcing Harmonicity and Smoothness in Bayesian Non-Negative Matrix Factorization Applied to Polyphonic Music Transcription
550	--	563	Alexey Ozerov, Cédric Févotte. Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
564	--	575	Jean-Louis Durrieu, Gaël Richard, Bertrand David, Cédric Févotte. Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals
576	--	588	Yannis Panagakis, Constantine Kotropoulos, Gonzalo R. Arce. Non-Negative Multilinear Principal Component Analysis of Auditory Temporal Modulations for Music Genre Classification
589	--	601	Onur Dikmen, Ali Taylan Cemgil. Gamma Markov Random Fields for Audio Source Modeling
602	--	612	Luke Barrington, Antoni B. Chan, Gert R. G. Lanckriet. Modeling Music as a Dynamic Texture
613	--	624	Anssi Klapuri, Tuomas Virtanen. Representing Musical Sounds With an Interpolating State Model
625	--	637	Kris West, Stephen Cox. Incorporating Cultural Representations of Features Into Audio Music Similarity Estimation
638	--	648	Hiromasa Fujihara, Masataka Goto, Tetsuro Kitahara, Hiroshi G. Okuno. A Modeling of Singing Voice Robust to Accompaniment Sounds and Its Application to Singer Identification and Vocal-Timbre-Similarity-Based Music Information Retrieval
649	--	662	Meinard Müller, Sebastian Ewert. Towards Timbre-Invariant Audio Features for Harmony-Based Music
663	--	674	Juan José Burred, Axel Röbel, Thomas Sikora. Dynamic Spectral Envelope Modeling for Timbre Analysis of Musical Instrument Sounds
675	--	687	Geoffroy Peeters, Emmanuel Deruty. Sound Indexing Using Morphological Description
688	--	707	Gordon Wichern, Jiachen Xue, Harvey D. Thornburg, Brandon Mechtley, Andreas Spanias. Segmentation, Indexing, and Retrieval for Environmental and Natural Sounds

213	--	223	Hüseyin Hacihabiboglu, Banu Gunel, Zoran Cvetkovic. Simulation of Directional Microphones in Digital Waveguide Mesh-Based Models of Room Acoustics
224	--	236	Claudius Gläser, Martin Heckmann, Frank Joublin, Christian Goerick. Combining Auditory Preprocessing and Bayesian Estimation for Robust Formant Tracking
237	--	248	Damián Marelli, Péter Balázs. On Pole-Zero Model Estimation Methods Minimizing a Logarithmic Criterion for Speech Analysis
249	--	259	Alfred Mertins, Tiemin Mei, Markus Kallinger. Room Impulse Response Shortening/Reshaping With Infinity- and p -Norm Optimization
260	--	276	Mehrez Souden, Jacob Benesty, Sofiène Affes. On Optimal Frequency-Domain Multichannel Linear Filtering for Noise Reduction
277	--	285	Avram Levi, Harvey F. Silverman. A Robust Method to Extract Talker Azimuth Orientation Using a Large-Aperture Microphone Array
286	--	295	Roberto Napoli, Luigi Piroddi. Nonlinear Active Noise Control With NARX Models
296	--	309	Luis Buera, Antonio Miguel, Oscar Saz, Alfonso Ortega, Eduardo Lleida. Unsupervised Data-Driven Feature Vector Normalization With Acoustic Model Adaptation for Robust Speech Recognition
310	--	319	Chao-Ling Hsu, Jyh-Shing Roger Jang. On the Improvement of Singing Voice Separation for Monaural Recordings Using the MIR-1K Dataset
320	--	329	Ümit Güz, Sébastien Cuendet, Dilek Hakkani-Tür, Gökhan Tür. Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech
330	--	341	Vinay Melkote, Kenneth Rose. Trellis-Based Approaches to Rate-Distortion Optimized Audio Encoding
342	--	355	Bram Cornelis, Simon Doclo, Tim Van den Bogaert, Marc Moonen, Jan Wouters. Theoretical Analysis of Binaural Multimicrophone Noise Reduction Techniques
356	--	368	Wen Jin, Xin Liu, Michael S. Scordilis, Lu Han. Speech Enhancement Using Harmonic Emphasis and Adaptive Comb Filtering
369	--	381	Nathalie Camelin, Frédéric Béchet, Géraldine Damnati, Renato de Mori. Detection and Interpretation of Opinion Expressions in Spoken Surveys
382	--	394	Michael I. Mandel, Ron J. Weiss, Daniel P. W. Ellis. Model-Based Expectation-Maximization Source Separation and Localization
395	--	406	Shinji Watanabe, Atsushi Nakamura. Predictor-Corrector Adaptation by Using Time Evolution System With Macroscopic Time Scale
407	--	412	Alexandros Nanopoulos, Dimitrios Rafailidis, Panagiotis Symeonidis, Yannis Manolopoulos. MusicBox: Personalized Music Recommendation Based on Cubic Analysis of Social Tags

1	--	0	Ali H. Sayed. Free Electronic Access to SP Publications
2	--	16	Dmitry N. Zotkin, Ramani Duraiswami, Nail A. Gumerov. Plane-Wave Decomposition of Acoustical Scenes Via Spherical and Cylindrical Microphone Arrays
17	--	33	Ramdas Kumaresan, Nitesh Panchal. Encoding Bandpass Signals Using Zero/Level Crossings: A Model-Based Approach
34	--	49	Péter Balázs, Bernhard Laback, Gerhard Eckel, Werner A. Deutsch. Time-Frequency Sparsity by Removing Perceptually Irrelevant Components Using a Simple Model of Simultaneous Masking
50	--	57	Antti J. Eronen, Anssi Klapuri. Music Tempo Estimation With k -NN Regression
58	--	67	Jean-Marc Valin, Timothy B. Terriberry, Christopher Montgomery, Gregory Maxwell. A High-Quality Speech and Audio Codec With Less Than 10-ms Delay
68	--	77	Martin Raspaud, Harald Viste, Gianpaolo Evangelista. Binaural Source Localization by Joint Estimation of ILD and ITD
78	--	89	Konrad Kowalczyk, Maarten van Walstijn. Wideband and Isotropic Room Acoustics Simulation Using 2-D Interpolated FDTD Schemes
90	--	100	Tiago H. Falk, Wai-Yip Chan. Modulation Spectral Features for Robust Far-Field Speaker Identification
101	--	116	Vaninirappuputhenpurayil Gopalan Reju, Soo Ngee Koh, Ing Yann Soon. Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
117	--	125	Ian Vince McLoughlin. Vowel Intelligibility in Chinese
141	--	157	Shih-Sian Cheng, Hsin-Min Wang, Hsin-Chia Fu. BIC-Based Speaker Segmentation Using Divide-and-Conquer Strategies With Application to Speaker Diarization
158	--	170	Emanuël Anco Peter Habets, Jacob Benesty, Israel Cohen, Sharon Gannot, Jacek Dmochowski. New Insights Into the MVDR Beamformer in Room Acoustics
171	--	186	Tianyu T. Wang, Thomas F. Quatieri. High-Pitch Formant Estimation by Exploiting Temporal Change of Pitch
187	--	196	Feifan Liu, Yang Liu. Exploring Correlation Between ROUGE and Human Evaluation on Meeting Summaries
197	--	207	Mehryar Mohri, Pedro Moreno, Eugene Weinstein. Efficient and Robust Music Identification With Weighted Finite-State Transducers

External Links

Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 18, Issue 8

Volume 18, Issue 7

Volume 18, Issue 6

Volume 18, Issue 5

Volume 18, Issue 4

Volume 18, Issue 3

Volume 18, Issue 2

Volume 18, Issue 1

External Links

Journal: IEEE Transactions on Audio, Speech &amp; Language Processing

Volume 18, Issue 8

Volume 18, Issue 7

Volume 18, Issue 6

Volume 18, Issue 5

Volume 18, Issue 4

Volume 18, Issue 3

Volume 18, Issue 2

Volume 18, Issue 1

Journal: IEEE Transactions on Audio, Speech & Language Processing