Abstract is missing.
- Human sound perception - what can we learn from it when developing audio analysis algorithms?Tuomas Virtanen. [doi]
- Pitch estimation using mutual informationMajid Mirbagheri, Yanbo Xu, Shihab A. Shamma. 1-4 [doi]
- Establishing some principles of human speech production through two-dimensional computational modelsMauro Nicolao, Roger K. Moore. 5-10 [doi]
- A spectral envelope estimation method based on F0-adaptive multi-frame integration analysisTomoyasu Nakano, Masataka Goto. 11-16 [doi]
- Cochlear implant-like processing of speech signal for speaker verificationCong-Thanh Do, Claude Barras. 17-21 [doi]
- Evaluating speech intelligibility enhancement for HMM-based synthetic speech in noiseCassia Valentini-Botinhao, Junichi Yamagishi, Simon King. 22-27 [doi]
- A generalized Stein's estimation approach for speech enhancement based on perceptual criteriaSunder Ram Krishnan, Chandra Sekhar Seelamantula. 28-33 [doi]
- Non-stationary signal processing and its application in speech recognitionZoltán Tüske, Friedhelm R. Drepper, Ralf Schlüter. 34-39 [doi]
- Joint uncertainty decoding with unscented transform for noise robust subspace Gaussian mixture modelsLiang Lu, Arnab Ghoshal, Steve Renals. 40-45 [doi]
- Hierarchical hybrid language models for open vocabulary continuous speech recognition using WFSTM. Ali Basha Shaik, David Rybach, Stefan Hahn, Ralf Schlüter, Hermann Ney. 46-51 [doi]
- Template-based ASR using posterior features and synthetic references: comparing different TTS systemsSerena Soldo, Mathew Magimai-Doss, Hervé Bourlard. 52-57 [doi]
- Explicit duration modelling in HMM-based speech synthesis using a hybrid hidden Markov model-multilayer perceptronKalu U. Ogbureke, João P. Cabral, Julie Carson-Berndsen. 58-63 [doi]
- Dimensionality reduction of large TDOA vectors for speaker diarizationDeepu Vijayasenan, Fabio Valente. 64-67 [doi]
- Joint detection and localization of multiple speakers using a probabilistic interpretation of the steered response powerYoussef Oualil, Mathew Magimai-Doss, Friedrich Faubel, Dietrich Klakow. 68-73 [doi]
- Structured sparse coding for microphone array location calibrationAfsaneh Asaei, Bhiksha Raj, Hervé Bourlard, Volkan Cevher. 74-79 [doi]
- Log-normal matrix factorization with application to speech-music separationTakuya Yoshioka, Daichi Sakaue. 80-85 [doi]
- Multi-channel speech separation with soft time-frequency maskingRahil Mahdian Toroghi, Friedrich Faubel, Dietrich Klakow. 86-91 [doi]
- Smoothing speech trajectories by regularizationHeyun Huang, Louis ten Bosch, Bert Cranen, Lou Boves. 92-97 [doi]
- Data-driven speech representations for NMF-based word learningJoris Driesen, Jort F. Gemmeke, Hugo Van Hamme. 98-103 [doi]
- Spectro-temporal features with distribution equalizationSamuel K. Ngouoko M, Martin Heckmann, Britta Wrede. 104-109 [doi]
- Language identification using spectro-temporal patch featuresKamal Sahni, Pranay Dighe, Rita Singh, Bhiksha Raj. 110-113 [doi]
- Inharmonic speech: a tool for the study of speech perception and separationJosh H. McDermott, Daniel P. W. Ellis, Hideki Kawahara. 114-117 [doi]