Abstract is missing.
- Sound Event Detection Using Point-Labeled DataBongjun Kim, Bryan Pardo. 1-5 [doi]
- Batch Uniformization for Minimizing Maximum Anomaly Score of Dnn-Based Anomaly Detection in SoundsYuma Koizumi, Shoichiro Saito, Masataka Yamaguchi, Shin Murata, Noboru Harada. 6-10 [doi]
- City Classification from Multiple Real-World Sound ScenesHelen L. Bear, Toni Heittola, Annamaria Mesaros, Emmanouil Benetos, Tuomas Virtanen. 11-15 [doi]
- Model-Agnostic Approaches To Handling Noisy Labels When Training Sound Event ClassifiersEduardo Fonseca, Frederic Font, Xavier Serra. 16-20 [doi]
- Annotations Time Shift: A Key Parameter in Evaluating Musical Note Onset Detection AlgorithmsMina Mounir, Peter Karsmakers, Toon van Waterschoot. 21-25 [doi]
- End-To-End Melody Note Transcription Based on a Beat-Synchronous Attention MechanismRyo Nishikimi, Eita Nakamura, Masataka Goto, Kazuyoshi Yoshii. 26-30 [doi]
- Time-Scale Modification Using Fuzzy Epoch-Synchronous Overlap-Add (FESOLA)Timothy Roberts, Kuldip K. Paliwal. 31-34 [doi]
- High-Level Control of Drum Track Generation Using Learned Patterns of Rhythmic InteractionStefan Lattner, Maarten Grachten. 35-39 [doi]
- Investigating Kernel Shapes and Skip Connections for Deep Learning-Based Harmonic-Percussive SeparationCarlos Lordelo, Emmanouil Benetos, Simon Dixon, Sven Ahlbäck. 40-44 [doi]
- Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and QuantityEthan Manilow, Gordon Wichern, Prem Seetharaman, Jonathan Le Roux. 45-49 [doi]
- On The Behavior of Delay Network Reverberator ModesGrehisama Das, Elliot K. Canfield-Dafilou, Jonathan S. Abel. 50-54 [doi]
- Graphic Equalizer Design with Symmetric Biquad FiltersJuho Liski, Jussi Rämö, Vesa Välimäki. 55-59 [doi]
- Active Feedback Suppression for Hearing Devices Exploiting Multiple LoudspeakersHenning F. Schepker, Simon Doclo. 60-64 [doi]
- Perceptual Evaluation of Binaural Auralization of Data Obtained from the Spatial Decomposition MethodJens Ahrens. 65-69 [doi]
- Sparse Representation of Hrtfs by Ear AlignmentZamir Ben-Hur, David Lou Alon, Ravish Mehra, Boaz Rafaely. 70-74 [doi]
- Morphological Weighting Improves Individualized Prediction of HRTF Directivity PatternsMuhammad Shahnawaz, Craig T. Jin, Joan Alexis Glaunès, Augusto Sarti, Anthony I. Tew. 75-79 [doi]
- EEG-Based Decoding of Auditory Attention to a Target Instrument in Polyphonic MusicGiorgia Cantisani, Slim Essid, Gaël Richard. 80-84 [doi]
- Intrusive and Non-Intrusive Perceptual Speech Quality Assessment Using a Convolutional Neural NetworkHannes Gamper, Chandan K. A. Reddy, Ross Cutler, Ivan J. Tashev, Johannes Gehrke. 85-89 [doi]
- An Improved Measure of Musical Noise Based on Spectral KurtosisMatteo Torcoli. 90-94 [doi]
- An Efficient Model for Estimating Subjective Quality of Separated Audio Source SignalsThorsten Kastner, Jürgen Herre. 95-99 [doi]
- A Classification-Aided Framework for Non-Intrusive Speech Quality AssessmentXuan Dong 0004, Donald S. Williamson. 100-104 [doi]
- Identification of Voice Quality Variation Using I-VectorsChuyao Feng, Eva van Leer, David V. Anderson. 105-109 [doi]
- 3D Localized Sound Zone Generation with a Planar Omni-Directional Loudspeaker ArrayTakuma Okamoto. 110-114 [doi]
- Motion-Tolerant Beamforming with Deformable Microphone ArraysRyan M. Corey, Andrew C. Singer. 115-119 [doi]
- An Em Method for Multichannel Toa and Doa Estimation of Acoustic EchoesJesper Rindom Jensen, Usama Saqib, Sharon Gannot. 120-124 [doi]
- Speech Enhancement Using Polynomial Eigenvalue DecompositionVincent W. Neo, Christine Evers, Patrick A. Naylor. 125-129 [doi]
- Sub-Sample Time Delay Estimation via Auxiliary-Function-Based Iterative UpdatesKouei Yamaoka, Robin Scheibler, Nobutaka Ono, Yukoh Wakabayashi. 130-134 [doi]
- Active Noise Control Over 3D Space with Multiple Circular ArraysHuiyuan Sun, Thushara D. Abhayapala, Prasanga N. Samarasinghe. 135-139 [doi]
- Sound Field Translation Methods for Binaural ReproductionLachlan Birnie, Thushara Abhayapala, Prasanga N. Samarasinghe, Vladimir Tourbabin. 140-144 [doi]
- Feedback Structures for a Transfer Function Model of a Circular Vibrating MembraneMaximilian Schäfer, Rudolf Rabenstein, Sebastian J. Schlecht. 145-149 [doi]
- Dense Reverberation with Delay Feedback MatricesSebastian J. Schlecht, Emanuel A. P. Habets. 150-154 [doi]
- Physical Models For Fast Estimation Of Guitar String, Fret And Plucking PositionJacob Møller Hjerrild, Silvin Willemsen, Mads Græsbøll Christensen. 155-159 [doi]
- Joint Singing Pitch Estimation and Voice Separation Based on a Neural Harmonic Structure RendererTomoyasu Nakano, Kazuyoshi Yoshii, Yiming Wu, Ryo Nishikimi, Kin Wah Edward Lin, Masataka Goto. 160-164 [doi]
- Analysis of Robustness of Deep Single-Channel Speech Separation Using Corpora Constructed From Multiple DomainsMatthew Maciejewski, Gregory Sell, Yusuke Fujita, Leibny Paola García-Perera, Shinji Watanabe, Sanjeev Khudanpur. 165-169 [doi]
- A Style Transfer Approach to Source SeparationShrikant Venkataramani, Efthymios Tzinis, Paris Smaragdis. 170-174 [doi]
- Universal Sound SeparationIlya Kavalerov, Scott Wisdom, Hakan Erdogan, Brian Patton, Kevin W. Wilson, Jonathan Le Roux, John R. Hershey. 175-179 [doi]
- Deep Tensor Factorization for Spatially-Aware Scene DecompositionJonah Casebeer, Michael Colomb, Paris Smaragdis. 180-184 [doi]
- Independent Vector Analysis with More Microphones Than SourcesRobin Scheibler, Nobutaka Ono. 185-189 [doi]
- Sparse Adaptation of Distributed Blind Source Separation in Acoustic Sensor NetworksMichael Günther, Haitham Afifi, Andreas Brendel, Holger Karl, Walter Kellermann. 190-194 [doi]
- Multiple Hypothesis Tracking for Overlapping Speaker SegmentationAidan O. T. Hogg, Christine Evers, Patrick A. Naylor. 195-199 [doi]
- Declipping Speech Using Deep FilteringWolfgang Mack, Emanuel A. P. Habets. 200-204 [doi]
- Speech Bandwidth Extension with WavenetArchit Gupta, Brendan Shillingford, Yannis M. Assael, Thomas C. Walters. 205-208 [doi]
- IRM with Phase Parameterization for Speech EnhancementXianyun Wang, Changchun Bao, Rui Cheng. 209-213 [doi]
- Generative Speech Enhancement Based on Cloned NetworksMichael Chinen, W. Bastiaan Kleijn, Felicia S. C. Lim, Jan Skoglund. 214-218 [doi]
- Improvement of Speech Residuals for Speech EnhancementSamy Elshamy, Tim Fingscheidt. 219-223 [doi]
- Simultaneous Denoising, Dereverberation, and Source Separation Using a Unified Convolutional BeamformerTomohiro Nakatani, Keisuke Kinoshita, Rintaro Ikeshita, Hiroshi Sawada, Shoko Araki. 224-228 [doi]
- A Perceptual Weighting Filter Loss for DNN Training In Speech EnhancementZiyue Zhao, Samy Elshamy, Tim Fingscheidt. 229-233 [doi]
- Speech Enhancement Using End-to-End Speech Recognition ObjectivesAswin Shanmugam Subramanian, Xiaofei Wang, Murali Karthick Baskar, Shinji Watanabe, Toru Taniguchi, Dung T. Tran, Yuya Fujita. 234-238 [doi]
- Separated Noise Suppression and Speech Restoration: Lstm-Based Speech Enhancement in Two StagesMaximilian Strake, Bruno Defraene, Kristoff Fluyt, Wouter Tirry, Tim Fingscheidt. 239-243 [doi]
- Fast Convergence Algorithm for State-Space Model Based Speech Dereverberation by Multi-Channel Non-Negative Matrix FactorizationMasahito Togami, Tatsuya Komatsu. 244-248 [doi]
- Attention Wave-U-Net for Speech EnhancementRitwik Giri, Umut Isik, Arvindh Krishnaswamy. 249-253 [doi]
- Dilated FCN: Listening Longer to Hear BetterShuyu Gong, Zhewei Wang, Tao Sun, Yuanhang Zhang, Charles D. Smith, Li Xu, Jundong Liu. 254-258 [doi]
- Unsupervised Adversarial Domain Adaptation Based on The Wasserstein Distance For Acoustic Scene ClassificationKonstantinos Drossos, Paul Magron, Tuomas Virtanen. 259-263 [doi]
- Zero-Shot Audio Classification Based On Class Label EmbeddingsHuang Xie, Tuomas Virtanen. 264-267 [doi]
- Identify, Locate and Separate: Audio-Visual Object Extraction in Large Video Collections Using Weak SupervisionSanjeel Parekh, Alexey Ozerov, Slim Essid, Ngoc Q. K. Duong, Patrick Pérez, Gaël Richard. 268-272 [doi]
- Weakly Informed Audio Source SeparationKilian Schulze-Forster, Clément Doire, Gaël Richard, Roland Badeau. 273-277 [doi]
- Tricycle: Audio Representation Learning from Sensor Network Data Using Self-SupervisionMark Cartwright, Jason Cramer, Justin Salamon, Juan Pablo Bello. 278-282 [doi]
- Deep Ranking-Based Sound Source LocalizationRenana Opochinsky, Bracha Laufer-Goldshtein, Sharon Gannot, Gal Chechik. 283-287 [doi]
- Independent Low-Rank Matrix Analysis with Decorrelation LearningRintaro Ikeshita, Nobutaka Ito, Tomohiro Nakatani, Hiroshi Sawada. 288-292 [doi]
- Generalized Weighted-Prediction-Error Dereverberation with Varying Source Priors For Reverberant Speech RecognitionToru Taniguchi, Aswin Shanmugam Subramanian, Xiaofei Wang, Dung Tran, Yuya Fujita, Shinji Watanabe. 293-297 [doi]
- Multichannel Speech Enhancement Based On Time-Frequency Masking Using Subband Long Short-Term MemoryXiaofei Li, Radu Horaud. 298-302 [doi]
- Parametric Resynthesis With Neural VocodersSoumi Maiti, Michael I. Mandel. 303-307 [doi]
- Continual Learning of New Sound Classes Using Generative ReplayZhepei Wang, Y. Cem Sübakan, Efthymios Tzinis, Paris Smaragdis, Laurent Charlin. 308-312 [doi]
- ToyADMOS: A Dataset of Miniature-Machine Operating Sounds for Anomalous Sound DetectionYuma Koizumi, Shoichiro Saito, Hisashi Uematsu, Noboru Harada, Keisuke Imoto. 313-317 [doi]
- Evaluation of Post-Processing Algorithms for Polyphonic Sound Event DetectionLéo Cances, Patrice Guyot, Thomas Pellegrini. 318-322 [doi]
- Polyphonic Sound Event and Sound Activity Detection: A Multi-Task ApproachArjun Pankajakshan, Helen L. Bear, Emmanouil Benetos. 323-327 [doi]
- Acoustic Scene Classification Using Higher-Order Ambisonic FeaturesMarc C. Green, Sharath Adavanne, Damian Murphy, Tuomas Virtanen. 328-332 [doi]
- Joint Measurement of Localization and Detection of Sound EventsAnnamaria Mesaros, Sharath Adavanne, Archontis Politis, Toni Heittola, Tuomas Virtanen. 333-337 [doi]
- Joint Analysis of Acoustic Events and Scenes Based on Multitask LearningNoriyuki Tonami, Keisuke Imoto, Masahiro Niitsuma, Ryosuke Yamanishi, Yoichi Yamashita. 338-342 [doi]
- Regression Versus Classification for Neural Network Based Audio Source LocalizationLauréline Perotin, Alexandre Défossez, Emmanuel Vincent, Romain Serizel, Alexandre Guérin. 343-347 [doi]
- Sound Source Localization Using Relative Harmonic Coefficients in Modal DomainYonggang Hu, Prasanga N. Samarasinghe, Thushara D. Abhayapala. 348-352 [doi]
- Acoustic Localization Using Spatial Probability in Noisy and Reverberant EnvironmentsSebastian Braun, Ivan Tashev. 353-357 [doi]
- Supervised Contrastive Embeddings for Binaural Source LocalizationDuowei Tang, Maja Taseska, Toon van Waterschoot. 358-362 [doi]
- Improved Change Prediction for Combined Beamforming and Echo Cancellation with Application to a Generalized Sidelobe CancelerStefan Kühl, Alexander Bohlender, Matthias Schrammen, Peter Jax. 363-367 [doi]
- Two-Dimensional Sound Field Recording With Multiple Circular Microphone Arrays Considering Multiple ScatteringMasahiro Nakanishi, Natsuki Ueno, Shoichi Koyama, Hiroshi Saruwatari. 368-372 [doi]
- RTF-Steered Binaural MVDR Beamforming Incorporating Multiple External MicrophonesNico Gößling, Wiebke Middelberg, Simon Doclo. 373-377 [doi]
- 1ST-Order Microphone Array System for Large Area Sound Field Recording and Reconstruction: Discussion and Preliminary ResultsFederico Borra, Steven Krenn, Israel Dejene Gebru, Dejan Markovic. 378-382 [doi]
- Direction of Arrival Estimation In Highly Reverberant Environments Using Soft Time-Frequency MaskVladimir Tourbabin, Jacob Donley, Boaz Rafaely, Ravish Mehra. 383-387 [doi]
- Analytical Method of 2.5d Exterior Sound Field Synthesis By Using Multipole Loudspeaker ArrayKenta Imaizumi, Kimitaka Tsutsumi, Atsushi Nakadaira, Yoichi Haneda. 388-392 [doi]
- A Sparse Bayesian Learning Based RIR Reconstruction Method for Acoustic Toa And DOA EstimationZonglong Bai, Jesper Rindom Jensen, Jinwei Sun, Mads Græsbøll Christensen. 393-397 [doi]