WASPAA (waspaa)
Publications
Viewing Publication 1 - 100 from 671
2023
SEFGAN: Harvesting the Power of Normalizing Flows and GANs for Efficient High-Quality Speech Enhancement
Martin Strauss 0003, Nicola Pia, Nagashree K. S. Rao, Bernd Edler. waspaa 2023: 1-5 [doi]
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023, New Paltz, NY, USA, October 22-25, 2023
IEEE, 2023. [doi]
Correlation Based Glimpse Proportion Index
Ahmed Alghamdi, Leonard Moen, Wai-Yip Chan, Daniel Fogerty, Jesper Jensen 0001. waspaa 2023: 1-5 [doi]
Neural Audio Decorrelation Using Generative Adversarial Networks
Carlotta Anemüller, Oliver Thiergart, Emanuël A. P. Habets. waspaa 2023: 1-5 [doi]
Audio Inputs for Active Speaker Detection and Localization Via Microphone Array
Davide Berghi, Philip J. B. Jackson. waspaa 2023: 1-5 [doi]
Complete and Separate: Conditional Separation with Missing Target Source Attribute Completion
Dimitrios Bralios, Efthymios Tzinis, Paris Smaragdis. waspaa 2023: 1-5 [doi]
Design of Frequency-Invariant Beamformers with Sparse Concentric Circular Arrays
Yaakov Buchris, Israel Cohen, Alon Amar. waspaa 2023: 1-5 [doi]
Class Activation Mapping-Driven Data Augmentation: Masking Significant Regions for Enhanced Acoustic Scene Classification
Pil Moo Byun, Jeong Hwan Choi, Joon-Hyuk Chang. waspaa 2023: 1-5 [doi]
LACE: A Light-Weight, Causal Model for Enhancing Coded Speech Through Adaptive Convolutions
Jan Büthe, Jean-Marc Valin, Ahmed Mustafa. waspaa 2023: 1-5 [doi]
Towards On-Device Keyword Spotting Using Low-Footprint Quaternion Neural Models
Aryan Chaudhary, Vinayak Abrol. waspaa 2023: 1-5 [doi]
The Effect of Spoken Language on Speech Enhancement Using Self-Supervised Speech Representation Loss Functions
George Close, Thomas Hain, Stefan Goetze. waspaa 2023: 1-5 [doi]
Mixed-Delay Distributed Beamforming for Own-Speech Separation in Hearing Devices with Wireless Remote Microphones
Ryan M. Corey. waspaa 2023: 1-5 [doi]
An Improved Metric of Informational Masking for Perceptual Audio Quality Measurement
Pablo M. Delgado, Jürgen Herre. waspaa 2023: 1-5 [doi]
Estimating the Direction of Arrival of a Spoken Wake Word Using a Single Sensor on an Elastic Panel
Tre DiPassio, Michael C. Heilemann, Benjamin Thompson, Mark F. Bocko. waspaa 2023: 1-5 [doi]
CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models
Hao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian J. McAuley. waspaa 2023: 1-5 [doi]
Slim-TasNet: A Slimmable Neural Network for Speech Separation
Mohamed Elminshawi, Srikanth Raj Chetupalli, Emanuël A. P. Habets. waspaa 2023: 1-5 [doi]
Predicting Thresholds in an Auditory Overshoot Paradigm Using a Computational Subcortical Model with Efferent Feedback
Afagh Farhadi, Laurel H. Carney. waspaa 2023: 1-5 [doi]
Temporal Noise Shaping on MDCT Subband Signals for Transform Audio Coding
Richard Füg, Bernd Edler. waspaa 2023: 1-5 [doi]
Hyperbolic Unsupervised Anomalous Sound Detection
François G. Germain, Gordon Wichern, Jonathan Le Roux. waspaa 2023: 1-5 [doi]
Covariance Blocking and Whitening Method for Successive Relative Transfer Function Vector Estimation in Multi-Speaker Scenarios
Henri Gode, Simon Doclo. waspaa 2023: 1-5 [doi]
Quaternion Anti-Transfer Learning for Speech Emotion Recognition
Eric Guizzo, Tillman Weyde, Giacomo Tarroni, Danilo Comminiello. waspaa 2023: 1-5 [doi]
An Objective Evaluation of Hearing Aids and DNN-Based Binaural Speech Enhancement in Complex Acoustic Scenes
Enric Gusó, Joanna Luberadzka, Martí Baig, Umut Sayin Saraç, Xavier Serra. waspaa 2023: 1-5 [doi]
Diff-Pitcher: Diffusion-Based Singing Voice Pitch Correction
Jiarui Hai, Mounya Elhilali. waspaa 2023: 1-5 [doi]
Optimizing Higher-Order Directional Audio Coding with Adaptive Mixing and Energy Matching for Ambisonic Compression and Upmixing
Christoph Hold, Leo McCormack, Archontis Politis, Ville Pulkki. waspaa 2023: 1-5 [doi]
Adaptive Sparse Linear Prediction in Fixed-Filter ANC Headphone Applications for Multi-Speaker Speech Reduction
Yurii Iotov, Sidsel Marie Nørholm, Valiantsin Belyi, Mads Græsbøll Christensen. waspaa 2023: 1-5 [doi]
Region-of-Interest Oriented Constant-Beamwidth Beamforming with Rectangular Arrays
Gal Itzhak, Israel Cohen. waspaa 2023: 1-5 [doi]
Deep Adaptation Control for Stereophonic Acoustic Echo Cancellation
Amir Ivry, Israel Cohen, Baruch Berdugo. waspaa 2023: 1-5 [doi]
Music De-Limiter Networks Via Sample-Wise Gain Inversion
Chang-Bin Jeon, Kyogu Lee. waspaa 2023: 1-5 [doi]
Hybrid Noise Shaping for Audio Coding Using Perfectly Overlapped Window
Byeongho Jo, Seungkwon Beack. waspaa 2023: 1-5 [doi]
Flexible Multichannel Speech Enhancement for Noise-Robust Frontend
Ante Jukic, Jagadeesh Balam, Boris Ginsburg. waspaa 2023: 1-5 [doi]
A High-Rate Extension to SoundStream
Hong-Goo Kang, Jan Skoglund, W. Bastiaan Kleijn, Andrew Storus, Hengchin Yeh. waspaa 2023: 1-5 [doi]
All-in-One Metrical and Functional Structure Analysis with Neighborhood Attentions on Demixed Audio
Taejun Kim, Juhan Nam. waspaa 2023: 1-5 [doi]
Perceptual Quality Enhancement of Sound Field Synthesis Based on Combination of Pressure and Amplitude Matching
Keisuke Kimura, Shoichi Koyama, Hiroshi Saruwatari. waspaa 2023: 1-5 [doi]
Compressing Audio CNNs with Graph Centrality Based Filter Pruning
James A. King, Arshdeep Singh, Mark D. Plumbley. waspaa 2023: 1-5 [doi]
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations
Yuma Koizumi, Heiga Zen, Shigeki Karita, Yifan Ding, Kohei Yatabe, Nobuyuki Morioka, Yu Zhang 0033, Wei Han, Ankur Bapna, Michiel Bacchiani. waspaa 2023: 1-5 [doi]
Kernel Interpolation of Incident Sound Field in Region Including Scattering Objects
Shoichi Koyama, Masaki Nakada, Juliano G. C. Ribeiro, Hiroshi Saruwatari. waspaa 2023: 1-5 [doi]
Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions
Saksham Singh Kushwaha, Irán R. Román, Magdalena Fuentes, Juan Pablo Bello. waspaa 2023: 1-5 [doi]
A Novel Method to Detect Instrumental Music in a Large Scale Music Catalog
Wo Jae Lee, Emanuele Coviello. waspaa 2023: 1-5 [doi]
AECSQI: Referenceless Acoustic Echo Cancellation Measures Using Speech Quality and Intelligibility Improvement
Jin Woo Lee, Hyeong-Seok Choi, Kyogu Lee. waspaa 2023: 1-5 [doi]
Yet Another Generative Model for Room Impulse Response Estimation
Sungho Lee, Hyeong-Seok Choi, Kyogu Lee. waspaa 2023: 1-5 [doi]
Diffusion Posterior Sampling for Informed Single-Channel Dereverberation
Jean-Marie Lemercier, Simon Welker, Timo Gerkmann. waspaa 2023: 1-5 [doi]
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs
Yinghao Aaron Li, Cong Han, Nima Mesgarani. waspaa 2023: 1-5 [doi]
Robust Audio Anti-Spoofing System Based on Low-Frequency Sub-Band Information
Menglu Li, Xiao-Ping Zhang. waspaa 2023: 1-5 [doi]
Fitting Auditory Filterbanks with Multiresolution Neural Networks
Vincent Lostanlen, Daniel Haider, Han Han, Mathieu Lagrange, Péter Balázs, Martin Ehler. waspaa 2023: 1-5 [doi]
Representation Learning for Audio Privacy Preservation Using Source Separation and Robust Adversarial Learning
Diep Luong, Minh Tran, Shayan Gharib, Konstantinos Drossos, Tuomas Virtanen. waspaa 2023: 1-5 [doi]
Convolutive Block-Matching Segmentation Algorithm with Application to Music Structure Analysis
Axel Marmoret, Jérémy E. Cohen, Frédéric Bimbot. waspaa 2023: 1-5 [doi]
Signal Reconstruction from Mel-Spectrogram Based on Bi-Level Consistency of Full-Band Magnitude and Phase
Yoshiki Masuyama, Natsuki Ueno, Nobutaka Ono. waspaa 2023: 1-5 [doi]
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Yoshiki Masuyama, Xuankai Chang, Wangyou Zhang, Samuele Cornell, Zhong-qiu Wang, Nobutaka Ono, Yanmin Qian, Shinji Watanabe 0001. waspaa 2023: 1-5 [doi]
Relative Transfer Function Vector Estimation for Acoustic Sensor Networks Exploiting Covariance Matrix Structure
Wiebke Middelberg, Henri Gode, Simon Doclo. waspaa 2023: 1-5 [doi]
Differentiable Representation of Warping Based on Lie Group Theory
Atsushi Miyashita, Tomoki Toda. waspaa 2023: 1-5 [doi]
Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning
Ilyass Moummad, Nicolas Farrugia. waspaa 2023: 1-5 [doi]
Single-Channel Speaker Distance Estimation in Reverberant Environments
Michael Neri, Archontis Politis, Daniel Krause 0001, Marco Carli, Tuomas Virtanen. waspaa 2023: 1-5 [doi]
Time-Domain Audio Source Separation Based on Gaussian Processes with Deep Kernel Learning
Aditya Arie Nugraha, Diego Di Carlo, Yoshiaki Bando, Mathieu Fontaine 0002, Kazuyoshi Yoshii. waspaa 2023: 1-5 [doi]
Automatic Detection of Poor Tone Quality in Classical Guitar Playing Using Deep Anomaly Detection Method
Kenta Ogawa, Shun Sawada, Kouichi Katsurada, Hidehumi Ohmura. waspaa 2023: 1-5 [doi]
Wide-Area 6DOF Rendering of Multi-Point Ambisonic Recordings Based on Interpolation of Spatial Parameters
Archontis Politis, Lauros Pajunen, Jussi Leppänen, Sujeet Mate, Antti J. Eronen. waspaa 2023: 1-5 [doi]
Computing Acoustic Onsets Via an Eikonal Solver
Samuel F. Potter, Monte Hoover, Dmitry N. Zotkin, Ramani Duraiswami. waspaa 2023: 1-5 [doi]
Location as Supervision for Weakly Supervised Multi-Channel Source Separation of Machine Sounds
Ricardo Falcón Pérez, Gordon Wichern, François G. Germain, Jonathan Le Roux. waspaa 2023: 1-5 [doi]
Neural Networks for Interference Reduction in Multi-Track Recordings
Rajesh R, Padmanabhan Rajan. waspaa 2023: 1-5 [doi]
General Purpose Audio Effect Removal
Matthew Rice, Christian J. Steinmetz, George Fazekas, Joshua D. Reiss. waspaa 2023: 1-5 [doi]
Histogram Layer Time Delay Neural Networks for Passive Sonar Classification
Jarin Ritu, Ethan Barnes, Riley Martell, Alexandra Van Dine, Joshua Peeples. waspaa 2023: 1-5 [doi]
Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer
Shivam Saini, Jürgen Peissig. waspaa 2023: 1-5 [doi]
Leveraging Synthetic Data for Improving Chamber Ensemble Separation
Saurjya Sarkar, Louise Thorpe, Emmanouil Benetos, Mark Sandler 0001. waspaa 2023: 1-5 [doi]
Array Configuration Mismatch in Deep DOA Estimation: Towards Robust Training
Ayal Schwartz, Elior Hadad, Sharon Gannot, Shlomo E. Chazan. waspaa 2023: 1-5 [doi]
Distribution of Modal Damping in Absorptive Shoebox Rooms
Maximilian Schäfer, Karolina Prawda, Rudolf Rabenstein, Sebastian J. Schlecht. waspaa 2023: 1-5 [doi]
Efficient Deep Acoustic Echo Suppression with Condition-Aware Training
Ernst Seidel, Pejman Mowlaee, Tim Fingscheidt. waspaa 2023: 1-5 [doi]
Annotating Jazz Recordings Using Lead Sheet Alignment with Deep Chroma Features
Ivan Shanin, Simon Dixon. waspaa 2023: 1-5 [doi]
Consolidating Compression and Revisiting Expansion: An Alternative Amplification Rule for Wide Dynamic Range Compression
Alice Sokolova, Baris Aksanli, Fred Harris 0001, Harinath Garudadri. waspaa 2023: 1-5 [doi]
Multichannel Subband-Fullband Gated Convolutional Recurrent Neural Network for Direction-Based Speech Enhancement with Head-Mounted Microphone Arrays
Benjamin Stahl, Alois Sontacchi. waspaa 2023: 1-5 [doi]
Analysis of XLS-R for Speech Quality Assessment
Bastiaan Tamm, Rik Vandenberghe, Hugo Van Hamme. waspaa 2023: 1-5 [doi]
Single Channel Speech Presence Probability Estimation based on Hybrid Global-Local Information
Shuai Tao, Yang Xiang, Himavanth Reddy, Jesper Rindom Jensen, Mads Græsbøll Christensen. waspaa 2023: 1-5 [doi]
Multi-Source Direction-of-Arrival Estimation using Group-Sparse Fitting of Steered Response Power Maps
Elisa Tengan, Thomas Dietzen, Filip Elvander, Toon van Waterschoot. waspaa 2023: 1-5 [doi]
Inverted Cardioid Topology for Multi-Radius Spherical Microphone Arrays
Mark R. P. Thomas, Jan-Hendrik Hanschke. waspaa 2023: 1-5 [doi]
Perceptual Musical Similarity Metric Learning with Graph Neural Networks
Cyrus Vahidi, Shubhr Singh, Emmanouil Benetos, Huy Phan, Dan Stowell, György Fazekas, Mathieu Lagrange. waspaa 2023: 1-5 [doi]
Low-Complexity Higher Order Scattering Delay Networks
Leny Vinceslas, Matteo Scerbo, Hüseyin Hacihabiboglu, Zoran Cvetkovic, Enzo De Sena. waspaa 2023: 1-5 [doi]
Unsupervised Improvement of Audio-Text Cross-Modal Representations
Zhepei Wang, Cem Subakan, Krishna Subramani, Junkai Wu, Tiago Tavares, Fábio Ayres, Paris Smaragdis. waspaa 2023: 1-5 [doi]
Directional Target Speaker Extraction under Noisy Underdetermined Conditions through Conditional Variational Autoencoder with Global Style Tokens
Rui Wang, Tomoki Toda. waspaa 2023: 1-5 [doi]
Mitigating Cross-Database Differences for Learning Unified HRTF Representation
Yutong Wen, You Zhang 0001, Zhiyao Duan. waspaa 2023: 1-5 [doi]
Low Bit Rate Binaural Link for Improved Ultra Low-Latency Low-Complexity Multichannel Speech Enhancement in Hearing Aids
Nils L. Westhausen, Bernd T. Meyer. waspaa 2023: 1-5 [doi]
A Differentiable Acoustic Guitar Model for String-Specific Polyphonic Synthesis
Andrew Wiggins, Youngmoo E. Kim. waspaa 2023: 1-5 [doi]
Bridging High-Quality Audio and Video Via Language for Sound Effects Retrieval from Visual Queries
Julia Wilkins, Justin Salamon, Magdalena Fuentes, Juan Pablo Bello, Oriol Nieto. waspaa 2023: 1-5 [doi]
Masked Frequency Modeling for Improving Packet Loss Concealment in Speech Transmission Systems
Da-Hee Yang, Donghyun Kim, Joon-Hyuk Chang. waspaa 2023: 1-5 [doi]
A Differentiable Image Source Model for Room Acoustics Optimization
Bowen Zhi, Alisha Sharma, Dmitry N. Zotkin, Ramani Duraiswami. waspaa 2023: 1-5 [doi]
Extending Audio Masked Autoencoders toward Audio Restoration
Zhi Zhong, Hao Shi, Masato Hirano, Kazuki Shimada, Kazuya Tateishi, Takashi Shibuya 0001, Shusuke Takahashi, Yuki Mitsufuji. waspaa 2023: 1-5 [doi]
Learning Sub-Dimensional HRTF Representations Towards Individualization Applications - Traditional and Deep Learning Approaches
Devansh Zurale, Shlomo Dubnov. waspaa 2023: 1-5 [doi]
2021
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2021, New Paltz, NY, USA, October 17-20, 2021
IEEE, 2021. [doi]
Stochastic Reverberation Model with a Frequency Dependent Attenuation
Achille Aknin, Roland Badeau. waspaa 2021: 351-355 [doi]
Adaptive Binaural Filtering for a Multiple-Talker Listening System Using Remote and On-Ear Microphones
Ryan M. Corey, Andrew C. Singer. waspaa 2021: 1-5 [doi]
Spatial Subtraction of Reflections from Room Impulse Responses Measured with a Spherical Microphone Array
Thomas Deppisch, Jens Ahrens, Sebastià V. Amengual Garí, Paul Calamia. waspaa 2021: 346-350 [doi]
Speech Intelligibility of Mandarin- and German-Speaking Listeners in Challenging Conditions
Hongmei Hu, Stephan Dieter Ewert. waspaa 2021: 86-90 [doi]
Low-Order Filter Approximation of Diffraction for Virtual Acoustics
Christoph Kirsch, Stephan Dieter Ewert. waspaa 2021: 341-345 [doi]
DF-Conformer: Integrated Architecture of Conv-TasNet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement
Yuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani. waspaa 2021: 161-165 [doi]
A Universal Deep Room Acoustics Estimator
Paula Sánchez López, Paul Callens, Milos Cernak. waspaa 2021: 356-360 [doi]
End-to-End Zero-Shot Voice Conversion Using a DDSP Vocoder
Shahan Nercessian. waspaa 2021: 1-5 [doi]
Zero-Shot Personalized Speech Enhancement Through Speaker-Informed Model Selection
Aswin Sivaraman, Minje Kim. waspaa 2021: 171-175 [doi]
On the Role of Lip Reflection/Transmission in the Relationship Between LPC and Waveguide Vocal Tract Models
Tamara Smyth, Devansh Zurale. waspaa 2021: 311-315 [doi]
SIDIQ: Computational Quality Assessment of Enhanced Speech Based on Auditory Figure-Ground Segregation, Similarity, and Disturbance
Benjamin Stahl, Alois Sontacchi. waspaa 2021: 96-100 [doi]
MIMII DUE: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection with Domain Shifts Due to Changes in Operational and Environmental Conditions
Ryo Tanabe, Harsh Purohit, Kota Dohi, Takashi Endo, Yuki Nikaido, Toshiki Nakamura, Yohei Kawaguchi. waspaa 2021: 21-25 [doi]
Controlling the Remixing of Separated Dialogue with a Non-Intrusive Quality Estimate
Matteo Torcoli, Jouni Paulus, Thorsten Kastner, Christian Uhle. waspaa 2021: 91-95 [doi]
Excitation-Inhibition Cell Activity Patterns for Binaural Source Localisation
Hsuan-Yang Wang, Philip Nelson, Christine Evers. waspaa 2021: 81-85 [doi]
Towards Large Scale Ecoacoustic Monitoring with Small Amounts of Labeled Data
Enis Berk Çoban, Ali Raza Syed, Dara Pir, Michael I. Mandel. waspaa 2021: 181-185 [doi]