WASPAA - researchr conference series publications

researchr

You are not signed in
Sign in
Sign up

Viewing Publication 1 - 100 from 671

2023

SEFGAN: Harvesting the Power of Normalizing Flows and GANs for Efficient High-Quality Speech EnhancementMartin Strauss 0003, Nicola Pia, Nagashree K. S. Rao, Bernd Edler. waspaa 2023: 1-5 [doi]

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023, New Paltz, NY, USA, October 22-25, 2023IEEE, 2023. [doi]

Correlation Based Glimpse Proportion IndexAhmed Alghamdi, Leonard Moen, Wai-Yip Chan, Daniel Fogerty, Jesper Jensen 0001. waspaa 2023: 1-5 [doi]

Neural Audio Decorrelation Using Generative Adversarial NetworksCarlotta Anemüller, Oliver Thiergart, Emanuël A. P. Habets. waspaa 2023: 1-5 [doi]

Audio Inputs for Active Speaker Detection and Localization Via Microphone ArrayDavide Berghi, Philip J. B. Jackson. waspaa 2023: 1-5 [doi]

Complete and Separate: Conditional Separation with Missing Target Source Attribute CompletionDimitrios Bralios, Efthymios Tzinis, Paris Smaragdis. waspaa 2023: 1-5 [doi]

Design of Frequency-Invariant Beamformers with Sparse Concentric Circular ArraysYaakov Buchris, Israel Cohen, Alon Amar. waspaa 2023: 1-5 [doi]

Class Activation Mapping-Driven Data Augmentation: Masking Significant Regions for Enhanced Acoustic Scene ClassificationPil Moo Byun, Jeong Hwan Choi, Joon-Hyuk Chang. waspaa 2023: 1-5 [doi]

Lace: A Light-Weight, Causal Model for Enhancing Coded Speech Through Adaptive ConvolutionsJan Büthe, Jean-Marc Valin, Ahmed Mustafa. waspaa 2023: 1-5 [doi]

Towards on-Device Keyword Spotting using Low-Footprint Quaternion Neural ModelsAryan Chaudhary, Vinayak Abrol. waspaa 2023: 1-5 [doi]

The Effect of Spoken Language on Speech Enhancement Using Self-Supervised Speech Representation Loss FunctionsGeorge Close, Thomas Hain, Stefan Goetze. waspaa 2023: 1-5 [doi]

Mixed-Delay Distributed Beamforming for Own-Speech Separation in Hearing Devices with Wireless Remote MicrophonesRyan M. Corey. waspaa 2023: 1-5 [doi]

An Improved Metric of Informational Masking for Perceptual Audio Quality MeasurementPablo M. Delgado, Jürgen Herre. waspaa 2023: 1-5 [doi]

Estimating the Direction of Arrival of a Spoken Wake Word Using a Single Sensor on an Elastic PanelTre DiPassio, Michael C. Heilemann, Benjamin Thompson, Mark F. Bocko. waspaa 2023: 1-5 [doi]

CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision ModelsHao-Wen Dong, Xiaoyu Liu, Jordi Pons, Gautam Bhattacharya, Santiago Pascual, Joan Serrà, Taylor Berg-Kirkpatrick, Julian J. McAuley. waspaa 2023: 1-5 [doi]

Slim-Tasnet: A Slimmable Neural Network for Speech SeparationMohamed Elminshawi, Srikanth Raj Chetupalli, Emanuël A. P. Habets. waspaa 2023: 1-5 [doi]

Predicting Thresholds in an Auditory Overshoot Paradigm Using a Computational Subcortical Model with Efferent FeedbackAfagh Farhadi, Laurel H. Carney. waspaa 2023: 1-5 [doi]

Temporal Noise Shaping on MDCT Subband Signals for Transform Audio CodingRichard Füg, Bernd Edler. waspaa 2023: 1-5 [doi]

Hyperbolic Unsupervised Anomalous Sound DetectionFrançois G. Germain, Gordon Wichern, Jonathan Le Roux. waspaa 2023: 1-5 [doi]

Covariance Blocking and Whitening Method for Successive Relative Transfer Function Vector Estimation in Multi-Speaker ScenariosHenri Gode, Simon Doclo. waspaa 2023: 1-5 [doi]

Quaternion Anti-Transfer Learning for Speech Emotion RecognitionEric Guizzo, Tillman Weyde, Giacomo Tarroni, Danilo Comminiello. waspaa 2023: 1-5 [doi]

An Objective Evaluation of Hearing AIDS and DNN-Based Binaural Speech Enhancement in Complex Acoustic ScenesEnric Gusó, Joanna Luberadzka, Martí Baig, Umut Sayin Saraç, Xavier Serra. waspaa 2023: 1-5 [doi]

Diff-Pitcher: Diffusion-Based Singing Voice Pitch CorrectionJiarui Hai, Mounya Elhilali. waspaa 2023: 1-5 [doi]

Optimizing Higher-Order Directional Audio Coding with Adaptive Mixing and Energy Matching for Ambisonic Compression and UpmixingChristoph Hold, Leo McCormack, Archontis Politis, Ville Pulkki. waspaa 2023: 1-5 [doi]

Adaptive Sparse Linear Prediction in Fixed-Filter ANC Headphone Applications for Multi-Speaker Speech ReductionYurii Iotov, Sidsel Marie Nørholm, Valiantsin Belyi, Mads Græsbøll Christensen. waspaa 2023: 1-5 [doi]

Region-of-Interest Oriented Constant-Beamwidth Beamforming with Rectangular ArraysGal Itzhak, Israel Cohen. waspaa 2023: 1-5 [doi]

Deep Adaptation Control for Stereophonic Acoustic Echo CancellationAmir Ivry, Israel Cohen, Baruch Berdugo. waspaa 2023: 1-5 [doi]

Music De-Limiter Networks Via Sample-Wise Gain InversionChang-Bin Jeon, Kyogu Lee. waspaa 2023: 1-5 [doi]

Hybrid Noise Shaping for Audio Coding Using Perfectly Overlapped WindowByeongho Jo, Seungkwon Beack. waspaa 2023: 1-5 [doi]

Flexible Multichannel Speech Enhancement for Noise-Robust FrontendAnte Jukic, Jagadeesh Balam, Boris Ginsburg. waspaa 2023: 1-5 [doi]

A High-Rate Extension to SoundstreamHong-Goo Kang, Jan Skoglund, W. Bastiaan Kleijn, Andrew Storus, Hengchin Yeh. waspaa 2023: 1-5 [doi]

All-in-One Metrical and Functional Structure Analysis with Neighborhood Attentions on Demixed AudioTaejun Kim, Juhan Nam. waspaa 2023: 1-5 [doi]

Perceptual Quality Enhancement of Sound Field Synthesis Based on Combination of Pressure and Amplitude MatchingKeisuke Kimura, Shoichi Koyama, Hiroshi Saruwatari. waspaa 2023: 1-5 [doi]

Compressing Audio CNNS with Graph Centrality Based Filter PruningJames A. King, Arshdeep Singh, Mark D. Plumbley. waspaa 2023: 1-5 [doi]

Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text RepresentationsYuma Koizumi, Heiga Zen, Shigeki Karita, Yifan Ding, Kohei Yatabe, Nobuyuki Morioka, Yu Zhang 0033, Wei Han, Ankur Bapna, Michiel Bacchiani. waspaa 2023: 1-5 [doi]

Kernel Interpolation of Incident Sound Field in Region Including Scattering ObjectsShoichi Koyama, Masaki Nakada, Juliano G. C. Ribeiro, Hiroshi Saruwatari. waspaa 2023: 1-5 [doi]

Sound Source Distance Estimation in Diverse and Dynamic Acoustic ConditionsSaksham Singh Kushwaha, Irán R. Román, Magdalena Fuentes, Juan Pablo Bello. waspaa 2023: 1-5 [doi]

A Novel Method to Detect Instrumental Music in a Large Scale Music CatalogWo Jae Lee, Emanuele Coviello. waspaa 2023: 1-5 [doi]

AECSQI: Referenceless Acoustic Echo Cancellation Measures Using Speech Quality and Intelligibility ImprovementJin Woo Lee, Hyeong-Seok Choi, Kyogu Lee. waspaa 2023: 1-5 [doi]

Yet Another Generative Model for Room Impulse Response EstimationSungho Lee, Hyeong-Seok Choi, Kyogu Lee. waspaa 2023: 1-5 [doi]

Diffusion Posterior Sampling for Informed Single-Channel DereverberationJean-Marie Lemercier, Simon Welker, Timo Gerkmann. waspaa 2023: 1-5 [doi]

SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANsYinghao Aaron Li, Cong Han, Nima Mesgarani. waspaa 2023: 1-5 [doi]

Robust Audio Anti-Spoofing System Based on Low-Frequency Sub-Band InformationMenglu Li, Xiao-Ping Zhang. waspaa 2023: 1-5 [doi]

Fitting Auditory Filterbanks with Multiresolution Neural NetworksVincent Lostanlen, Daniel Haider, Han Han, Mathieu Lagrange, Péter Balázs, Martin Ehler. waspaa 2023: 1-5 [doi]

Representation Learning for Audio Privacy Preservation Using Source Separation and Robust Adversarial LearningDiep Luong, Minh Tran, Shayan Gharib, Konstantinos Drossos, Tuomas Virtanen. waspaa 2023: 1-5 [doi]

Convolutive Block-Matching Segmentation Algorithm with Application to Music Structure AnalysisAxel Marmoret, Jérémy E. Cohen, Frédéric Bimbot. waspaa 2023: 1-5 [doi]

Signal Reconstruction from Mel-Spectrogram Based on Bi-Level Consistency of Full-Band Magnitude and PhaseYoshiki Masuyama, Natsuki Ueno, Nobutaka Ono. waspaa 2023: 1-5 [doi]

Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning RepresentationYoshiki Masuyama, Xuankai Chang, Wangyou Zhang, Samuele Cornell, Zhong-qiu Wang, Nobutaka Ono, Yanmin Qian, Shinji Watanabe 0001. waspaa 2023: 1-5 [doi]

Relative Transfer Function Vector Estimation for Acoustic Sensor Networks Exploiting Covariance Matrix StructureWiebke Middelberg, Henri Gode, Simon Doclo. waspaa 2023: 1-5 [doi]

Differentiable Representation of Warping Based on Lie Group TheoryAtsushi Miyashita, Tomoki Toda. waspaa 2023: 1-5 [doi]

Pretraining Respiratory Sound Representations using Metadata and Contrastive LearningIlyass Moummad, Nicolas Farrugia. waspaa 2023: 1-5 [doi]

Single-Channel Speaker Distance Estimation in Reverberant EnvironmentsMichael Neri, Archontis Politis, Daniel Krause 0001, Marco Carli, Tuomas Virtanen. waspaa 2023: 1-5 [doi]

Time-Domain Audio Source Separation Based on Gaussian Processes with Deep Kernel LearningAditya Arie Nugraha, Diego Di Carlo, Yoshiaki Bando, Mathieu Fontaine 0002, Kazuyoshi Yoshii. waspaa 2023: 1-5 [doi]

Automatic Detection of Poor Tone Quality in Classical Guitar Playing Using Deep Anomaly Detection MethodKenta Ogawa, Shun Sawada, Kouichi Katsurada, Hidehumi Ohmura. waspaa 2023: 1-5 [doi]

Wide-Area 6DOF Rendering of Multi-Point Ambisonic Recordings Based on Interpolation of Spatial ParametersArchontis Politis, Lauros Pajunen, Jussi Leppänen, Sujeet Mate, Antti J. Eronen. waspaa 2023: 1-5 [doi]

Computing Acoustic Onsets Via an Eikonal SolverSamuel F. Potter, Monte Hoover, Dmitry N. Zotkin, Ramani Duraiswami. waspaa 2023: 1-5 [doi]

Location as Supervision for Weakly Supervised Multi-Channel Source Separation of Machine SoundsRicardo Falcón Pérez, Gordon Wichern, François G. Germain, Jonathan Le Roux. waspaa 2023: 1-5 [doi]

Neural Networks for Interference Reduction in Multi-Track RecordingsRajesh R, Padmanabhan Rajan. waspaa 2023: 1-5 [doi]

General Purpose Audio Effect RemovalMatthew Rice, Christian J. Steinmetz, George Fazekas, Joshua D. Reiss. waspaa 2023: 1-5 [doi]

Histogram Layer Time Delay Neural Networks for Passive Sonar ClassificationJarin Ritu, Ethan Barnes, Riley Martell, Alexandra Van Dine, Joshua Peeples. waspaa 2023: 1-5 [doi]

Blind Room Acoustic Parameters Estimation Using Mobile Audio TransformerShivam Saini, Jürgen Peissig. waspaa 2023: 1-5 [doi]

Leveraging Synthetic Data for Improving Chamber Ensemble SeparationSaurjya Sarkar, Louise Thorpe, Emmanouil Benetos, Mark Sandler 0001. waspaa 2023: 1-5 [doi]

Array Configuration Mismatch in Deep DOA Estimation: Towards Robust TrainingAyal Schwartz, Elior Hadad, Sharon Gannot, Shlomo E. Chazan. waspaa 2023: 1-5 [doi]

Distribution of Modal Damping in Absorptive Shoebox RoomsMaximilian Schäfer, Karolina Prawda, Rudolf Rabenstein, Sebastian J. Schlecht. waspaa 2023: 1-5 [doi]

Efficient Deep Acoustic Echo Suppression with Condition-Aware TrainingErnst Seidel, Pejman Mowlaee, Tim Fingscheidt. waspaa 2023: 1-5 [doi]

Annotating Jazz Recordings Using Lead Sheet Alignment with Deep Chroma FeaturesIvan Shanin, Simon Dixon. waspaa 2023: 1-5 [doi]

Consolidating Compression and Revisiting Expansion: an Alternative Amplification Rule for Wide Dynamic Range CompressionAlice Sokolova, Baris Aksanli, Fred Harris 0001, Harinath Garudadri. waspaa 2023: 1-5 [doi]

Multichannel Subband-Fullband Gated Convolutional Recurrent Neural Network for Direction-Based Speech Enhancement with Head-Mounted Microphone ArraysBenjamin Stahl, Alois Sontacchi. waspaa 2023: 1-5 [doi]

Analysis of XLS-R for Speech Quality AssessmentBastiaan Tamm, Rik Vandenberghe, Hugo Van Hamme. waspaa 2023: 1-5 [doi]

Single Channel Speech Presence Probability Estimation based on Hybrid Global-Local InformationShuai Tao, Yang Xiang, Himavanth Reddy, Jesper Rindom Jensen, Mads Græsbøll Christensen. waspaa 2023: 1-5 [doi]

Multi-Source Direction-of-Arrival Estimation using Group-Sparse Fitting of Steered Response Power MapsElisa Tengan, Thomas Dietzen, Filip Elvander, Toon van Waterschoot. waspaa 2023: 1-5 [doi]

Inverted Cardioid Topology for Multi-Radius Spherical Microphone ArraysMark R. P. Thomas, Jan-Hendrik Hanschke. waspaa 2023: 1-5 [doi]

Perceptual Musical Similarity Metric Learning with Graph Neural NetworksCyrus Vahidi, Shubhr Singh, Emmanouil Benetos, Huy Phan, Dan Stowell, György Fazekas, Mathieu Lagrange. waspaa 2023: 1-5 [doi]

Low-Complexity Higher Order Scattering Delay NetworksLeny Vinceslas, Matteo Scerbo, Hüseyin Hacihabiboglu, Zoran Cvetkovic, Enzo De Sena. waspaa 2023: 1-5 [doi]

Unsupervised Improvement of Audio-Text Cross-Modal RepresentationsZhepei Wang, Cem Subakan, Krishna Subramani, Junkai Wu, Tiago Tavares, Fábio Ayres, Paris Smaragdis. waspaa 2023: 1-5 [doi]

Directional Target Speaker Extraction under Noisy Underdetermined Conditions through Conditional Variational Autoencoder with Global Style TokensRui Wang, Tomoki Toda. waspaa 2023: 1-5 [doi]

Mitigating Cross-Database Differences for Learning Unified HRTF RepresentationYutong Wen, You Zhang 0001, Zhiyao Duan. waspaa 2023: 1-5 [doi]

Low Bit Rate Binaural Link for Improved Ultra Low-Latency Low-Complexity Multichannel Speech Enhancement in Hearing AidsNils L. Westhausen, Bernd T. Meyer. waspaa 2023: 1-5 [doi]

A Differentiable Acoustic Guitar Model for String-Specific Polyphonic SynthesisAndrew Wiggins, Youngmoo E. Kim. waspaa 2023: 1-5 [doi]

Bridging High-Quality Audio and Video Via Language for Sound Effects Retrieval from Visual QueriesJulia Wilkins, Justin Salamon, Magdalena Fuentes, Juan Pablo Bello, Oriol Nieto. waspaa 2023: 1-5 [doi]

Masked Frequency Modeling for Improving Packet Loss Concealment in Speech Transmission SystemsDa-Hee Yang, Donghyun Kim, Joon-Hyuk Chang. waspaa 2023: 1-5 [doi]

A Differentiable Image Source Model for Room Acoustics OptimizationBowen Zhi, Alisha Sharma, Dmitry N. Zotkin, Ramani Duraiswami. waspaa 2023: 1-5 [doi]

Extending Audio Masked Autoencoders toward Audio RestorationZhi Zhong, Hao Shi, Masato Hirano, Kazuki Shimada, Kazuya Tateishi, Takashi Shibuya 0001, Shusuke Takahashi, Yuki Mitsufuji. waspaa 2023: 1-5 [doi]

Learning Sub-Dimensional HRTF Representations Towards Individualization Applications - Traditional and Deep Learning ApproachesDevansh Zurale, Shlomo Dubnov. waspaa 2023: 1-5 [doi]

2021

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2021, New Paltz, NY, USA, October 17-20, 2021IEEE, 2021. [doi]

Stochastic Reverberation Model with a Frequency Dependent AttenuationAchille Aknin, Roland Badeau. waspaa 2021: 351-355 [doi]

Adaptive Binaural Filtering for a Multiple-Talker Listening System Using Remote and On-Ear MicrophonesRyan M. Corey, Andrew C. Singer. waspaa 2021: 1-5 [doi]

Spatial Subtraction of Reflections from Room Impulse Responses Measured with a Spherical Microphone ArrayThomas Deppisch, Jens Ahrens, Sebastià V. Amengual Garí, Paul Calamia. waspaa 2021: 346-350 [doi]

Speech Intelligibility of Mandarin- and German-Speaking Listeners in Challenging ConditionsHongmei Hu, Stephan Dieter Ewert. waspaa 2021: 86-90 [doi]

Low-Order Filter Approximation of Diffraction for Virtual AcousticsChristoph Kirsch, Stephan Dieter Ewert. waspaa 2021: 341-345 [doi]

DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech EnhancementYuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani. waspaa 2021: 161-165 [doi]

A Universal Deep Room Acoustics EstimatorPaula Sánchez López, Paul Callens, Milos Cernak. waspaa 2021: 356-360 [doi]

End-to-End Zero-Shot Voice Conversion Using a DDSP VocoderShahan Nercessian. waspaa 2021: 1-5 [doi]

Zero-Shot Personalized Speech Enhancement Through Speaker-Informed Model SelectionAswin Sivaraman, Minje Kim. waspaa 2021: 171-175 [doi]

On the Role of Lip Reflection/Transmission in the Relationship Between LPC and Waveguide Vocal Tract ModelsTamara Smyth, Devansh Zurale. waspaa 2021: 311-315 [doi]

SIDIQ: Computational Quality Assessment of Enhanced Speech Based on Auditory Figure-Ground Segregation, Similarity, and DisturbanceBenjamin Stahl, Alois Sontacchi. waspaa 2021: 96-100 [doi]

MIMII Due: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection with Domain Shifts Due to Changes in Operational and Environmental ConditionsRyo Tanabe, Harsh Purohit, Kota Dohi, Takashi Endo, Yuki Nikaido, Toshiki Nakamura, Yohei Kawaguchi. waspaa 2021: 21-25 [doi]

Controlling the Remixing of Separated Dialogue with a Non-Intrusive Quality EstimateMatteo Torcoli, Jouni Paulus, Thorsten Kastner, Christian Uhle. waspaa 2021: 91-95 [doi]

Excitation-Inhibition Cell Activity Patterns for Binaural Source LocalisationHsuan-Yang Wang, Philip Nelson, Christine Evers. waspaa 2021: 81-85 [doi]

Towards Large Scale Ecoacoustic Monitoring with Small Amounts of Labeled DataEnis Berk Çoban, Ali Raza Syed, Dara Pir, Michael I. Mandel. waspaa 2021: 181-185 [doi]

Links

Filter by Year
OR AND NOT 1

Filter by Tag

Filter by Author

[+]
OR AND NOT 1

Filter by Top terms

[+]
OR AND NOT 1

WASPAA (waspaa)

Viewing Publication 1 - 100 from 671

2023

2021

Links

Filter by YearOR AND NOT 1

Filter by Tag

Filter by Author [+]OR AND NOT 1

Filter by Top terms [+]OR AND NOT 1

WASPAA (waspaa)

Viewing Publication 1 - 100 from 671

2023

2021

Filter by Year
OR AND NOT 1

Filter by Author

[+]
OR AND NOT 1

Filter by Top terms

[+]
OR AND NOT 1