| 1 | -- | 0 | Damian Koszewski, Thomas Görne, Grazina Korvel, Bozena Kostek. Automatic music signal mixing system based on one-dimensional Wave-U-Net autoencoders |
| 2 | -- | 0 | Jiale Qian, Xinlu Liu, Yi Yu, Wei Li 0012. Stripe-Transformer: deep stripe feature learning for music source separation |
| 3 | -- | 0 | Prashanth H. C., Madhav Rao, Dhanya Eledath, Ramasubramanian C. Trainable windows for SincNet architecture |
| 4 | -- | 0 | Mariusz Klec, Alicja Wieczorkowska, Krzysztof Szklanny, Wlodzimierz Strus. Beyond the Big Five personality traits for music recommendation systems |
| 5 | -- | 0 | Noriyuki Tonami, Keisuke Imoto. Sound event triage: detecting sound events considering priority of classes |
| 6 | -- | 0 | Jiangyu Han, Yanhua Long. Heterogeneous separation consistency training for adaptation of unsupervised speech separation |
| 7 | -- | 0 | Tingting Wang, Haiyan Guo, Zirui Ge, Qiquan Zhang, Zhen Yang 0001. An MMSE graph spectral magnitude estimator for speech signals residing on an undirected multiple graph |
| 8 | -- | 0 | Douglas D. O'Shaughnessy. Review of methods for coding of speech signals |
| 9 | -- | 0 | Prashanth H. C., Madhav Rao, Dhanya Eledath, V. Ramasubramanian 0001. Correction: Trainable windows for SincNet architecture |
| 10 | -- | 0 | Xiaoping Xie, Yongzhen Chen, Rufeng Shen, Dan Tian. Research on monaural speech segregation based on feature selection |
| 11 | -- | 0 | Alessandro Ilic Mezza, Massimiliano Zanoni, Augusto Sarti. A latent rhythm complexity model for attribute-controlled drum pattern generation |
| 12 | -- | 0 | Oliviero Massi, Alessandro Ilic Mezza, Riccardo Giampiccolo, Alberto Bernardini. Deep learning-based wave digital modeling of rate-dependent hysteretic nonlinearities for virtual analog applications |
| 13 | -- | 0 | Fabian Ostermann, Igor Vatolkin, Martin Ebeling. AAM: a dataset of Artificial Audio Multitracks for diverse music information retrieval tasks |
| 14 | -- | 0 | Zimu Li, Yanyan Xu 0001, Dengfeng Ke, Kaile Su. Three-stage training and orthogonality regularization for spoken language recognition |
| 15 | -- | 0 | Pu Wang, Hugo Van Hamme. Benefits of pre-trained mono- and cross-lingual speech representations for spoken language understanding of Dutch dysarthric speech |
| 16 | -- | 0 | Xiao-Yuan Guo, Chun-xian Gao, Hui Liu. Voice activity detection in the presence of transient based on graph |
| 17 | -- | 0 | Thomas Dietzen, Randall Ali, Maja Taseska, Toon van Waterschoot. MYRiAD: a multi-array room acoustic database |
| 18 | -- | 0 | Jean-Marie Lemercier, Joachim Thiemann, Raphael Koning, Timo Gerkmann. A neural network-supported two-stage algorithm for lightweight dereverberation on hearing devices |
| 19 | -- | 0 | Mauricio Araneda-Hernandez, Felipe Bravo-Marquez, Denis Parra, Rodrigo F. Cádiz. MUSIB: musical score inpainting benchmark |
| 20 | -- | 0 | Ashwin Bellur, Karan Thakkar, Mounya Elhilali. Explicit-memory multiresolution adaptive framework for speech and music separation |
| 21 | -- | 0 | Kunpeng Wang, Hao Zhou, Jingxiang Cai, Wenna Li, Juan Yao. Time-domain adaptive attention network for single-channel speech separation |
| 22 | -- | 0 | Gang Liu, Shifang Cai, Ce Wang. Speech emotion recognition based on emotion perception |
| 23 | -- | 0 | Tong Liu, Xiaochen Yuan. Paralinguistic and spectral feature extraction for speech emotion classification using machine learning techniques |
| 24 | -- | 0 | Luca Comanducci, Davide Gioiosa, Massimiliano Zanoni, Fabio Antonacci, Augusto Sarti. Variational Autoencoders for chord sequence generation conditioned on Western harmonic music complexity |
| 25 | -- | 0 | Zhe Han, Yuxuan Ke, Xiaodong Li 0002, Chengshi Zheng. Parallel processing of distributed beamforming and multichannel linear prediction for speech denoising and deverberation in wireless acoustic sensor networks |
| 26 | -- | 0 | Tugçe Melike Koçak, Büsra Çilem Dibek, Esma Nafiye Polat, Nilüfer Kafesçioglu, Cenk Demiroglu. Automatic detection of attachment style in married couples through conversation analysis |
| 27 | -- | 0 | Yuting Zhou, Hongjie Wan. Dual-branch attention module-based network with parameter sharing for joint sound event detection and localization |
| 28 | -- | 0 | Xingwei Liang, Zehua Zhang, Ruifeng Xu. Multi-task deep cross-attention networks for far-field speaker verification and keyword spotting |
| 29 | -- | 0 | Michael Günther 0003, Andreas Brendel, Walter Kellermann. Microphone utility estimation in acoustic sensor networks using single-channel signal features |
| 30 | -- | 0 | Shiyun Xu, Zehua Zhang, Mingjiang Wang. Channel and temporal-frequency attention UNet for monaural speech enhancement |
| 31 | -- | 0 | Te Zeng, Francis C. M. Lau 0001. Training audio transformers for cover song identification |
| 32 | -- | 0 | Eric Grinstein, Vincent W. Neo, Patrick A. Naylor. Dual input neural networks for positional sound source localization |
| 33 | -- | 0 | Zhiyong Chen, Shugong Xu. Learning domain-heterogeneous speaker recognition systems with personalized continual federated learning |
| 34 | -- | 0 | Lekai Zhang, Yingfan Wang, Kailun He, Hailong Zhang, Baixi Xing, Xiaofeng Liu, Fo Hu. The power of humorous audio: exploring emotion regulation in traffic congestion through EEG-based study |
| 35 | -- | 0 | Takao Kawamura, Kouei Yamaoka, Yukoh Wakabayashi, Nobutaka Ono, Ryoichi Miyazaki. Acoustic object canceller: removing a known signal from monaural recording using blind synchronization |
| 36 | -- | 0 | Yicheng Hsu, Mingsian R. Bai. Learning-based robust speaker counting and separation with the aid of spatial coherence |
| 37 | -- | 0 | Santiago Ruiz, Toon van Waterschoot, Marc Moonen. Cascade algorithms for combined acoustic feedback cancelation and noise reduction |
| 38 | -- | 0 | Elisa Tengan, Thomas Dietzen, Filip Elvander, Toon van Waterschoot. Direction-of-arrival and power spectral density estimation using a single directional microphone and group-sparse optimization |
| 39 | -- | 0 | Amin Saremi, Balaji Ramkumar, Ghazaleh Ghaffari, Zonghua Gu 0001. An acoustic echo canceller optimized for hands-free speech telecommunication in large vehicle cabins |
| 40 | -- | 0 | Yan Li, Yapeng Wang, Xu Yang, Sio Kei Im. Speech emotion recognition based on Graph-LSTM neural network |
| 41 | -- | 0 | Chunxi Wang, Maoshen Jia, Xinfeng Zhang. Deep encoder/decoder dual-path neural network for speech separation in noisy reverberation environments |
| 42 | -- | 0 | Jian Guan, Youde Liu, Qiuqiang Kong, Feiyang Xiao, Qiaoxi Zhu, Jiantong Tian, Wenwu Wang 0001. Transformer-based autoencoder with ID constraint for unsupervised anomalous sound detection |
| 43 | -- | 0 | Jingtan Li, Mengkai Sun, Zhonghao Zhao, Xingcan Li, Gaigai Li, Chen Wu, Kun Qian 0003, Bin Hu 0001, Yoshiharu Yamamoto, Björn W. Schuller. Battling with the low-resource condition for snore sound recognition: introducing a meta-learning strategy |
| 44 | -- | 0 | Le Ma, Xinda Wu, Ruiyuan Tang, Chongjun Zhong, Kejun Zhang. YuYin: a multi-task learning model of multi-modal e-commerce background music recommendation |
| 45 | -- | 0 | Hao Huang, Lin Wang, Jichen Yang, Ying Hu, Liang He 0003. W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision |
| 46 | -- | 0 | Stijn Kindt, Jenthe Thienpondt, Luca Becker, Nilesh Madhu. Robustness of ad hoc microphone clustering using speaker embeddings: evaluation under realistic and challenging scenarios |
| 47 | -- | 0 | Kavya Manohar, A. R. Jayan, Rajeev Rajan. Improving speech recognition systems for the morphologically complex Malayalam language using subword tokens for language modeling |
| 48 | -- | 0 | Zhaopeng Qian, Kejing Xiao, Chongchong Yu. A survey of technologies for automatic Dysarthric speech recognition |
| 49 | -- | 0 | Lekshmi Chandrika Reghunath, Rajeev Rajan. Predominant audio source separation in polyphonic music |
| 50 | -- | 0 | Huiwen Xue, Chenxin Sun, Mingcheng Tang, Chenrui Hu, Zhengqing Yuan, Min Huang, Zhongzhe Xiao. Effective acoustic parameters for automatic classification of performed and synthesized Guzheng music |
| 51 | -- | 0 | Pierre-Amaury Grumiaux, Mathieu Lagrange. Efficient bandwidth extension of musical signals using a differentiable harmonic plus noise model |
| 52 | -- | 0 | Masahiro Suzuki. Piano score rearrangement into multiple difficulty levels via notation-to-notation approach |
| 53 | -- | 0 | Jing Wang, Hanyue Liu, Liang Xu, Wenjing Yang, Weiming Yi, Fang Liu. Lightweight target speaker separation network based on joint training |
| 54 | -- | 0 | Walter Kellermann, Rainer Martin 0001, Nobutaka Ono. Signal processing and machine learning for speech and audio in acoustic sensor networks |
| 55 | -- | 0 | Aleksej Chinaev, Niklas Knaepper, Gerald Enzner. Online distributed waveform-synchronization for acoustic sensor networks with dynamic topology |
| 74 | -- | 0 | Jingneng Fu, Hongyan Wei. Dynamic programming network for point target detection |
| 75 | -- | 0 | Xiangwei Meng, Meng Yuan. Modified rank sum nonparametric CFAR to combat clutter edge |
| 76 | -- | 0 | Tang Yulin, Liming Wang, Houpu Li, Shaofeng Bian. Side-scan sonar underwater target segmentation using the BHP-UNet |
| 77 | -- | 0 | Boyu Zhu, Biao Wang 0002, Banggui Cai, Yunan Zhu 0004, Peng Chao, Zide Fang. A variable step size least mean p-power adaptive filtering algorithm based on multi-moment error fusion |
| 78 | -- | 0 | Lei Fang, Zelin Shi, Yunpeng Liu 0001, Chenxi Li, Mingqi Pang, Enbo Zhao. A general geometric transformation model for line-scan image registration |
| 79 | -- | 0 | Xuehua Li, Ze Lin, Zhichao Bu, Jianxin He, Fang Liu, Zhao Shi, Shunxian Tang, Chuanzhi Wang, Shaojun Dai. Volume scan modeling simulator of a 1D phased array weather radar |
| 80 | -- | 0 | Lan Guo, Rui Gao, Yang Cong, Lei Yang. Correction: Robust automatic modulation classification under noise mismatch |
| 81 | -- | 0 | Taha Hocine Kerbaa, Amar Mezache, Fulvio Gini, Maria S. Greco. Multi-headed deep learning-based estimator for correlated-SIRV Pareto type II distributed clutter |