IEEE Transactions on Audio, Speech & Language Processing

1852	--	1867	Natsuki Ueno, Shoichi Koyama, Hiroshi Saruwatari. Three-Dimensional Sound Field Reproduction Based on Weighted Mode-Matching Method
1868	--	1879	Lijun Wu, Xu Tan, Tao Qin, Jianhuang Lai, Tie-Yan Liu. Beyond Error Propagation: Language Branching Also Affects the Accuracy of Sequence Generation
1880	--	1892	Amit Das, Jinyu Li, Guoli Ye, Rui Zhao, Yifan Gong. Advancing Acoustic-to-Word CTC Model With Attention and Mixed-Units
1893	--	1905	Niccolo Antonello, Enzo De Sena, Marc Moonen, Patrick A. Naylor, Toon van Waterschoot. Joint Acoustic Localization and Dereverberation Through Plane Wave Decomposition and Sparse Regularization
1906	--	1918	Federico Borra, Alberto Bernardini, Fabio Antonacci, Augusto Sarti. Uniform Linear Arrays of First-Order Steerable Differential Microphones
1919	--	1931	Li Chai, Jun Du, Qing-Feng Liu, Chin-Hui Lee. Using Generalized Gaussian Distributions to Improve Regression Error Modeling for Deep Learning-Based Speech Enhancement
1932	--	1943	Jun Qi, Jun Du, Sabato Marco Siniscalchi, Chin-Hui Lee. A Theory on Deep Neural Network Based Vector-to-Vector Regression With an Illustration of Its Expressive Power in Speech Enhancement
1944	--	1956	Xudong Dang, Qi Cheng, Hongyan Zhu. Indoor Multiple Sound Source Localization via Multi-Dimensional Assignment Data Association
1957	--	1969	Martin Schneider, Emanuel A. P. Habets. Iterative DFT-Domain Inverse Filter Optimization Using a Weighted Least-Squares Criterion
1970	--	1984	Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Tiejun Zhao. Neural Machine Translation With Sentence-Level Topic Context
1985	--	1999	Alejandro Gómez Alanís, Antonio M. Peinado, José A. González, Angel M. Gomez. A Gated Recurrent Convolutional Neural Network for Robust Spoofing Detection
2000	--	2011	Siyuan Feng, Tan Lee. Exploiting Cross-Lingual Speaker and Phonetic Diversity for Unsupervised Subword Modeling
2012	--	2024	Wei Li, Nancy F. Chen, Sabato Marco Siniscalchi, Chin-Hui Lee. Improving Mispronunciation Detection of Mandarin Tones for Non-Native Learners With Soft-Target Tone Labels and BLSTM-Based Deep Tone Models
2025	--	2040	Quansheng Tu, Huawei Chen. On Mainlobe Orientation of the First- and Second-Order Differential Microphone Arrays
2041	--	2053	Jan Chorowski, Ron J. Weiss, Samy Bengio, Aäron Van Den Oord. Unsupervised Speech Representation Learning Using WaveNet Autoencoders
2054	--	2066	Vishnuvardhan Varanasi, Ayushya Agarwal, Rajesh M. Hegde. Near-Field Acoustic Source Localization Using Spherical Harmonic Features
2067	--	2079	Yibin Zheng, Jianhua Tao, Zhengqi Wen, Jiangyan Yi. Forward-Backward Decoding Sequence for Regularizing End-to-End TTS
2080	--	2091	Yanhui Tu, Jun Du, Chin-Hui Lee. Speech Enhancement Based on Teacher-Student Deep Learning Using Improved Speech Presence Probability for Noise-Robust Speech Recognition
2092	--	2102	Yuzhou Liu, DeLiang Wang. Divide and Conquer: A Deep CASA Approach to Talker-Independent Monaural Speaker Separation
2103	--	2112	Xuebo Liu, Derek F. Wong, Lidia S. Chao, Yang Liu. Latent Attribute Based Hierarchical Decoder for Neural Machine Translation
2113	--	2126	Jingyi Hu, Ning Chen. Enhanced Feature Summarizing for Effective Cover Song Identification
2127	--	2139	Qianli Ma, Liuhong Yu, Shuai Tian, Enhuan Chen, Wing W. Y. Ng. Global-Local Mutual Attention Model for Text Classification
2140	--	2149	Vesa Välimäki, Jussi Rämö. Neurally Controlled Graphic Equalizer
2150	--	2161	Sean U. N. Wood, Johannes Stahl, Pejman Mowlaee. Binaural Codebook-Based Speech Enhancement With Atomic Speech Presence Probability
2162	--	2172	Lukas Pfeifenberger, Matthias Zöhrer, Franz Pernkopf. Eigenvector-Based Speech Mask Estimation for Multi-Channel Speech Enhancement
2173	--	2182	Marc Arnela, Saeed Dabbaghchian, Oriol Guasch, Olov Engwall. MRI-Based Vocal Tract Representations for the Three-Dimensional Finite Element Synthesis of Diphthongs
2183	--	2196	Varun Srivastava, Mayank Mishra. Adversarial Approximate Inference for Speech to Electroglottograph Conversion
2197	--	2212	Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, Kazuyoshi Yoshii, Tatsuya Kawahara. Semi-Supervised Multichannel Speech Enhancement With a Deep Speech Prior
2213	--	2222	Qipeng Guo, Xipeng Qiu, Xiangyang Xue, Zheng Zhang. Low-Rank and Locality Constrained Self-Attention for Sequence Modeling
2223	--	2233	Jun Yu, Qiang Ling, Changwei Luo, Chang Wen Chen. Synthesizing 3D Trump: Predicting and Visualizing the Relationship Between Text, Speech, and Articulatory Movements
2234	--	2248	Ryosuke Sugiura, Yutaka Kamamoto, Takehiro Moriya. Shape Control of Discrete Generalized Gaussian Distributions for Frequency-Domain Audio Coding
2249	--	2262	Zamir Ben-Hur, David Lou Alon, Ravish Mehra, Boaz Rafaely. Efficient Representation and Sparse Sampling of Head-Related Transfer Functions Using Phase-Correction Based on Ear Alignment
2263	--	2277	Luca Remaggi, Philip J. B. Jackson, Wenwu Wang. Modeling the Comb Filter Effect and Interaural Coherence for Binaural Source Separation
2278	--	2287	Biao Zhang, Deyi Xiong, Jinsong Su, Jiebo Luo. Future-Aware Knowledge Distillation for Neural Machine Translation
2288	--	2300	Randall Ali, Toon van Waterschoot, Marc Moonen. Integration of a Priori and Estimated Constraints Into an MVDR Beamformer for Speech Enhancement
2301	--	2312	Nitya Tiwari, Prem C. Pandey. Speech Enhancement Using Noise Estimation With Dynamic Quantile Tracking
2313	--	2325	Junwen Duan, Xiao Ding, Yue Zhang, Ting Liu. TEND: A Target-Dependent Representation Learning Framework for News Document
2326	--	2335	Lujun Zhao, Xipeng Qiu, Qi Zhang, Xuanjing Huang. Sequence Labeling With Deep Gated Dual Path CNN
2336	--	2349	Akihiro Kato, Tomi H. Kinnunen. Statistical Regression Models for Noise Robust F0 Estimation Using Recurrent Deep Neural Networks
2350	--	2361	Dayiheng Liu, Jie Fu, Qian Qu, Jiancheng Lv. BFGAN: Backward and Forward Generative Adversarial Networks for Lexically Constrained Sentence Generation
2362	--	2372	Andrés Marafioti, Nathanaël Perraudin, Nicki Holighaus, Piotr Majdak. A Context Encoder For Audio Inpainting
2373	--	2384	Jichen Yang, Rohan Kumar Das, Nina Zhou. Extraction of Octave Spectra Information for Spoofing Attack Detection
2385	--	2396	Oren Barkan, David Tsiris, Ori Katz, Noam Koenigstein. InverSynth: Deep Estimation of Synthesizer Parameter Configurations From Audio Signals

Journal: IEEE Transactions on Audio, Speech & Language Processing

Volume 27, Issue 12