Speech and Computer - 25th International Conference, SPECOM 2023, Dharwad, India, November 29 - December 2, 2023, Proceedings, Part II - researchr publication

researchr

You are not signed in
Sign in
Sign up

Alexey Karpov 0001, K. Samudravijaya, K. T. Deepak, Rajesh M. Hegde, Shyam S. Agrawal, S. R. Mahadeva Prasanna, editors, Speech and Computer - 25th International Conference, SPECOM 2023, Dharwad, India, November 29 - December 2, 2023, Proceedings, Part II. Volume 14339 of Lecture Notes in Computer Science, Springer, 2023. [doi]

Conference: specom2023

Abstract is missing.

Analysing Breathing Patterns in Reading and Spontaneous SpeechGauri Deshpande, Björn W. Schuller, Pallavi Deshpande, Anuradha Rajiv Joshi, S. K. Oza, Sachin Patel. 3-17 [doi]

Audio-Visual Speaker Verification via Joint Cross-AttentionGnana Praveen Rajasekhar, Jahangir Alam. 18-31 [doi]

A Novel Scheme to Classify Read and Spontaneous SpeechSunil Kumar Kopparapu. 32-45 [doi]

Analysis of a Hinglish ASR System's Performance for Fraud DetectionPradeep Rangappa, Aditya Kiran Brahma, Venkatesh Vayyavuru, Rishi Yadav, Hemant Misra, Kasturi Karuna. 46-58 [doi]

CAPTuring Accents: An Approach to Personalize Pronunciation Training for Learners with Different L1 BackgroundsVeronica Khaustova, Evgeny Pyshkin, Victor Khaustov, John Blake 0002, Natalia Bogach. 59-70 [doi]

Improvements in Language Modeling, Voice Activity Detection, and Lexicon in OpenASR21 Low Resource LanguagesVishwa Gupta, Gilles Boulianne. 73-86 [doi]

Phone Durations Modeling for Livvi-Karelian ASRIrina S. Kipyatkova, Ildar Kagirov. 87-99 [doi]

Significance of Indic Self-supervised Speech Representations for Indic Under-Resourced ASRSougata Mukherjee, Jagabandhu Mishra, S. R. Mahadeva Prasanna. 100-113 [doi]

Study of Various End-to-End Keyword Spotting Systems on the Bengali Language Under Low-Resource ConditionAchintya K. Sarkar, Tulika Basu, Rajib Roy, Joyanta Basu, Michael Tongbram, Yamben Jina Chanu, Priyanka Dwivedi. 114-126 [doi]

Bridging the Gap: Towards Linguistic Resource Development for the Low-Resource Lambani LanguageAshwini Dasare, Amartya Chowdhury, Aditya Srinivas Menon, Konjengbam Anand, K. T. Deepak, S. R. M. Prasanna. 127-139 [doi]

Studying the Effect of Frame-Level Concatenation of GFCC and TS-MFCC Features on Zero-Shot Children's ASRAnkita, Shambhavi, Syed Shahnawazuddin. 140-150 [doi]

Code-Mixed Text-to-Speech Synthesis Under Low-Resource ConstraintsRaviraj Joshi, Nikesh Garera. 151-163 [doi]

An End-to-End TTS Model in Chhattisgarhi, a Low-Resource Indian LanguageAbhayjeet Singh, Anjali Jayakumar, Deekshitha G, Hitesh Kumar, Jesuraja Bandekar, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, Prasanta Kumar Ghosh. 164-172 [doi]

An ASR Corpus in Chhattisgarhi, a Low Resource Indian LanguageAbhayjeet Singh, Arjun Singh Mehta, Ashish Khuraishi K. S, Deekshitha G, Gauri Date, Jai Nanavati, Jesuraja Bandekar, Karnalius Basumatary, Karthika P, Sandhya Badiger, Sathvik Udupa, Saurabh Kumar, Prasanta Kumar Ghosh, Prashanthi V, Priyanka Pai, Raoul Nanavati, Sai Praneeth Reddy Mora, Srinivasa Raghavan K. M.. 173-181 [doi]

Cross Lingual Style Transfer Using Multiscale Loss Function for Soliga: A Low Resource Tribal LanguageAshwini Dasare, B. Lohith Reddy, A. Sai Chandra Koushik, B. Sai Raj, V. Krishna Sai Rohith, Satisha Basavaraju, K. T. Deepak. 182-194 [doi]

Preliminary Analysis of Lambani Vowels and Vowel Classification Using Acoustic FeaturesLeena Dihingia, Prashant Bannulmath, Amartya Chowdhury, S. R. M. Prasanna, K. T. Deepak, Tehreem Sheikh. 195-207 [doi]

Curriculum Learning Based Approach for Faster Convergence of TTS ModelNavneet Kaur, Prasanta Kumar Ghosh. 208-221 [doi]

Rhythm Measures and Language Endangerment: The Case of DeoriKrisangi Saikia, Shakuntala Mahanta. 222-230 [doi]

Konkani Phonetic Transcription System 1.0Swapnil Fadte, Edna Vaz Fernandes, Hanumant Redkar, Jyoti D. Pawar. 231-240 [doi]

E-TTS: Expressive Text-to-Speech Synthesis for Hindi Using Data AugmentationIshika Gupta, Hema A. Murthy. 243-257 [doi]

Direct Vs Cascaded Speech-to-Speech Translation Using TransformerLalaram Arya, Amartya Chowdhury, S. R. Mahadeva Prasanna. 258-270 [doi]

Deep Learning Based Speech Quality Assessment Focusing on Noise EffectsRahul Jaiswal, Anu Priya. 271-282 [doi]

Quantifying the Emotional Landscape of Music with Three DimensionsKirtana Sunil Phatnani, Hemant A. Patil. 283-294 [doi]

Analysis of Mandarin vs English Language for Emotional Voice ConversionS. Uthiraa, Hemant A. Patil. 295-306 [doi]

Audio DeepFake Detection Employing Multiple Parametric Exponential Linear UnitsMd Shahidul Alam, Abderrahim Fathan, Jahangir Alam. 307-321 [doi]

A Comparison of Learned Representations with Jointly Optimized VAE and DNN for Syllable Stress DetectionJhansi Mallela, Prasanth Sai Boyina, Chiranjeevi Yarra. 322-334 [doi]

On the Asymptotic Behaviour of the Speech SignalPriyanka Gupta, Rajul Acharya, Ankur T. Patil, Hemant A. Patil. 335-343 [doi]

Improvement of Audio-Visual Keyword Spotting System Accuracy Using Excitation Source FeatureSalam Nandakishor, Debadatta Pati. 344-356 [doi]

Developing a Question Answering System on the Material of Holocaust Survivors' Testimonies in RussianLiudmila Bukreeva, Daria Guseva, Mikhail Dolgushin, Vera Evdokimova, Vasilisa Obotnina. 357-366 [doi]

Decoding Asian Elephant Vocalisations: Unravelling Call Types, Context-Specific Behaviors, and Individual IdentitiesSeema Lokhandwala, Rohit Sinha 0003, Sreeram Ganji, Balakrishna Pailla. 367-379 [doi]

Enhancing Children's Short Utterance Based ASV Using Data Augmentation Techniques and Feature Concatenation ApproachShahid Aziz, Syed Shahnawazuddin. 380-394 [doi]

Studying the Effectiveness of Data Augmentation and Frequency-Domain Linear Prediction Coefficients in Children's Speaker Verification Under Low-Resource ConditionsShahid Aziz, Shivesh Pushp, Syed Shahnawazuddin. 395-406 [doi]

Constant-Q Based Harmonic and Pitch Features for Normal vs. Pathological Infant Cry ClassificationAditya Pusuluri, Aastha Kachhi, Hemant A. Patil. 407-420 [doi]

Robustness of Whisper Features for Infant Cry ClassificationMonil Charola, Siddharth Rathod, Hemant A. Patil. 421-433 [doi]

I-MSV 2022: Indic-Multilingual and Multi-sensor Speaker Verification ChallengeJagabandhu Mishra, Mrinmoy Bhattacharjee, S. R. Mahadeva Prasanna. 437-445 [doi]

Multi-task Learning over Mixup Variants for the Speaker Verification TaskAbderrahim Fathan, Jahangir Alam, Xiaolin Zhu. 446-460 [doi]

Exploring the Impact of Different Approaches for Spoken Dialect Identification of Konkani LanguageSean Monteiro, Ananya Angra, Muralikrishna H, Veena Thenkanidiyoor, Aroor Dinesh Dileep. 461-474 [doi]

Adversarially Trained Hierarchical Attention Network for Domain-Invariant Spoken Language IdentificationUrvashi Goswami, H. Muralikrishna, Aroor Dinesh Dileep, Veena Thenkanidiyoor. 475-489 [doi]

Ensemble of Incremental System Enhancements for Robust Speaker Diarization in Code-Switched Real-Life AudiosRaj Prakash Gohil, Ramya Viswanathan, Saurabh Agrawal, C. M. Vikram, Madhu R. Kamble, Kamini Sabu, M. Ali Basha Shaik, Krishna K. S. Rajesh. 490-502 [doi]

Enhancing Language Identification in Indian Context Through Exploiting Learned Features with Wav2Vec2.0Shivang Gupta, Kowshik Siva Sai Motepalli, Ravi Kumar, Vamsi Narasinga, Mirishkar Sai Ganesh, Anil Kumar Vuppala. 503-512 [doi]

Design and Development of Voice OTP Authentication SystemPavanitha Manche, Sahaja Nandyala, Jagabandhu Mishra, Gayathri Ananthanarayanan, S. R. Mahadeva Prasanna. 513-528 [doi]

End-to-End Native Language Identification Using a Modified Vision Transformer(ViT) from L2 English SpeechKishan Pipariya, Debolina Pramanik, Puja Bharati, Sabyasachi Chandra, Shyamal Kumar Das Mandal. 529-538 [doi]

Dialect Identification in Ao Using Modulation-Based RepresentationMoakala Tzudir, Rishith Sadashiv T. N., Ayush Agarwal, S. R. Mahadeva Prasanna. 539-549 [doi]

Self-supervised Speaker Verification Employing Augmentation Mix and Self-augmented Training-Based ClusteringAbderrahim Fathan, Jahangir Alam. 550-563 [doi]

runs on WebDSL