Speech and Computer - 19th International Conference, SPECOM 2017, Hatfield, UK, September 12-16, 2017, Proceedings - researchr publication

researchr

You are not signed in
Sign in
Sign up

Alexey Karpov, Rodmonga Potapova, Iosif Mporas, editors, Speech and Computer - 19th International Conference, SPECOM 2017, Hatfield, UK, September 12-16, 2017, Proceedings. Volume 10458 of Lecture Notes in Computer Science, Springer, 2017. [doi]

Conference: specom2017

Abstract is missing.

Low-Resource Speech Recognition and Keyword-SpottingMark J. F. Gales, Kate M. Knill, Anton Ragni. 3-19 [doi]

Big Data, Deep Learning - At the Edge of X-Ray Speaker AnalysisBjörn W. Schuller. 20-34 [doi]

A Comparison of Covariance Matrix and i-vector Based Speaker RecognitionNiksa Jakovljevic, Ivan D. Jokic, Slobodan Josic, Vlado Delic. 37-45 [doi]

A Trainable Method for the Phonetic Similarity Search in German Proper NamesOliver Jokisch, Horst-Udo Hain. 46-55 [doi]

Acoustic and Perceptual Correlates of Vowel Articulation in Parkinson's Disease With and Without Mild Cognitive Impairment: A Pilot StudyMichaela Strinzel, Vasilisa Verkhodanova, Fedor Jalvingh, Roel Jonkers, Matt Coler. 56-64 [doi]

Acoustic Cues for the Perceptual Assessment of Surround SoundIngo Siegert, Oliver Jokisch, Alicia Flores Lotz, Franziska Trojahn, Martin Meszaros, Michael Maruschke. 65-75 [doi]

Acoustic Modeling in the STC Keyword Search System for OpenKWS 2016 EvaluationIvan Medennikov, Aleksei Romanenko, Alexey Prudnikov, Valentin Mendelev, Yuri Y. Khokhlov, Maxim Korenevsky, Natalia A. Tomashenko, Alexander Zatvornitskiy. 76-86 [doi]

Adaptation Approaches for Pronunciation Scoring with Sparse Training DataFederico Landini, Luciana Ferrer, Horacio Franco. 87-97 [doi]

An Algorithm for Detection of Breath Sounds in Spontaneous Speech with Application to Speaker RecognitionSri Harsha Dumpala, K. N. R. K. Raju Alluri. 98-108 [doi]

An Alternative Approach to Exploring a VideoFahim A. Salim, Fasih Haider, Owen Conlan, Saturnino Luz. 109-118 [doi]

An Analysis of the RNN-Based Spoken Term Detection TrainingJan Svec, Lubos Smídl, Josef V. Psutka. 119-129 [doi]

Analysis of Interaction Parameter Levels in Interaction Quality Modelling for Human-Human ConversationAnastasiia Spirina, Olesia Vaskovskaia, Tatiana Karaseva, Alina Skorokhod, Iana Polonskaia, Maxim Sidorov. 130-140 [doi]

Annotation Error Detection: Anomaly Detection vs. ClassificationJindrich Matousek, Daniel Tihelka. 141-151 [doi]

Are You Addressing Me? Multimodal Addressee Detection in Human-Human-Computer ConversationsOleg Akhtiamov, Dmitrii Ubskii, Evgeniia Feldina, Aleksei Pugachev, Alexey Karpov, Wolfgang Minker. 152-161 [doi]

Assessing Spoken Dialog Services from the End-User Perspective: Usability and ExperienceOtilia Kocsis, Basilis Kladis, Anastasios Tsopanoglou, Nikos Fakotakis. 162-170 [doi]

Audio-Replay Attack Detection CountermeasuresGalina Lavrentyeva, Sergey Novoselov, Egor Malykh, Alexander Kozlov, Oleg Kudashev, Vadim Shchemelinin. 171-181 [doi]

Automatic Estimation of Presentation Skills Using Speech, Slides and GesturesAbualsoud Hanani, Mohammad Al-Amleh, Waseem Bazbus, Saleem Salameh. 182-191 [doi]

Automatic Phonetic Transcription for Russian: Speech Variability ModelingVera Evdokimova, Pavel A. Skrelin, Tatiana Chukaeva. 192-199 [doi]

Automatic Smoker Detection from Telephone Speech SignalsAmir Hossein Poorjam, Soheila Hesaraki, Saeid Safavi, Hugo Van Hamme, Mohamad Hasan Bahari. 200-210 [doi]

Bimodal Anti-Spoofing System for Mobile SecurityEugene Luckyanets, Aleksandr Melnikov, Oleg Kudashev, Sergey Novoselov, Galina Lavrentyeva. 211-220 [doi]

Canadian English Word Stress: A Corpora-Based Study of National Identity in a Multilingual CommunityTatiana Shevchenko, Daria Pozdeeva. 221-232 [doi]

Classification of Formal and Informal Dialogues Based on Turn-Taking and Intonation Using Deep Neural NetworksIstván Szekrényes, György Kovács. 233-243 [doi]

Clustering Target Speaker on a Set of Telephone DialogsAndrey Shulipa, Aleksey Sholohov, Yuri Matveev. 244-252 [doi]

Cognitive Entropy in the Perceptual-Auditory Evaluation of Emotional Modal States of Foreign Language Communication PartnerRodmonga Potapova, Vsevolod Potapov. 253-261 [doi]

Correlation Normalization of Syllables and Comparative Evaluation of Pronunciation Quality in Speech RehabilitationEugeny U. Kostyuchenko, Roman V. Meshcheryakov, Dariya Ignatieva, Alexander Pyatkov, Evgeny Choynzonov, Lidiya N. Balatskaya. 262-271 [doi]

CRF-Based Phrase Boundary Detection Trained on Large-Scale TTS Speech CorporaMarkéta Juzová. 272-281 [doi]

Deep Recurrent Neural Networks in Speech Synthesis Using a Continuous VocoderMohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh. 282-291 [doi]

Design of Online Echo Canceller in Duplex ModeAndrey Barabanov, Evgenij Vikulov. 292-301 [doi]

Detection of Stance and Sentiment Modifiers in Political BlogsMaria Skeppstedt, Vasiliki Simaki, Carita Paradis, Andreas Kerren. 302-311 [doi]

Digits to Words Converter for Slavic Languages in Systems of Automatic Speech RecognitionJosef Chaloupka. 312-321 [doi]

Discriminating Speakers by Their Voices - A Fusion Based ApproachHalim Sayoud, Siham Ouamour, Zohra Hamadache. 322-331 [doi]

Emotional Poetry GenerationAitzol Astigarraga, José María Martínez-Otzeta, Igor Rodriguez, Basilio Sierra, Elena Lazkano. 332-342 [doi]

End-to-End Large Vocabulary Speech Recognition for the Serbian LanguageBranislav M. Popovic, Edvin Pakoci, Darko Pekar. 343-352 [doi]

Examining the Impact of Feature Selection on Sentiment Analysis for the Greek LanguageNikolaos Spatiotis, Michael Paraskevas, Isidoros Perikos, Iosif Mporas. 353-361 [doi]

Experimenting with Hybrid TDNN/HMM Acoustic Models for Russian Speech RecognitionIrina S. Kipyatkova. 362-369 [doi]

Exploring Multiparty Casual Talk for Social Human-Machine DialogueEmer Gilmartin, Benjamin R. Cowan, Carl Vogel, Nick Campbell. 370-378 [doi]

First Experiments to Detect Anomaly Using Personality Traits vs. Prosodic FeaturesCedric Fayet, Arnaud Delhay, Damien Lolive, Pierre-François Marteau. 379-388 [doi]

Fusion of a Novel Volterra-Wiener Filter Based Nonlinear Residual Phase and MFCC for Speaker VerificationPurvi Agrawal, Hemant A. Patil. 389-397 [doi]

Hesitations in Spontaneous Speech: Acoustic Analysis and DetectionVasilisa Verkhodanova, Vladimir Shapranov, Irina S. Kipyatkova. 398-406 [doi]

Human as Acmeologic Entity in Social Network Discourse (Multidimensional Approach)Rodmonga Potapova, Vsevolod Potapov. 407-416 [doi]

Improved Speaker Adaptation by Combining I-vector and fMLLR with Deep Bottleneck NetworksThai Son Nguyen, Kevin Kilgour, Matthias Sperber, Alex Waibel. 417-426 [doi]

Improving of LVCSR for Causal Czech Using Publicly Available Language ResourcesPetr Mizera, Petr Pollák. 427-437 [doi]

Improving Performance of Speaker Identification Systems Using Score Level Fusion of Two Modes of OperationSaeid Safavi, Iosif Mporas. 438-444 [doi]

Improving Speech-Based Emotion Recognition by Using Psychoacoustic Modeling and Analysis-by-SynthesisIngo Siegert, Alicia Flores Lotz, Olga Egorow, Andreas Wendemuth. 445-455 [doi]

In Search of Sentence Boundaries in Spontaneous SpeechNatalia Bogdanova-Beglarian. 456-463 [doi]

Investigating Acoustic Correlates of Broad and Narrow Focus Perception by Japanese Learners of EnglishGábor Pintér, Oliver Jokisch, Shinobu Mizuguchi. 464-472 [doi]

Language Adaptive Multilingual CTC Speech RecognitionMarkus Müller 0001, Sebastian Stüker, Alex Waibel. 473-482 [doi]

Language Model Optimization for a Deep Neural Network Based Speech Recognition System for SerbianEdvin Pakoci, Branislav M. Popovic, Darko Pekar. 483-492 [doi]

Lexico-Semantical Indices of "Deprivation - Aggression" Modality Correlation in Social Network DiscourseRodmonga Potapova, Liliya Komalova. 493-502 [doi]

Linguistic Features and Sociolinguistic Variability in Everyday Spoken RussianNatalia Bogdanova-Beglarian, Tatiana Y. Sherstinova, Olga Blinova, Gregory Y. Martynenko. 503-511 [doi]

Medical Speech Recognition: Reaching Parity with HumansErik Edwards, Wael Salloum, Greg Finley, James Fone, Greg Cardiff, Mark Miller, David Suendermann-Oeft. 512-524 [doi]

Microphone Array Post-filter in Frequency Domain for Speech Recognition Using Short-Time Log-Spectral Amplitude Estimator and Spectral Harmonic/Noise ClassifierSergey I. Salishev, Ilya Klotchkov, Andrey Barabanov. 525-534 [doi]

Multimodal Keyword Search for Multilingual and Mixlingual Speech CorpusAbhimanyu Popli, Arun Kumar. 535-545 [doi]

Neural Network Doc2vec in Automated Sentiment Analysis for Short Informal TextsNatalia Maslova, Vsevolod Potapov. 546-554 [doi]

Neural Network Speaker Descriptor in Speaker Diarization of Telephone SpeechZbynek Zajíc, Jan Zelinka, Ludek Müller. 555-563 [doi]

Novel Linear Prediction Temporal Phase Based Features for Speaker RecognitionAmi Gandhi, Hemant A. Patil. 564-571 [doi]

Novel Phase Encoded Mel Cepstral Features for Speaker VerificationApeksha J. Naik, Rishabh Tak, Hemant A. Patil. 572-581 [doi]

On a Way to the Computer Aided Speech Intonation TrainingBoris Lobanov, Yelena Karnevskaya, Vladimir Zhitko. 582-592 [doi]

On Residual CNN in Text-Dependent Speaker Verification TaskEgor Malykh, Sergey Novoselov, Oleg Kudashev. 593-601 [doi]

Perception and Acoustic Features of Speech of Children with Autism Spectrum DisordersElena E. Lyakso, Olga V. Frolova, Aleksey Grigorev. 602-612 [doi]

Phase Analysis and Labeling Strategies in a CNN-Based Speaker Change Detection SystemMarek Hrúz, Petr Salajka. 613-622 [doi]

Preparing Audio Recordings of Everyday Speech for Prosody Research: The Case of the ORD CorpusTatiana Y. Sherstinova. 623-631 [doi]

Recognizing Emotionally Coloured Dialogue Speech Using Speaker-Adapted DNN-CNN Bottleneck FeaturesKohei Mukaihara, Sakriani Sakti, Satoshi Nakamura 0001. 632-641 [doi]

Relationship Between Perception of Cuteness in Female Voices and Their DurationsRyohei Ohno, Masanori Morise, Tetsuro Kitahara. 642-650 [doi]

Retaining Expression on De-identified FacesLi Meng, Aruna Shenoy. 651-661 [doi]

Semi-automatic Facial Key-Point Dataset CreationMiroslav Hlavác, Ivan Gruber, Milos Zelezný, Alexey Karpov. 662-668 [doi]

Song Emotion Recognition Using Music Genre InformationAthanasios Koutras. 669-679 [doi]

Spanish Corpus for Sentiment Analysis Towards BrandsMaría Navas-Loro, Víctor Rodríguez-Doncel, Idafen Santana Pérez, Alberto Sánchez. 680-689 [doi]

Speech Enhancement for Speaker Recognition Using Deep Recurrent Neural NetworksMaxim Tkachenko, Alexander Yamshinin, Nikolay Lyubimov, Mikhail Kotov, Marina Nastasenko. 690-699 [doi]

Stance Classification in Texts from Blogs on the 2016 British ReferendumVasiliki Simaki, Carita Paradis, Andreas Kerren. 700-709 [doi]

The "Retrospective Commenting" Method for Longitudinal Recordings of Everyday SpeechArto Mustajoki, Tatiana Y. Sherstinova. 710-718 [doi]

The 2016 RWTH Keyword Search System for Low-Resource LanguagesPavel Golik, Zoltán Tüske, Kazuki Irie, Eugen Beck, Ralf Schlüter, Hermann Ney. 719-730 [doi]

The Effect of Morphological Factors on Sentence Boundaries in Russian Spontaneous SpeechAnton Stepikhov, Anastassia Loukina. 731-740 [doi]

The Pausing Method Based on Brown Clustering and Word EmbeddingArman Kaliyev, Sergey V. Rybin, Yuri Matveev. 741-747 [doi]

Unsupervised Document Classification and Topic DetectionJaromír Novotný, Pavel Ircing. 748-756 [doi]

Using a High-Speed Video Camera for Robust Audio-Visual Speech Recognition in Acoustically Noisy ConditionsDenis Ivanko, Alexey Karpov, Dmitry Ryumin, Irina S. Kipyatkova, Anton I. Saveliev, Victor Budkov, Dmitriy Ivanko, Milos Zelezný. 757-766 [doi]

Utilizing Lipreading in Large Vocabulary Continuous Speech RecognitionKarel Palecek. 767-776 [doi]

Vocal Emotion Conversion Using WSOLA and Linear PredictionSusmitha Vekkot, Shikha Tripathi. 777-787 [doi]

Voice Conversion for TTS Systems with Tuning on the Target Speaker Based on GMMVadim Zahariev, Elias Azarov, Alexander A. Petrovsky. 788-798 [doi]

VoiScan: Telephone Voice Analysis for Health and Biometric ApplicationsLadan Baghai-Ravary, Steve W. Beet. 799-808 [doi]

Web Queries Classification Based on the Syntactical Patterns of Search TypesAlaa Mohasseb, Mohamed Bader-El-Den, Andreas Kanavos, Mihaela Cocea. 809-819 [doi]

What Speech Recognition Accuracy is Needed for Video Transcripts to be a Useful Search Interface?Yang Chao, Marie-Luce Bourguet. 820-828 [doi]

runs on WebDSL