22nd SPECOM 2020: St. Petersburg, Russia
- Alexey Karpov, Rodmonga Potapova:
Speech and Computer - 22nd International Conference, SPECOM 2020, St. Petersburg, Russia, October 7-9, 2020, Proceedings. Lecture Notes in Computer Science 12335, Springer 2020, ISBN 978-3-030-60275-8
- Tanvirul Alam, Akib Khan:
Lightweight CNN for Robust Voice Activity Detection. 1-12
- Pedro Alonso, Rajkumar Saini, György Kovács:
Hate Speech Detection Using Transformer Ensembles on the HASOC Dataset. 13-21
- Iustina Andronic, Ludwig Kürzinger, Edgar Ricardo Chavez Rosas, Gerhard Rigoll, Bernhard U. Seeber:
MP3 Compression to Diminish Adversarial Noise in End-to-End Speech Recognition. 22-34
- Andrei Andrusenko, Aleksandr Laptev, Ivan Medennikov:
Exploration of End-to-End ASR for OpenSTT - Russian Open Speech-to-Text Dataset. 35-44
- Sergei Astapov, Dmitriy Popov, Vladimir Kabarov:
Directional Clustering with Polyharmonic Phase Estimation for Enhanced Speaker Localization. 45-56
- Umut Avci:
Speech Emotion Recognition Using Spectrogram Patterns as Features. 57-67
- Natalia Bogdanova-Beglarian, Olga Blinova, Tatiana Y. Sherstinova, Daria Gorbunova, Kristina Zaides, Tatiana I. Popova:
Pragmatic Markers in Dialogue and Monologue: Difficulties of Identification and Typical Formation Models. 68-78
- Sebastian Braun, Ivan Tashev:
Data Augmentation and Loss Normalization for Deep Noise Suppression. 79-86
- Lukás Bures, Petr Neduchal, Ludek Müller:
Automatic Information Extraction from Scanned Documents. 87-96
- Petr Cerva, Veronika Volna, Lenka Weingartová:
Dealing with Newly Emerging OOVs in Broadcast Programs by Daily Updates of the Lexicon and Language Model. 97-107
- Aleksandr Chernyaev, Alexey Spryiskov, Alexander Ivashko, Yuliya Bidulya:
A Rumor Detection in Russian Tweets. 108-118
- Maria Dayter, Elena I. Riekhakaynen:
Automatic Prediction of Word Form Reduction in Russian Spontaneous Speech. 119-127
- Ghania Droua-Hamdani:
Formant Frequency Analysis of MSA Vowels in Six Algerian Regions. 128-135
- Anastasia Dvoynikova, Oxana Verkholyak, Alexey Karpov:
Emotion Recognition and Sentiment Analysis of Extemporaneous Speech Transcriptions in Russian. 136-144
- José Vicente Egas López, Gábor Gosztolya:
Predicting a Cold from Speech Using Fisher Vectors; SVM and XGBoost as Classifiers. 145-155
- Denis Gordeev, Vsevolod Potapov:
Toxicity in Texts and Images on the Internet. 156-165
- Ivan Gruber, Pavel Ircing, Petr Neduchal, Marek Hrúz, Miroslav Hlavác, Zbynek Zajíc, Jan Svec, Martin Bulín:
An Automated Pipeline for Robust Image Processing and Optical Character Recognition of Historical Documents. 166-175
- Miroslav Hlavác, Ivan Gruber, Milos Zelezný, Alexey Karpov:
Lipreading with LipsID. 176-183
- Anastasia Iskhakova, Daniyar Wolf, Roman V. Meshcheryakov:
Automated Destructive Behavior State Detection on the 1D CNN-Based Voice Analysis. 184-193
- Evgeny Kazartsev, Arina Davydova, Tatiana Y. Sherstinova:
Rhythmic Structures of Russian Prose and Occasional Iambs (a Diachronic Case Study). 194-203
- Pavel Kholiavin, Anna Mamushina, Daniil Kocharov, Tatiana Kachkovskaia:
Automatic Detection of Backchannels in Russian Dialogue Speech. 204-213
- Irina S. Kipyatkova, Nikita Markovnikov:
Experimenting with Attention Mechanisms in Joint CTC-Attention Models for Russian Speech Recognition. 214-222
- Can Korkut, Ali Haznedaroglu, Levent Arslan:
Comparison of Deep Learning Methods for Spoken Language Identification. 223-231
- Artemy Kotov, Liudmila Zaidelman, Anna Zinina, Nikita Arinkin, Alexander Filatov, Kirill Kivva:
Conceptual Operations with Semantics for a Companion Robot. 232-243
- Sergey V. Kuleshov, Alexandra A. Zaytseva, Konstantin Nenausnikov:
Legal Tech: Documents' Validation Method Based on the Associative-Ontological Approach. 244-254
- Ludwig Kürzinger, Edgar Ricardo Chavez Rosas, Lujun Li, Tobias Watzel, Gerhard Rigoll:
Audio Adversarial Examples for Robust Hybrid CTC/Attention Speech Recognition. 255-266
- Ludwig Kürzinger, Dominik Winkelbauer, Lujun Li, Tobias Watzel, Gerhard Rigoll:
CTC-Segmentation of Large Corpora for German End-to-End Speech Recognition. 267-278
- Tatiana Litvinova:
Stylometrics Features Under Domain Shift: Do They Really "Context-Independent"? 279-290
- Elena E. Lyakso, Olga V. Frolova, Aleksey Grigorev, Viktor Gorodnyi, Aleksandr Nikolaev, Anna V. Kurazhova:
Speech Features of 13-15 Year-Old Children with Autism Spectrum Disorders. 291-303
- Manon Macary, Martin Lebourdais, Marie Tahon, Yannick Estève, Anthony Rousseau:
Multi-corpus Experiment on Continuous Speech Emotion Recognition: Convolution or Recurrence? 304-314
- Olesia Makhnytkina, Anton Matveev, Darya Bogoradnikova, Inna Lizunova, Anna Maltseva, Natalia Shilkina:
Detection of Toxic Language in Short Text Messages. 315-325
- Maxim Markitantov:
Transfer Learning in Speaker's Age and Gender Recognition. 326-335
- Thilo Michael, Sebastian Möller:
Interactivity-Based Quality Prediction of Conversations with Transmission Delay. 336-345
- Polina Mikhailova:
Graphic Markers of Irony and Sarcasm in Written Texts. 346-356
- Oliver Niebuhr, Jana Neitsch:
Digital Rhetoric 2.0: How to Train Charismatic Speaking with Speech-Melody Visualization Software. 357-368
- Arif Sirri Özçelik, Tunga Güngör:
Generating a Concept Relation Network for Turkish Based on ConceptNet Using Translational Methods. 369-378
- Dimitar Popov, Velka Popova, Krasimir Kordov, Stanimir Zhelezov:
Bulgarian Associative Dictionaries in the LABLASS Web-Based System. 379-388
- Rodmonga Potapova, Andrey Dzhunkovskiy:
Preliminary Investigation of Potential Steganographic Container Localization. 389-398
- Rodmonga Potapova, Vsevolod Potapov:
Some Comparative Cognitive and Neurophysiological Reactions to Code-Modified Internet Information. 399-411
- Rodmonga Potapova, Vsevolod Potapov, Nataliya Lebedeva, Ekaterina Karimova, Nikolay Bobrov:
The Influence of Multimodal Polycode Internet Content on Human Brain Activity. 412-423
- Jirí Pribil, Anna Pribilová, Jindrich Matousek:
Synthetic Speech Evaluation by Differential Maps in Pleasure-Arousal Space. 424-434
- Ilyos Rabbimov, Iosif Mporas, Vasiliki Simaki, Sami Kobilov:
Investigating the Effect of Emoji in Opinion Classification of Uzbek Movie Review Comments. 435-445
- Rajeev Rajan, Abhijith Girish, Adharsh Sabu, Akshay Prasannan Latha:
Evaluation of Voice Mimicking Using I-Vector Framework. 446-456
- Ivan Rakhmanenko, Evgeny Kostyuchenko, Evgeny L. Choynzonov, Lidiya N. Balatskaya, Alexander Alexandrovich Shelupanov:
Score Normalization of X-Vector Speaker Verification System for Short-Duration Speaker Verification Challenge. 457-466
- Ekaterina Razubaeva, Anton Stepikhov:
Genuine Spontaneous vs Fake Spontaneous Speech: In Search of Distinction. 467-478
- Meysam Shamsi, Nelly Barbot, Damien Lolive, Jonathan Chevelu:
Mixing Synthetic and Recorded Signals for Audio-Book Generation. 479-489
- Tatiana Shevchenko, Anastasia Gorbyleva:
Temporal Concord in Speech Interaction: Overlaps and Interruptions in Spoken American English. 490-499
- Tatiana Shevchenko, Tatiana Sokoreva:
Cognitively Challenging: Language Shift and Speech Rate of Academic Bilinguals. 500-508
- Dima Shulga, Vered Silber-Varod, Diamanta Benson-Karai, Ofer Levi, Elad Vashdi, Anat Lerner:
Toward Explainable Automatic Classification of Children's Speech Disorders. 509-519
- Ingo Siegert, Yamini Sinha, Oliver Jokisch, Andreas Wendemuth:
Recognition Performance of Selected Speech Recognition APIs - A Longitudinal Study. 520-529
- Lucy Skidmore, Alexander Gutkin:
Does A Priori Phonological Knowledge Improve Cross-Lingual Robustness of Phonemic Contrasts? 530-543
- Pavel A. Skrelin, Uliana E. Kochetkova, Vera Evdokimova, Daria Novoselova:
Can We Detect Irony in Speech Using Phonetic Characteristics Only? - Looking for a Methodology of Analysis. 544-553
- Valery D. Solovyev, Vladimir Ivanov:
Automated Compilation of a Corpus-Based Dictionary and Computing Concreteness Ratings of Russian. 554-561
- Petr Stanislav, Josef V. Psutka, Josef Psutka:
Increasing the Accuracy of the ASR System by Prolonging Voiceless Phonemes in the Speech of Patients Using the Electrolarynx. 562-571
- Paul Tardy, Louis de Seynes, François Hernandez, Vincent Nguyen, David Janiszek, Yannick Estève:
Leverage Unlabeled Data for Abstractive Speech Summarization with Self-supervised Learning and Back-Summarization. 572-580
- Daniel Tihelka, Zdenek Hanzlícek, Markéta Juzová:
Uncertainty of Phone Voicing and Its Impact on Speech Synthesis. 581-591
- Daniel Tihelka, Markéta Juzová, Jakub Vít:
Grappling with Web Technologies: The Problems of Remote Speech Recording. 592-602
- Ryhor Vashkevich, Elias Azarov:
Robust Noisy Speech Parameterization Using Convolutional Neural Networks. 603-612
- Vass Verkhodanova, Dominika Trcková, Matt Coler, Wander Lowie:
More than Words: Cross-Linguistic Exploration of Parkinson's Disease Identification from Speech. 613-623
- Jitka Veronková, Tomás Boril:
Phonological Length of L2 Czech Speakers' Vowels in Ambiguous Contexts as Perceived by L1 Listeners. 624-635
- Siwei Wang, Catherine Soladié, Renaud Séguier:
Learning an Unsupervised and Interpretable Representation of Emotion from Speech. 636-645
- Tobias Watzel, Ludwig Kürzinger, Lujun Li, Gerhard Rigoll:
Synchronized Forward-Backward Transformer for End-to-End Speech Recognition. 646-656
- Zhandos Yessenbayev, Zhanibek Kozhirbayev, Aibek Makazhanov:
KazNLP: A Pipeline for Automated Processing of Texts Written in Kazakh Language. 657-666
- Zbynek Zajíc, Josef V. Psutka, Ludek Müller:
Diarization Based on Identification with X-Vectors. 667-678
- Vladimir Zhebel, Denis Zubarev, Ilya Sochenkov:
Different Approaches in Cross-Language Similar Documents Retrieval in the Legal Domain. 679-686