


default search action
Odyssey 2018: Les Sables d'Olonne, France
- Anthony Larcher, Jean-François Bonastre

:
Odyssey 2018: The Speaker and Language Recognition Workshop, 26-29 June 2018, Les Sables d'Olonne, France. ISCA 2018
Keynote: Els Kindt
- Els Kindt:

Speaker identification and Data protection.
Speaker Recognition I
- Moez Ajili, Solange Rossato, Dan Zhang, Jean-François Bonastre:

Impact of rhythm on forensic voice comparison reliability. 1-8 - Georgina Brown

:
Segmental Content Effects on Text-dependent Automatic Accent Recognition. 9-15 - Andreas Nautsch

, Sergey Isadskiy, Jascha Kolberg
, Marta Gomez-Barrero, Christoph Busch:
Homomorphic Encryption for Speaker Recognition: Protection of Biometric Templates and Vendor Model Parameters. 16-23 - Martin Karu, Tanel Alumäe

:
Weakly Supervised Training of Speaker Identification Models. 24-30
Language Recognition
- Bharat Padi, Shreyas Ramoji, Vaishnavi Yeruva, Satish Kumar, Sriram Ganapathy:

The LEAP Language Recognition System for LRE 2017 Challenge - Improvements and Error Analysis. 31-38 - Alicia Lozano-Diez

, Oldrich Plchot, Pavel Matejka, Ondrej Novotný, Joaquin Gonzalez-Rodriguez:
Analysis of DNN-based Embeddings for Language Recognition on the NIST LRE 2017. 39-46 - Oldrich Plchot, Pavel Matejka, Ondrej Novotný, Sandro Cumani, Alicia Lozano-Diez

, Josef Slavícek, Mireia Díez, Frantisek Grézl, Ondrej Glembek, Mounika Kamsali, Anna Silnova, Lukás Burget, Lucas Ondel, Santosh Kesiraju
, Johan Rohdin:
Analysis of BUT-PT Submission for NIST LRE 2017. 47-53 - Fred Richardson, Pedro A. Torres-Carrasquillo, Jonas Borgstrom, Douglas E. Sturim, Youngjune Gwon, Jesús Villalba, Jan Trmal, Nanxin Chen, Réda Dehak, Najim Dehak

:
The MIT Lincoln Laboratory / JHU / EPITA-LSE LRE17 System. 54-59 - Trung Ngo Trong, Ville Hautamäki, Kristiina Jokinen:

Staircase Network: structural language identification via hierarchical attentive units. 60-67 - Alan McCree, David Snyder, Gregory Sell, Daniel Garcia-Romero:

Language Recognition for Telephone and Video Speech: The JHU HLTCOE Submission for NIST LRE17. 68-73 - Weicheng Cai, Jinkun Chen, Ming Li:

Exploring the Encoding Layer and Loss Function in End-to-End Speaker and Language Recognition System. 74-81 - Seyed Omid Sadjadi, Timothée Kheyrkhah, Audrey Tong, Craig S. Greenberg, Douglas A. Reynolds, Elliot Singer, Lisa P. Mason, Jaime Hernandez-Cordero

:
The 2017 NIST Language Recognition Evaluation. 82-89 - Mitchell McLaren, Mahesh Kumar Nandwana, Diego Castán, Luciana Ferrer:

Approaches to Multi-domain Language Recognition. 90-97 - Suwon Shon, Ahmed Ali, James R. Glass:

Convolutional Neural Network and Language Embeddings for End-to-End Dialect Recognition. 98-104 - David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Daniel Povey, Sanjeev Khudanpur:

Spoken Language Recognition using X-vectors. 105-111 - Jesús Antonio Villalba López, Niko Brummer, Najim Dehak

:
End-to-End versus Embedding Neural Networks for Language Recognition in Mismatched Conditions. 112-119
Speaker diarization
- Ruth Aloni-Lavi, Irit Opher, Itshak Lapidot:

Incremental On-Line Clustering of Speakers' Short Segments. 120-127 - Liang He

, Xianhong Chen, Can Xu, Jia Liu:
Latent Class Model for Single Channel Speaker Diarization. 128-133 - Xianhong Chen, Liang He

, Can Xu, Yi Liu, Tianyu Liang
, Jia Liu:
VB-HMM Speaker Diarization with Enhanced and Refined Segment Representation. 134-139 - Jose Patino, Ruiqing Yin, Héctor Delgado

, Hervé Bredin, Alain Komaty
, Guillaume Wisniewski, Claude Barras, Nicholas W. D. Evans, Sébastien Marcel:
Low-latency speaker spotting with online diarization and detection. 140-146 - Mireia Díez

, Lukás Burget, Pavel Matejka:
Speaker Diarization based on Bayesian HMM with Eigenvoice Priors. 147-154
Noise Robustness
- Md. Hafizur Rahman, Ivan Himawan, David Dean, Clinton Fookes, Sridha Sridharan:

Domain-invariant I-vector Feature Extraction for PLDA Speaker Verification. 155-161 - Wei-Wei Lin

, Man-Wai Mak, Longxin Li, Jen-Tzung Chien
:
Reducing Domain Mismatch by Maximum Mean Discrepancy Based Autoencoders. 162-167 - Ondrej Novotný, Oldrich Plchot, Pavel Matejka, Ladislav Mosner, Ondrej Glembek:

On the use of X-vectors for Robust Speaker Recognition. 168-175 - Md. Jahangir Alam, Gautam Bhattacharya, Patrick Kenny:

Speaker Verification in Mismatched Conditions with Frustratingly Easy Domain Adaptation. 176-180 - Chunlei Zhang, Shivesh Ranjan, John H. L. Hansen:

An Analysis of Transfer Learning for Domain Mismatched Text-independent Speaker Verification. 181-186
Keynote: Simon King
- Simoin King:

Speaking naturally? It depends who is listening.
Voice conversion
- Tomi Kinnunen, Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Zhen-Hua Ling:

A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment. 187-194 - Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Tomi Kinnunen, Zhen-Hua Ling:

The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods. 195-202 - Kazuhiro Kobayashi, Tomoki Toda:

sprocket: Open-Source Voice Conversion Software. 203-210
Voice conversion and spoofing
- Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:

The NU Non-Parallel Voice Conversion System for the Voice Conversion Challenge 2018. 211-218 - Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda:

NU Voice Conversion System for the Voice Conversion Challenge 2018. 219-226 - Xiaohai Tian, Junchao Wang, Haihua Xu, Eng Siong Chng, Haizhou Li:

Average Modeling Approach to Voice Conversion with Non-Parallel Data. 227-232 - Shihono Mochizuki, Sayaka Shiota, Hitoshi Kiya:

Voice liveness detection using phoneme-based pop-noise detector for speaker verification. 233-239 - Jaime Lorenzo-Trueba, Fuming Fang, Xin Wang

, Isao Echizen, Junichi Yamagishi, Tomi Kinnunen:
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data. 240-247 - Songxiang Liu, Lifa Sun, Xixin Wu, Xunying Liu, Helen Meng:

The HCCL-CUHK System for the Voice Conversion Challenge 2018. 248-254 - Fahimeh Bahmaninezhad, Chunlei Zhang, John H. L. Hansen:

Convolutional Neural Network Based Speaker De-Identification. 255-260 - Kentaro Sone, Shinji Takaki, Toru Nakashika:

Bidirectional Voice Conversion Based on Joint Training Using Gaussian-Gaussian Deep Relational Model. 261-266 - Berrak Sisman

, Grandee Lee, Haizhou Li:
Phonetically Aware Exemplar-Based Prosody Transformation. 267-274 - Akihiro Kato, Tomi Kinnunen:

A Regression Model of Recurrent Deep Neural Networks for Noise Robust Estimation of the Fundamental Frequency Contour of Speech. 275-282 - Anna Silnova, Pavel Matejka, Ondrej Glembek, Oldrich Plchot, Ondrej Novotný, Frantisek Grézl, Petr Schwarz, Lukás Burget, Jan Cernocký:

BUT/Phonexia Bottleneck Feature Extractor. 283-287
Spoofing
- Giacomo Valenti, Héctor Delgado

, Massimiliano Todisco, Nicholas W. D. Evans, Laurent Pilati:
An end-to-end spoofing countermeasure for automatic speaker verification using evolving recurrent neural networks. 288-295 - Héctor Delgado

, Massimiliano Todisco, Md. Sahidullah, Nicholas W. D. Evans, Tomi Kinnunen, Kong-Aik Lee, Junichi Yamagishi:
ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements. 296-303 - Joaquin Gonzalez-Rodriguez, Álvaro Escudero, Diego de Benito-Gorrón

, Beltran Labrador, Javier Franco-Pedroso:
An Audio Fingerprinting Approach to Replay Attack Detection on ASVSPOOF 2017 Challenge Data. 304-311 - Tomi Kinnunen, Kong-Aik Lee, Héctor Delgado

, Nicholas W. D. Evans, Massimiliano Todisco, Md. Sahidullah, Junichi Yamagishi, Douglas A. Reynolds:
t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification. 312-319 - Rosa González Hautamäki, Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen:

Perceptual Evaluation of the Effectiveness of Voice Disguise by Age Modification. 320-326
Keynote: Pascal Belin
- Pascal Belin:

A Vocal Brain: Cerebral Processing of Voice Information.
Speaker recognition II
- Mitchell McLaren, Diego Castán, Mahesh Kumar Nandwana, Luciana Ferrer, Emre Yilmaz

:
How to train your speaker embeddings extractor. 327-334 - Giacomo Valenti, Adrien Daniel, Nicholas W. D. Evans:

End-to-end automatic speaker verification with evolving recurrent neural networks. 335-341 - Jen-Tzung Chien

, Kang-Ting Peng:
Adversarial Learning and Augmentation for Speaker Recognition. 342-348 - Niko Brummer, Anna Silnova, Lukás Burget, Themos Stafylakis

:
Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model. 349-356 - Ville Vestman, Tomi Kinnunen:

Supervector Compression Strategies to Speed up I-Vector System Development. 357-364
Text-dependent speaker recognition
- Ziqiang Shi, Mengjiao Wang, Liu Liu, Huibin Lin, Rujie Liu:

A Double Joint Bayesian Approach for J-Vector Based Text-dependent Speaker Verification. 365-371 - Hossein Zeinali, Lukás Burget, Hossein Sameti, Honza Cernocký:

Spoken Pass-Phrase Verification in the i-vector Space. 372-377 - Sergey Novoselov, Andrey Shulipa, Ivan Kremnev, Alexandr Kozlov, Vadim Shchemelinin:

On deep speaker embeddings for text-independent speaker recognition. 378-385 - Hossein Zeinali, Hossein Sameti, Themos Stafylakis

:
DeepMine Speech Processing Database: Text-Dependent and Independent Speaker Verification and Speech Recognition in Persian and English. 386-392 - Md. Jahangir Alam, Gautam Bhattacharya, Patrick Kenny:

Boosting the Performance of Spoofing Detection Systems on Replay Attacks Using q-Logarithm Domain Feature Normalization. 393-398

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














