Odyssey 2024: Quebec City, Canada
- Najim Dehak, Patrick Cardinal:
Odyssey 2024: The Speaker and Language Recognition Workshop, Quebec City, Canada, June 18-21, 2024. ISCA 2024
Keynotes
- Didier Meuwly:
Development and validation of an automatic approach addressing the forensic question of identity of source - the contribution of the speaker recognition field.
- Craig S. Greenberg:
A Brief History of the NIST Speaker Recognition Evaluations.
- Jesús Villalba:
Towards Speech Processing Robust to Adversarial Deceptions.
- Carlos Busso:
Toward Robust and Discriminative Emotional Speech Representations.
- Joon Son Chung:
Multimodal Learning of Speech and Speaker Representations.
Forensic Speaker Recognition
- Vincent Hughes, Chenzi Xu, Paul Foulkes, Philip Harrison, Poppy Welch, Finnian Kelly, David van der Vloed:
Exploring individual speaker behaviour within a forensic automatic speaker recognition system. 1-8
- Imen Ben Amor, Jean-François Bonastre, David van der Vloed:
Forensic speaker recognition with BA-LR: calibration and evaluation on a forensically realistic database. 9-16
- Petr Motlícek, Erinç Dikici, Srikanth R. Madikeri, Pradeep Rangappa, Miroslav Jánosík, Gerhard Backfried, Dorothea Thomas-Aniola, Maximilian Schürz, Johan Rohdin, Petr Schwarz, Marek Kovác, Kvetoslav Malý, Dominik Bobos, Mathias Leibiger, Costas Kalogiros, Andreas Alexopoulos, Daniel Kudenko, Zahra Ahmadi, Hoang H. Nguyen, Aravind Krishnan, Dawei Zhu, Dietrich Klakow, Maria Jofre, Francesco Calderoni, Denis Marraud, Nikolaos Koutras, Nikos Nikolau, Christiana Aposkiti, Panagiotis Douris, Konstantinos Gkountas, Eleni-Konstantina Sergidou, Wauter Bosma, Joshua Hughes, Hellenic Police Team:
ROXSD: The ROXANNE Multimodal and Simulated Dataset for Advancing Criminal Investigations. 17-24
- Linda Gerlach, Finnian Kelly, Kirsty McDougall, Anil Alexander:
Exploring speaker similarity based selection of relevant populations for forensic automatic speaker recognition. 25-30
Speaker Verification
- Nathan Griot, Mohammad MohammadAmini, Driss Matrouf, Raphaël Blouet, Jean-François Bonastre:
Attention-based Comparison on Aligned Utterances for Text-Dependent Speaker Verification. 31-37
- Théo Lepage, Réda Dehak:
Additive Margin in Contrastive Self-Supervised Frameworks to Learn Discriminative Speaker Representations. 38-42
- Abderrahim Fathan, Xiaolin Zhu, Jahangir Alam:
An investigative study of the effect of several regularization techniques on label noise robustness of self-supervised speaker verification systems. 43-50
- Oleksandra Zamana, Priit Käärd, Tanel Alumäe:
Using Pretrained Language Models for Improved Speaker Identification. 51-58
- Thomas Thebaud, Gabriel Hernández, Sarah Flora Samson Juan, Marie Tahon:
A Phonetic Analysis of Speaker Verification Systems through Phoneme selection and Integrated Gradients. 59-66
Speaker and Language Recognition
- Liam Lonergan, Mengjie Qian, Neasa Ní Chiaráin, Christer Gobl, Ailbhe Ní Chasaide:
Low-resource speech recognition and dialect identification of Irish in a multi-task framework. 67-73
- Aleix Espuña, Amrutha Prasad, Petr Motlícek, Srikanth R. Madikeri, Christof Schüpbach:
Normalizing Flows for Speaker and Language Recognition Backend. 74-80
- Satwik Dutta, Iván López-Espejo, Dwight Irvin, John H. L. Hansen:
Joint Language and Speaker Classification in Naturalistic Bilingual Adult-Toddler Interactions. 81-85
- Karen Jones, Kevin Walker, Christopher Caruso, Stephanie M. Strassel:
MAGLIC: The Maghrebi Language Identification Corpus. 86-90
Speaker Diarization
- Desh Raj, Matthew Wiesner, Matthew Maciejewski, Paola García, Daniel Povey, Sanjeev Khudanpur:
On Speaker Attribution with SURT. 91-98
- Can Cui, Imran A. Sheikh, Mostafa Sadeghi, Emmanuel Vincent:
Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications. 99-106
- Juan Ignacio Álvarez-Trejos, Beltrán Labrador, Alicia Lozano-Diez:
Leveraging Speaker Embeddings in End-to-End Neural Diarization for Two-Speaker Scenarios. 107-114
- Joonas Kalda, Clément Pagés, Ricard Marxer, Tanel Alumäe, Hervé Bredin:
PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings. 115-122
- Lin Zhang, Themos Stafylakis, Federico Landini, Mireia Díez, Anna Silnova, Lukás Burget:
Do End-to-End Neural Diarization Attractors Need to Encode Speaker Characteristic Information? 123-130
- Jenthe Thienpondt, Kris Demuynck:
Speaker Embeddings With Weakly Supervised Voice Activity Detection For Efficient Speaker Diarization. 131-136
Spoofing and Adversarial Attacks
- Mingrui He, Longting Xu, Han Wang, Mingjun Zhang, Rohan Kumar Das:
Device Feature based on Graph Fourier Transformation with Logarithmic Processing For Detection of Replay Speech Attacks. 137-144
- Anh-Tuan Dao, Nicholas W. D. Evans, Driss Matrouf:
Spoofing detection in the wild: an investigation of approaches to improve generalisation. 145-150
- Matan Karo, Arie Yeredor, Itshak Lapidot:
Meaningful Embeddings for Explainable Countermeasures. 151-157
- Hye-jin Shim, Jee-weon Jung, Tomi Kinnunen, Nicholas W. D. Evans, Jean-François Bonastre, Itshak Lapidot:
a-DCF: an architecture agnostic metric with application to spoofing-robust speaker verification. 158-164
- Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak:
Unraveling Adversarial Examples against Speaker Identification - Techniques for Attack Detection and Victim Model Classification. 165-171
Speech Synthesis
- Zongyang Du, Junchen Lu, Kun Zhou, Lakshmish Kaushik, Berrak Sisman:
Converting Anyone's Voice: End-to-End Expressive Voice Conversion with A Conditional Diffusion Model. 172-179
- Kun Zhou, Berrak Sisman, Carlos Busso, Bin Ma, Haizhou Li:
Mixed-EVC: Mixed Emotion Synthesis and Control in Voice Conversion. 180-186
- Thibault Gaudier, Marie Tahon, Anthony Larcher, Yannick Estève:
Automatic Voice Identification after Speech Resynthesis using PPG. 187-193
- Shreeram Suresh Chandra, Zongyang Du, Berrak Sisman:
Exploring speech style spaces with language models: Emotional TTS without emotion labels. 194-200
Speech Pathologies and Fairness
- Anna Favaro, Najim Dehak, Thomas Thebaud, Jesús Villalba, Esther S. Oh, Laureano Moro-Velázquez:
Discovering Invariant Patterns of Cognitive Decline Via an Automated Analysis of the Cookie Thief Picture Description Task. 201-208
- Oubaïda Chouchane, Christoph Busch, Chiara Galdi, Nicholas W. D. Evans, Massimiliano Todisco:
A Comparison of Differential Performance Metrics for the Evaluation of Automatic Speaker Verification Fairness. 209-216
- Japan Bhatt, Harsh Patel, Hemant A. Patil:
Noise Robust Whisper Features for Dysarthric Automatic Speech Recognition. 217-224
Applications and Multimedia
- Reza Amini Gougeh, Nu Zhang, Zeljko Zilic:
Optimizing Auditory Immersion Safety on Edge Devices: An On-Device Sound Event Detection System. 225-231
- Martin Lebourdais, Pablo Gimeno, Théo Mariotte, Marie Tahon, Alfonso Ortega, Anthony Larcher:
3MAS: a multitask, multilabel, multidataset semi-supervised audio segmentation model. 232-239
- Gnana Praveen Rajasekhar, Jahangir Alam:
Cross-Modal Transformers for Audio-Visual Person Verification. 240-246
Emotion Challenge 1
- Lucas Goncalves, Ali N. Salman, Abinay Reddy Naini, Laureano Moro-Velázquez, Thomas Thebaud, Paola García, Najim Dehak, Berrak Sisman, Carlos Busso:
Odyssey 2024 - Speech Emotion Recognition Challenge: Dataset, Baseline Framework, and Results. 247-254
- Henry Härm, Tanel Alumäe:
TalTech Systems for the Odyssey 2024 Emotion Recognition Challenge. 255-259
- Mingjie Chen, Hezhao Zhang, Yuanchao Li, Jiachen Luo, Wen Wu, Ziyang Ma, Peter Bell, Catherine Lai, Joshua D. Reiss, Lin Wang, Philip C. Woodland, Xie Chen, Huy Phan, Thomas Hain:
1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem. 260-265
- Federico Costa, Miquel India, Javier Hernando:
Double Multi-Head Attention Multimodal System for Odyssey 2024 Speech Emotion Recognition Challenge. 266-273
Emotion Challenge 2
- Miguel Ángel Pastor Yoldi, Alfonso Ortega, Antonio Miguel, Dayana Ribas:
The ViVoLab System for the Odyssey Emotion Recognition Challenge 2024 Evaluation. 274-280
- Meysam Shamsi, Lara Gauder, Marie Tahon:
The CONILIUM proposition for Odyssey Emotion Challenge: Leveraging major class with complex annotations. 281-287
- Jaime Bellver-Soler, Iván Martín-Fernández, Jose M. Bravo-Pacheco, Sergio Esteban Romero, Fernando Fernández Martínez, Luis Fernando D'Haro:
Multimodal Audio-Language Model for Speech Emotion Recognition. 288-295
- Adrien Lafore, Clément Pagés, Leila Moudjari, Sebastião Quintas, Hervé Bredin, Thomas Pellegrini, Farah Benamara, Isabelle Ferrané, Jérôme Bertrand, Marie-Françoise Bertrand, Véronique Moriceau, Jérôme Farinas:
IRIT-MFU Multi-modal systems for emotion classification for Odyssey 2024 challenge. 296-302
- Daria Diatlova, Anton Udalov, Vitalii Shutov, Egor Spirin:
Adapting WavLM for Speech Emotion Recognition. 303-308
- Jarod Duret, Yannick Estève, Mickael Rouvier:
MSP-Podcast SER Challenge 2024: L'antenne du Ventoux Multimodal Self-Supervised Learning for Speech Emotion Recognition. 309-314