default search action

combined dblp search
author search
venue search
publication search

ask others

Egor Lakomkin

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/JiaKZL0WSMK25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/JiaKZL0WSMK25
Junteng Jia, Gil Keren, Wei Zhou, Egor Lakomkin, Xiaohui Zhang, Chunyang Wu, Frank Seide, Jay Mahadeokar, Ozlem Kalinli:
Efficient Streaming LLM for Speech Recognition. ICASSP 2025: 1-5
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangRLMJKLHDMK25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangRLMJKLHDMK25
Yufeng Yang, Desh Raj, Ju Lin, Niko Moritz, Junteng Jia, Gil Keren, Egor Lakomkin, Yiteng Huang, Jacob Donley, Jay Mahadeokar, Ozlem Kalinli:
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses. ICASSP 2025: 1-5
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhaoMLXXZAGLF25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhaoMLXXZAGLF25
Jinzheng Zhao, Niko Moritz, Egor Lakomkin, Ruiming Xie, Zhiping Xiu, Katerina Zmolíková, Zeeshan Ahmed, Yashesh Gaur, Duc Le, Christian Fuegen:
Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens. ICASSP 2025: 1-5
[c20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KangJWZLGSKLMK25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KangJWZLGSKLMK25
Wonjune Kang, Junteng Jia, Chunyang Wu, Wei Zhou, Egor Lakomkin, Yashesh Gaur, Leda Sari, Suyoun Kim, Ke Li, Jay Mahadeokar, Ozlem Kalinli:
Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech. INTERSPEECH 2025
2024
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LakomkinWFKSF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LakomkinWFKSF24
Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen:
End-to-End Speech Recognition Contextualization with Large Language Models. ICASSP 2024: 12406-12410
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FathullahWLJSLG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FathullahWLJSLG24
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. ICASSP 2024: 13351-13355
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/FathullahWLLJSM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/FathullahWLLJSM24
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Ke Li, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs. NAACL-HLT 2024: 5522-5532
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-11494
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-11494
Yufeng Yang, Desh Raj, Ju Lin, Niko Moritz, Junteng Jia, Gil Keren, Egor Lakomkin, Yiteng Huang, Jacob Donley, Jay Mahadeokar, Ozlem Kalinli:
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses. CoRR abs/2409.11494 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-01162
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-01162
Wonjune Kang, Junteng Jia, Chunyang Wu, Wei Zhou, Egor Lakomkin, Yashesh Gaur, Leda Sari, Suyoun Kim, Ke Li, Jay Mahadeokar, Ozlem Kalinli:
Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech. CoRR abs/2410.01162 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-03752
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-03752
Junteng Jia, Gil Keren, Wei Zhou, Egor Lakomkin, Xiaohui Zhang, Chunyang Wu, Frank Seide, Jay Mahadeokar, Ozlem Kalinli:
Efficient Streaming LLM for Speech Recognition. CoRR abs/2410.03752 (2024)
2023
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/LiuLV0CXDMKPPF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/LiuLV0CXDMKPPF23
Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolár, Stavros Petridis, Maja Pantic, Christian Fuegen:
SynthVSR: Scaling Up Visual Speech RecognitionWith Synthetic Supervision. CVPR 2023: 18806-18815
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SharmaHLLLK23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SharmaHLLLK23
Roshan Sharma, Weipeng He, Ju Lin, Egor Lakomkin, Yang Liu, Kaustubh Kalgaonkar:
Egocentric Audio-Visual Noise Suppression. ICASSP 2023: 1-5
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-17200
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-17200
Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolár, Stavros Petridis, Maja Pantic, Christian Fuegen:
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision. CoRR abs/2303.17200 (2023)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-11795
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-11795
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. CoRR abs/2307.11795 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-10917
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10917
Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen:
End-to-End Speech Recognition Contextualization with Large Language Models. CoRR abs/2309.10917 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-06753
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-06753
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data. CoRR abs/2311.06753 (2023)
2022
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HeymannLR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HeymannLR22
Jahn Heymann, Egor Lakomkin, Leif Rädel:
Being Greedy Does Not Hurt: Sampling Strategies for End-To-End Speech Recognition. ICASSP 2022: 7787-7791
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-03643
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-03643
Roshan Sharma, Weipeng He, Ju Lin, Egor Lakomkin, Yang Liu, Kaustubh Kalgaonkar:
Egocentric Audio-Visual Noise Suppression. CoRR abs/2211.03643 (2022)
2020
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LakomkinHSW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LakomkinHSW20
Egor Lakomkin, Jahn Heymann, Ilya Sklyar, Simon Wiesler:
Subword Regularization: An Analysis of Scalability and Generalization for End-to-End Automatic Speech Recognition. INTERSPEECH 2020: 3600-3604
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-04034
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-04034
Egor Lakomkin, Jahn Heymann, Ilya Sklyar, Simon Wiesler:
Subword Regularization: An Analysis of Scalability and Generalization for End-to-End Automatic Speech Recognition. CoRR abs/2008.04034 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/LakomkinZWMW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/LakomkinZWMW19
Egor Lakomkin, Mohammad-Ali Zamani, Cornelius Weber, Sven Magg, Stefan Wermter:
Incorporating End-to-End Speech Recognition Models for Sentiment Analysis. ICRA 2019: 7976-7982
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SpringenbergLWW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SpringenbergLWW19
Sebastian Springenberg, Egor Lakomkin, Cornelius Weber, Stefan Wermter:
Predictive Auxiliary Variational Autoencoder for Representation Learning of Global Speech Characteristics. INTERSPEECH 2019: 934-938
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-11245
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-11245
Egor Lakomkin, Mohammad-Ali Zamani, Cornelius Weber, Sven Magg, Stefan Wermter:
Incorporating End-to-End Speech Recognition Models for Sentiment Analysis. CoRR abs/1902.11245 (2019)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1903-00216
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-00216
Egor Lakomkin, Sven Magg, Cornelius Weber, Stefan Wermter:
KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos. CoRR abs/1903.00216 (2019)
2018
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/LakomkinMWW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/LakomkinMWW18
Egor Lakomkin, Sven Magg, Cornelius Weber, Stefan Wermter:
KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos. EMNLP (Demonstration) 2018: 90-95
[c9]
- view
  - electronic edition @ esann.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/esann/SpringenbergLWW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/esann/SpringenbergLWW18
Sebastian Springenberg, Egor Lakomkin, Cornelius Weber, Stefan Wermter:
Image-to-Text Transduction with Spatial Self-Attention. ESANN 2018
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icann/QuWLTW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icann/QuWLTW18
Leyuan Qu, Cornelius Weber, Egor Lakomkin, Johannes Twiefel, Stefan Wermter:
Combining Articulatory Features with End-to-End Learning in Speech Recognition. ICANN (3) 2018: 500-510
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icra/LakomkinZWMW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icra/LakomkinZWMW18
Egor Lakomkin, Mohammad-Ali Zamani, Cornelius Weber, Sven Magg, Stefan Wermter:
EmoRL: Continuous Acoustic Emotion Classification Using Deep Reinforcement Learning. ICRA 2018: 1-6
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/BarrosCLSSW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/BarrosCLSSW18
Pablo V. A. Barros, Nikhil Churamani, Egor Lakomkin, Henrique Siqueira, Alexander Sutherland, Stefan Wermter:
The OMG-Emotion Behavior Dataset. IJCNN 2018: 1-7
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/iros/LakomkinZWMW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iros/LakomkinZWMW18
Egor Lakomkin, Mohammad-Ali Zamani, Cornelius Weber, Sven Magg, Stefan Wermter:
On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks. IROS 2018: 854-860
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-05434
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-05434
Pablo V. A. Barros, Nikhil Churamani, Egor Lakomkin, Henrique Siqueira, Alexander Sutherland, Stefan Wermter:
The OMG-Emotion Behavior Dataset. CoRR abs/1803.05434 (2018)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-11506
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-11506
Egor Lakomkin, Cornelius Weber, Stefan Wermter:
Automatically augmenting an emotion dataset improves classification using audio. CoRR abs/1803.11506 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-11508
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-11508
Egor Lakomkin, Cornelius Weber, Sven Magg, Stefan Wermter:
Reusing Neural Speech Representations for Auditory Emotion Recognition. CoRR abs/1803.11508 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-11509
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-11509
Egor Lakomkin, Chandrakant Bothe, Stefan Wermter:
GradAscent at EmoInt-2017: Character- and Word-Level Recurrent Neural Network Models for Tweet Emotion Intensity Detection. CoRR abs/1803.11509 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-02173
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-02173
Egor Lakomkin, Mohammad-Ali Zamani, Cornelius Weber, Sven Magg, Stefan Wermter:
On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks. CoRR abs/1804.02173 (2018)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-04053
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-04053
Egor Lakomkin, Mohammad-Ali Zamani, Cornelius Weber, Sven Magg, Stefan Wermter:
EmoRL: Continuous Acoustic Emotion Classification using Deep Reinforcement Learning. CoRR abs/1804.04053 (2018)
2017
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/eacl/WermterLW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eacl/WermterLW17
Egor Lakomkin, Cornelius Weber, Stefan Wermter:
Automatically augmenting an emotion dataset improves classification using audio. EACL (2) 2017: 194-197
[c3]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcnlp/LakomkinWMW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnlp/LakomkinWMW17
Egor Lakomkin, Cornelius Weber, Sven Magg, Stefan Wermter:
Reusing Neural Speech Representations for Auditory Emotion Recognition. IJCNLP(1) 2017: 423-430
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/wassa/LakomkinBW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wassa/LakomkinBW17
Egor Lakomkin, Chandrakant Bothe, Stefan Wermter:
GradAscent at EmoInt-2017: Character and Word Level Recurrent Neural Network Models for Tweet Emotion Intensity Detection. WASSA@EMNLP 2017: 169-174
2011
[c1]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/syrcodis/GapanyukLID11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/syrcodis/GapanyukLID11
Yuriy Gapanyuk, Egor Lakomkin, Sergey Ionkin, Martin Davtyan:
MVC Web Framework Based on eXist Application Server and XRX Architecture. SYRCoDIS 2011: 19-25

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.