Andreas Stolcke
Person information
- affiliation: Microsoft Research, Mountain View, CA, USA
- affiliation: Microsoft Research
2020 – today
- 2024
- [c204]Ambuje Gupta, Mrinal Rawat, Andreas Stolcke, Roberto Pieraccini:
REFINE on Scarce Data: Retrieval Enhancement Through Fine-Tuning via Model Fusion of Embedding Models. AI (1) 2024: 73-85 - [c203]Hithesh Sankararaman, Mohammed Yasin, Tanner Sorensen, Alessandro Di Bari, Andreas Stolcke:
Provenance: A Light-weight Fact-checker for Retrieval Augmented LLM Generation Output. EMNLP (Industry Track) 2024: 1305-1313 - [c202]Guan-Ting Lin, Prashanth Gurunath Shivakumar, Ankur Gandhe, Chao-Han Huck Yang, Yile Gu, Shalini Ghosh, Andreas Stolcke, Hung-Yi Lee, Ivan Bulyko:
Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue. ICASSP 2024: 10316-10320 - [c201]Chenyang Gao, Brecht Desplanques, Chelsea J.-T. Ju, Aman Chadha, Andreas Stolcke:
Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models. ICASSP 2024: 10836-10840 - [c200]Jinhan Wang, Long Chen, Aparna Khare, Anirudh Raju, Pranav Dheram, Di He, Minhua Wu, Andreas Stolcke, Venkatesh Ravichandran:
Turn-Taking and Backchannel Prediction with Acoustic and Large Language Model Fusion. ICASSP 2024: 12121-12125 - [c199]Kevin Everson, Yile Gu, Chao-Han Huck Yang, Prashanth Gurunath Shivakumar, Guan-Ting Lin, Jari Kolehmainen, Ivan Bulyko, Ankur Gandhe, Shalini Ghosh, Wael Hamza, Hung-Yi Lee, Ariya Rastrow, Andreas Stolcke:
Towards ASR Robust Spoken Language Understanding Through in-Context Learning with Word Confusion Networks. ICASSP 2024: 12856-12860 - [i65]Kevin Everson, Yile Gu, Chao-Han Huck Yang, Prashanth Gurunath Shivakumar, Guan-Ting Lin, Jari Kolehmainen, Ivan Bulyko, Ankur Gandhe, Shalini Ghosh, Wael Hamza, Hung-yi Lee, Ariya Rastrow, Andreas Stolcke:
Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks. CoRR abs/2401.02921 (2024) - [i64]Yu Yu, Chao-Han Huck Yang, Tuan Dinh, Sungho Ryu, Jari Kolehmainen, Roger Ren, Denis Filimonov, Prashanth Gurunath Shivakumar, Ankur Gandhe, Ariya Rastrow, Jia Xu, Ivan Bulyko, Andreas Stolcke:
Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition. CoRR abs/2401.10447 (2024) - [i63]Chenyang Gao, Brecht Desplanques, Chelsea J.-T. Ju, Aman Chadha, Andreas Stolcke:
Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models. CoRR abs/2401.12440 (2024) - [i62]Jinhan Wang, Long Chen, Aparna Khare, Anirudh Raju, Pranav Dheram, Di He, Minhua Wu, Andreas Stolcke, Venkatesh Ravichandran:
Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion. CoRR abs/2401.14717 (2024) - [i61]Chao-Han Huck Yang, Taejin Park, Yuan Gong, Yuanchao Li, Zhehuai Chen, Yen-Ting Lin, Chen Chen, Yuchen Hu, Kunal Dhawan, Piotr Zelasko, Chao Zhang, Yun-Nung Chen, Yu Tsao, Jagadeesh Balam, Boris Ginsburg, Sabato Marco Siniscalchi, Eng Siong Chng, Peter Bell, Catherine Lai, Shinji Watanabe, Andreas Stolcke:
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition. CoRR abs/2409.09785 (2024) - [i60]Ambuje Gupta, Mrinal Rawat, Andreas Stolcke, Roberto Pieraccini:
REFINE on Scarce Data: Retrieval Enhancement through Fine-Tuning via Model Fusion of Embedding Models. CoRR abs/2410.12890 (2024) - [i59]Hithesh Sankararaman, Mohammed Yasin, Tanner Sorensen, Alessandro Di Bari, Andreas Stolcke:
Provenance: A Light-weight Fact-checker for Retrieval Augmented LLM Generation Output. CoRR abs/2411.01022 (2024) - [i58]Shashi Kumar, Iuliia Thorbecke, Sergio Burdisso, Esaú Villatoro-Tello, Manjunath K. E, Kadri Hacioglu, Pradeep Rangappa, Petr Motlícek, Aravind Ganapathiraju, Andreas Stolcke:
Performance evaluation of SLAM-ASR: The Good, the Bad, the Ugly, and the Way Forward. CoRR abs/2411.03866 (2024) - [i57]Aaron Zheng, Mansi Rana, Andreas Stolcke:
Lightweight Safety Guardrails Using Fine-tuned BERT Embeddings. CoRR abs/2411.14398 (2024) - [i56]Nikhil Kumar Koditala, Chelsea Jui-Ting Ju, Ruirui Li, Minho Jin, Aman Chadha, Andreas Stolcke:
Improving speaker verification robustness with synthetic emotional utterances. CoRR abs/2412.00319 (2024)
- 2023
- [c198]Chao-Han Huck Yang, Yile Gu, Yi-Chieh Liu, Shalini Ghosh, Ivan Bulyko, Andreas Stolcke:
Generative Speech Recognition Error Correction With Large Language Models and Task-Activating Prompting. ASRU 2023: 1-8 - [c197]Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth Gurunath Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastrow, Ivan Bulyko:
Low-Rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition. ASRU 2023: 1-8 - [c196]Do June Min, Andreas Stolcke, Anirudh Raju, Colin Vaz, Di He, Venkatesh Ravichandran, Viet Anh Trinh:
Adaptive Endpointing with Deep Contextual Multi-Armed Bandits. ICASSP 2023: 1-5 - [c195]Rahul Pandey, Roger Ren, Qi Luo, Jing Liu, Ariya Rastrow, Ankur Gandhe, Denis Filimonov, Grant P. Strimel, Andreas Stolcke, Ivan Bulyko:
Procter: Pronunciation-Aware Contextual Adapter For Personalized Speech Recognition In Neural Transducers. ICASSP 2023: 1-5 - [c194]Srinath Tankasala, Long Chen, Andreas Stolcke, Anirudh Raju, Qianli Deng, Chander Chandak, Aparna Khare, Roland Maas, Venkatesh Ravichandran:
Cross-Utterance ASR Rescoring with Graph-Based Label Propagation. ICASSP 2023: 1-5 - [c193]Aakriti Agrawal, Milind Rao, Anit Kumar Sahu, Gopinath Chennupati, Andreas Stolcke:
Learning When to Trust Which Teacher for Weakly Supervised ASR. INTERSPEECH 2023: 381-385 - [c192]Denis Filimonov, Prabhat Pandey, Ariya Rastrow, Ankur Gandhe, Andreas Stolcke:
Streaming Speech-to-Confusion Network Speech Recognition. INTERSPEECH 2023: 4099-4103 - [i55]Do June Min, Andreas Stolcke, Anirudh Raju, Colin Vaz, Di He, Venkatesh Ravichandran, Viet Anh Trinh:
Adaptive Endpointing with Deep Contextual Multi-armed Bandits. CoRR abs/2303.13407 (2023) - [i54]Srinath Tankasala, Long Chen, Andreas Stolcke, Anirudh Raju, Qianli Deng, Chander Chandak, Aparna Khare, Roland Maas, Venkatesh Ravichandran:
Cross-utterance ASR Rescoring with Graph-based Label Propagation. CoRR abs/2303.15132 (2023) - [i53]Rahul Pandey, Roger Ren, Qi Luo, Jing Liu, Ariya Rastrow, Ankur Gandhe, Denis Filimonov, Grant P. Strimel, Andreas Stolcke, Ivan Bulyko:
PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers. CoRR abs/2303.17131 (2023) - [i52]Denis Filimonov, Prabhat Pandey, Ariya Rastrow, Ankur Gandhe, Andreas Stolcke:
Streaming Speech-to-Confusion Network Speech Recognition. CoRR abs/2306.03778 (2023) - [i51]Aakriti Agrawal, Milind Rao, Anit Kumar Sahu, Gopinath Chennupati, Andreas Stolcke:
Learning When to Trust Which Teacher for Weakly Supervised ASR. CoRR abs/2306.12012 (2023) - [i50]Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth Gurunath Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastrow, Ivan Bulyko:
Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition. CoRR abs/2309.15223 (2023) - [i49]Chao-Han Huck Yang, Yile Gu, Yi-Chieh Liu, Shalini Ghosh, Ivan Bulyko, Andreas Stolcke:
Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting. CoRR abs/2309.15649 (2023) - [i48]Guan-Ting Lin, Prashanth Gurunath Shivakumar, Ankur Gandhe, Chao-Han Huck Yang, Yile Gu, Shalini Ghosh, Andreas Stolcke, Hung-yi Lee, Ivan Bulyko:
Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue. CoRR abs/2312.15316 (2023)
- 2022
- [c191]Scott Novotney, Sreeparna Mukherjee, Zeeshan Ahmed, Andreas Stolcke:
CUE Vectors: Modular Training of Language Models Conditioned on Diverse Contextual Signals. ACL (Findings) 2022: 3368-3379 - [c190]Liyan Xu, Yile Gu, Jari Kolehmainen, Haidar Khan, Ankur Gandhe, Ariya Rastrow, Andreas Stolcke, Ivan Bulyko:
RescoreBERT: Discriminative Speech Recognition Rescoring With Bert. ICASSP 2022: 6117-6121 - [c189]Metehan Cekic, Ruirui Li, Zeya Chen, Yuguang Yang, Andreas Stolcke, Upamanyu Madhow:
Self-Supervised Speaker Recognition Training using Human-Machine Dialogues. ICASSP 2022: 6132-6136 - [c188]Chao-Han Huck Yang, Zeeshan Ahmed, Yile Gu, Joseph Szurley, Roger Ren, Linda Liu, Andreas Stolcke, Ivan Bulyko:
Mitigating Closed-Model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition. ICASSP 2022: 6302-6306 - [c187]K. C. Kishan, Zhenning Tan, Long Chen, Minho Jin, Eunjung Han, Andreas Stolcke, Chul Lee:
OpenFEAT: Improving Speaker Identification by Open-Set Few-Shot Embedding Adaptation with Transformer. ICASSP 2022: 7062-7066 - [c186]Hua Shen, Yuguang Yang, Guoli Sun, Ryan Langman, Eunjung Han, Jasha Droppo, Andreas Stolcke:
Improving Fairness in Speaker Verification via Group-Adapted Fusion Network. ICASSP 2022: 7077-7081 - [c185]Xin Zhang, Minho Jin, Roger Cheng, Ruirui Li, Eunjung Han, Andreas Stolcke:
Contrastive-mixup Learning for Improved Speaker Verification. ICASSP 2022: 7652-7656 - [c184]Aparna Khare, Eunjung Han, Yuguang Yang, Andreas Stolcke:
ASR-Aware End-to-End Neural Diarization. ICASSP 2022: 8092-8096 - [c183]Pranav Dheram, Murugesan Ramakrishnan, Anirudh Raju, I-Fan Chen, Brian King, Katherine Powell, Melissa Saboowala, Karan Shetty, Andreas Stolcke:
Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities. INTERSPEECH 2022: 1268-1272 - [c182]Viet Anh Trinh, Pegah Ghahremani, Brian John King, Jasha Droppo, Andreas Stolcke, Roland Maas:
Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation. INTERSPEECH 2022: 1298-1302 - [c181]Minho Jin, Chelsea Ju, Zeya Chen, Yi-Chieh Liu, Jasha Droppo, Andreas Stolcke:
Adversarial Reweighting for Speaker Verification Fairness. INTERSPEECH 2022: 4800-4804 - [c180]Long Chen, Yixiong Meng, Venkatesh Ravichandran, Andreas Stolcke:
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification. INTERSPEECH 2022: 4805-4809 - [c179]Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. SLT 2022: 1074-1080 - [i47]Liyan Xu, Yile Gu, Jari Kolehmainen, Haidar Khan, Ankur Gandhe, Ariya Rastrow, Andreas Stolcke, Ivan Bulyko:
RescoreBERT: Discriminative Speech Recognition Rescoring with BERT. CoRR abs/2202.01094 (2022) - [i46]Aparna Khare, Eunjung Han, Yuguang Yang, Andreas Stolcke:
ASR-Aware End-to-end Neural Diarization. CoRR abs/2202.01286 (2022) - [i45]Metehan Cekic, Ruirui Li, Zeya Chen, Yuguang Yang, Andreas Stolcke, Upamanyu Madhow:
Self-supervised Speaker Recognition Training Using Human-Machine Dialogues. CoRR abs/2202.03484 (2022) - [i44]Chao-Han Huck Yang, Zeeshan Ahmed, Yile Gu, Joseph Szurley, Roger Ren, Linda Liu, Andreas Stolcke, Ivan Bulyko:
Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition. CoRR abs/2202.08532 (2022) - [i43]Xin Zhang, Minho Jin, Roger Cheng, Ruirui Li, Eunjung Han, Andreas Stolcke:
Contrastive-mixup learning for improved speaker verification. CoRR abs/2202.10672 (2022) - [i42]Hua Shen, Yuguang Yang, Guoli Sun, Ryan Langman, Eunjung Han, Jasha Droppo, Andreas Stolcke:
Improving fairness in speaker verification via Group-adapted Fusion Network. CoRR abs/2202.11323 (2022) - [i41]Scott Novotney, Sreeparna Mukherjee, Zeeshan Ahmed, Andreas Stolcke:
CUE Vectors: Modular Training of Language Models Conditioned on Diverse Contextual Signals. CoRR abs/2203.08774 (2022) - [i40]Long Chen, Yixiong Meng, Venkatesh Ravichandran, Andreas Stolcke:
Graph-based Multi-View Fusion and Local Adaptation: Mitigating Within-Household Confusability for Speaker Identification. CoRR abs/2207.04081 (2022) - [i39]Minho Jin, Chelsea J.-T. Ju, Zeya Chen, Yi-Chieh Liu, Jasha Droppo, Andreas Stolcke:
Adversarial Reweighting for Speaker Verification Fairness. CoRR abs/2207.07776 (2022) - [i38]Viet Anh Trinh, Pegah Ghahremani, Brian John King, Jasha Droppo, Andreas Stolcke, Roland Maas:
Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation. CoRR abs/2207.07850 (2022) - [i37]Pranav Dheram, Murugesan Ramakrishnan, Anirudh Raju, I-Fan Chen, Brian King, Katherine Powell, Melissa Saboowala, Karan Shetty, Andreas Stolcke:
Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities. CoRR abs/2207.11345 (2022) - [i36]Chao-Han Huck Yang, I-Fan Chen, Andreas Stolcke, Sabato Marco Siniscalchi, Chin-Hui Lee:
An Experimental Study on Private Aggregation of Teacher Ensemble Learning for End-to-End Speech Recognition. CoRR abs/2210.05614 (2022) - [i35]Xin Zhang, Iván Vallés-Pérez, Andreas Stolcke, Chengzhu Yu, Jasha Droppo, Olabanji Shonibare, Roberto Barra-Chicote, Venkatesh Ravichandran:
Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech. CoRR abs/2211.09731 (2022)
- 2021
- [c178]Richard Diehl Martinez, Scott Novotney, Ivan Bulyko, Ariya Rastrow, Andreas Stolcke, Ankur Gandhe:
Attention-based Contextual Language Model Adaptation for Speech Recognition. ACL/IJCNLP (Findings) 2021: 1994-2003 - [c177]Zhenning Tan, Yuguang Yang, Eunjung Han, Andreas Stolcke:
Improving Speaker Identification for Shared Devices by Adapting Embeddings to Speaker Subsets. ASRU 2021: 1124-1131 - [c176]Mao Li, Bo Yang, Joshua Levy, Andreas Stolcke, Viktor Rozgic, Spyros Matsoukas, Constantinos Papayiannis, Daniel Bone, Chao Wang:
Contrastive Unsupervised Learning for Speech Emotion Recognition. ICASSP 2021: 6329-6333 - [c175]Hu Hu, Xuesong Yang, Zeynab Raeesy, Jinxi Guo, Gokce Keskin, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Roland Maas:
REDAT: Accent-Invariant Representation for End-To-End ASR by Domain Adversarial Training with Relabeling. ICASSP 2021: 6408-6412 - [c174]Eunjung Han, Chul Lee, Andreas Stolcke:
BW-EDA-EEND: streaming END-TO-END Neural Speaker Diarization for a Variable Number of Speakers. ICASSP 2021: 7193-7197 - [c173]Surabhi Punjabi, Harish Arsikere, Zeynab Raeesy, Chander Chandak, Nikhil Bhave, Ankish Bansal, Markus Müller, Sergio Murillo, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Sri Garimella, Roland Maas, Mat Hans, Athanasios Mouchtaris, Siegfried Kunzmann:
Joint ASR and Language Identification Using RNN-T: An Efficient Approach to Dynamic Language Switching. ICASSP 2021: 7218-7222 - [c172]Aditya Gourav, Linda Liu, Ankur Gandhe, Yile Gu, Guitang Lan, Xiangyang Huang, Shashank Kalmane, Gautam Tiwari, Denis Filimonov, Ariya Rastrow, Andreas Stolcke, Ivan Bulyko:
Personalization Strategies for End-to-End Speech Recognition Systems. ICASSP 2021: 7348-7352 - [c171]Milind Rao, Pranav Dheram, Gautam Tiwari, Anirudh Raju, Jasha Droppo, Ariya Rastrow, Andreas Stolcke:
DO as I Mean, Not as I Say: Sequence Loss Training for Spoken Language Understanding. ICASSP 2021: 7473-7477 - [c170]Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas:
wav2vec-C: A Self-Supervised Model for Speech Representation Learning. Interspeech 2021: 711-715 - [c169]Yi-Chieh Liu, Eunjung Han, Chul Lee, Andreas Stolcke:
End-to-End Neural Diarization: From Transformer to Conformer. Interspeech 2021: 3081-3085 - [c168]Swayambhu Nath Ray, Minhua Wu, Anirudh Raju, Pegah Ghahremani, Raghavendra Bilgi, Milind Rao, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Jasha Droppo:
Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End. Interspeech 2021: 3455-3459 - [c167]Long Chen, Venkatesh Ravichandran, Andreas Stolcke:
Graph-Based Label Propagation for Semi-Supervised Speaker Identification. Interspeech 2021: 4588-4592 - [c166]Ruirui Li, Chelsea J.-T. Ju, Zeya Chen, Hongda Mao, Oguz Elibol, Andreas Stolcke:
Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition. Interspeech 2021: 4593-4597 - [c165]Desh Raj, Leibny Paola García-Perera, Zili Huang, Shinji Watanabe, Daniel Povey, Andreas Stolcke, Sanjeev Khudanpur:
DOVER-Lap: A Method for Combining Overlap-Aware Diarization Outputs. SLT 2021: 881-888 - [i34]Mao Li, Bo Yang, Joshua Levy, Andreas Stolcke, Viktor Rozgic, Spyros Matsoukas, Constantinos Papayiannis, Daniel Bone, Chao Wang:
Contrastive Unsupervised Learning for Speech Emotion Recognition. CoRR abs/2102.06357 (2021) - [i33]Milind Rao, Pranav Dheram, Gautam Tiwari, Anirudh Raju, Jasha Droppo, Ariya Rastrow, Andreas Stolcke:
Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding. CoRR abs/2102.06750 (2021) - [i32]Aditya Gourav, Linda Liu, Ankur Gandhe, Yile Gu, Guitang Lan, Xiangyang Huang, Shashank Kalmane, Gautam Tiwari, Denis Filimonov, Ariya Rastrow, Andreas Stolcke, Ivan Bulyko:
Personalization Strategies for End-to-End Speech Recognition Systems. CoRR abs/2102.07739 (2021) - [i31]Samik Sadhu, Di He, Che-Wei Huang, Sri Harish Mallidi, Minhua Wu, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Roland Maas:
Wav2vec-C: A Self-supervised Model for Speech Representation Learning. CoRR abs/2103.08393 (2021) - [i30]Wen Wang, Andreas Stolcke, Jing Zheng:
Reranking Machine Translation Hypotheses with Structured and Web-based Language Models. CoRR abs/2104.12277 (2021) - [i29]Swayambhu Nath Ray, Minhua Wu, Anirudh Raju, Pegah Ghahremani, Raghavendra Bilgi, Milind Rao, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Jasha Droppo:
Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End. CoRR abs/2105.07071 (2021) - [i28]Richard Diehl Martinez, Scott Novotney, Ivan Bulyko, Ariya Rastrow, Andreas Stolcke, Ankur Gandhe:
Attention-based Contextual Language Model Adaptation for Speech Recognition. CoRR abs/2106.01451 (2021) - [i27]Yi-Chieh Liu, Eunjung Han, Chul Lee, Andreas Stolcke:
End-to-end Neural Diarization: From Transformer to Conformer. CoRR abs/2106.07167 (2021) - [i26]Long Chen, Venkatesh Ravichandran, Andreas Stolcke:
Graph-based Label Propagation for Semi-Supervised Speaker Identification. CoRR abs/2106.08207 (2021) - [i25]Ruirui Li, Chelsea J.-T. Ju, Zeya Chen, Hongda Mao, Oguz Elibol, Andreas Stolcke:
Fusion of Embeddings Networks for Robust Combination of Text Dependent and Independent Speaker Recognition. CoRR abs/2106.10169 (2021) - [i24]Zhenning Tan, Yuguang Yang, Eunjung Han, Andreas Stolcke:
Improving Speaker Identification for Shared Devices by Adapting Embeddings to Speaker Subsets. CoRR abs/2109.02576 (2021)
- 2020
- [c164]Dave Makhervaks, William Hinthorn, Dimitrios Dimitriadis, Andreas Stolcke:
Combining Acoustics, Content and Interaction Features to Find Hot Spots in Meetings. ICASSP 2020: 8054-8058 - [c163]Ruirui Li, Jyun-Yu Jiang, Xian Wu, Chu-Cheng Hsieh, Andreas Stolcke:
Speaker Identification for Household Scenarios with Self-Attention and Adversarial Training. INTERSPEECH 2020: 2272-2276 - [c162]Jinxi Guo, Gautam Tiwari, Jasha Droppo, Maarten Van Segbroeck, Che-Wei Huang, Andreas Stolcke, Roland Maas:
Efficient Minimum Word Error Rate Training of RNN-Transducer for End-to-End Speech Recognition. INTERSPEECH 2020: 2807-2811 - [c161]Andreas Stolcke:
Improving Diarization Robustness using Diversification, Randomization and the DOVER Algorithm. Odyssey 2020: 95-101 - [i23]Jinxi Guo, Gautam Tiwari, Jasha Droppo, Maarten Van Segbroeck, Che-Wei Huang, Andreas Stolcke, Roland Maas:
Efficient minimum word error rate training of RNN-Transducer for end-to-end speech recognition. CoRR abs/2007.13802 (2020) - [i22]Desh Raj, Leibny Paola García-Perera, Zili Huang, Shinji Watanabe, Daniel Povey, Andreas Stolcke, Sanjeev Khudanpur:
DOVER-Lap: A Method for Combining Overlap-aware Diarization Outputs. CoRR abs/2011.01997 (2020) - [i21]Eunjung Han, Chul Lee, Andreas Stolcke:
BW-EDA-EEND: Streaming End-to-End Neural Speaker Diarization for a Variable Number of Speakers. CoRR abs/2011.02678 (2020) - [i20]Hu Hu, Xuesong Yang, Zeynab Raeesy, Jinxi Guo, Gökçe Keskin, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Roland Maas:
REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling. CoRR abs/2012.07353 (2020)
2010 – 2019
- 2019
- [c160]Andreas Stolcke, Takuya Yoshioka:
Dover: A Method for Combining Diarization Outputs. ASRU 2019: 757-763 - [c159]Bryan Li, Dimitrios Dimitriadis, Andreas Stolcke:
Acoustic and Lexical Sentiment Analysis for Customer Service Calls. ICASSP 2019: 5876-5880 - [c158]Takuya Yoshioka, Dimitrios Dimitriadis, Andreas Stolcke, William Hinthorn, Zhuo Chen, Michael Zeng, Xuedong Huang:
Meeting Transcription Using Asynchronous Distant Microphones. INTERSPEECH 2019: 2968-2972 - [i19]Takuya Yoshioka, Zhuo Chen, Dimitrios Dimitriadis, William Hinthorn, Xuedong Huang, Andreas Stolcke, Michael Zeng:
Meeting Transcription Using Virtual Microphone Arrays. CoRR abs/1905.02545 (2019) - [i18]Andreas Stolcke, Takuya Yoshioka:
DOVER: A Method for Combining Diarization Outputs. CoRR abs/1909.08090 (2019) - [i17]Dave Makhervaks, William Hinthorn, Dimitrios Dimitriadis, Andreas Stolcke:
Combining Acoustics, Content and Interaction Features to Find Hot Spots in Meetings. CoRR abs/1910.10869 (2019) - [i16]Andreas Stolcke:
Improving Diarization Robustness using Diversification, Randomization and the DOVER Algorithm. CoRR abs/1910.11691 (2019)
- 2018
- [j27]Jorge Proença, Carla Lopes, Michael Tjalve, Andreas Stolcke, Sara Candeias, Fernando Perdigão:
Mispronunciation Detection in Children's Reading of Sentences. IEEE ACM Trans. Audio Speech Lang. Process. 26(7): 1203-1215 (2018) - [c157]Wayne Xiong, Lingfeng Wu, Jun Zhang, Andreas Stolcke:
Session-level Language Modeling for Conversational Speech. EMNLP 2018: 2764-2768 - [c156]Wayne Xiong, Lingfeng Wu, Fil Alleva, Jasha Droppo, Xuedong Huang, Andreas Stolcke:
The Microsoft 2017 Conversational Speech Recognition System. ICASSP 2018: 5934-5938
- 2017
- [j26]Jorge Proença, Carla Lopes, Michael Tjalve, Andreas Stolcke, Sara Candeias, Fernando Perdigão:
Automatic evaluation of reading aloud performance in children. Speech Commun. 94: 1-14 (2017) - [j25]Wayne Xiong, Jasha Droppo, Xuedong Huang, Frank Seide, Michael L. Seltzer, Andreas Stolcke, Dong Yu, Geoffrey Zweig:
Toward Human Parity in Conversational Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(12): 2410-2423 (2017) - [c155]Geoffrey Zweig, Chengzhu Yu, Jasha Droppo, Andreas Stolcke:
Advances in all-neural speech recognition. ICASSP 2017: 4805-4809 - [c154]Wayne Xiong, Jasha Droppo, Xuedong Huang, Frank Seide, Mike Seltzer, Andreas Stolcke, Dong Yu, Geoffrey Zweig:
The microsoft 2016 conversational speech recognition system. ICASSP 2017: 5255-5259 - [c153]Andreas Stolcke, Jasha Droppo:
Comparing Human and Machine Errors in Conversational Speech Transcription. INTERSPEECH 2017: 137-141 - [c152]Jorge Proença, Carla Lopes, Michael Tjalve, Andreas Stolcke, Sara Candeias, Fernando Perdigão:
Detection of Mispronunciations and Disfluencies in Children Reading Aloud. INTERSPEECH 2017: 1437-1441 - [c151]Jorge Proença, Carla Lopes, Michael Tjalve, Andreas Stolcke, Sara Candeias, Fernando Perdigão:
Automatic Evaluation of Children Reading Aloud on Sentences and Pseudowords. INTERSPEECH 2017: 2749-2753 - [i15]Wayne Xiong, Lingfeng Wu, Fil Alleva, Jasha Droppo,