


Остановите войну!
for scientists:


default search action
Yanmin Qian
Person information

Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [i45]Chenda Li, Yao Qian, Zhuo Chen, Dongmei Wang, Takuya Yoshioka, Shujie Liu, Yanmin Qian, Michael Zeng:
Target Sound Extraction with Variable Cross-modality Clues. CoRR abs/2303.08372 (2023) - [i44]Haibin Yu, Yuxuan Hu, Yao Qian, Ma Jin, Linquan Liu, Shujie Liu, Yu Shi, Yanmin Qian, Edward Lin, Michael Zeng:
Code-Switching Text Generation and Injection in Mandarin-English ASR. CoRR abs/2303.10949 (2023) - 2022
- [j29]Sanyuan Chen
, Chengyi Wang, Zhengyang Chen, Yu Wu
, Shujie Liu, Zhuo Chen, Jinyu Li
, Naoyuki Kanda
, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian
, Yao Qian, Jian Wu, Michael Zeng, Xiangzhan Yu, Furu Wei:
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. IEEE J. Sel. Top. Signal Process. 16(6): 1505-1518 (2022) - [j28]Yanmin Qian
, Zhikai Zhou
:
Optimizing Data Usage for Low-Resource Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 30: 394-403 (2022) - [j27]Chenda Li
, Zhuo Chen, Yanmin Qian
:
Dual-Path Modeling With Memory Embedding Model for Continuous Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1508-1520 (2022) - [j26]Yanmin Qian
, Xun Gong
, Houjun Huang:
Layer-Wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2842-2853 (2022) - [j25]Wangyou Zhang
, Xuankai Chang
, Christoph Böddeker, Tomohiro Nakatani
, Shinji Watanabe
, Yanmin Qian
:
End-to-End Dereverberation, Beamforming, and Speech Recognition in a Cocktail Party. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3173-3188 (2022) - [c139]Yifei Wu, Chenda Li, Jinfeng Bai, Zhongqin Wu, Yanmin Qian:
Time-Domain Audio-Visual Speech Separation on Low Quality Videos. ICASSP 2022: 256-260 - [c138]Chenda Li, Lei Yang, Weiqin Wang, Yanmin Qian:
Skim: Skipping Memory Lstm for Low-Latency Real-Time Continuous Speech Separation. ICASSP 2022: 681-685 - [c137]Zhengyang Chen, Sanyuan Chen, Yu Wu, Yao Qian, Chengyi Wang, Shujie Liu, Yanmin Qian, Michael Zeng:
Large-Scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification. ICASSP 2022: 6147-6151 - [c136]Bing Han, Zhengyang Chen, Yanmin Qian:
Local Information Modeling with Self-Attention for Speaker Verification. ICASSP 2022: 6727-6731 - [c135]Zhikai Zhou, Tian Tan, Yanmin Qian:
Punctuation Prediction for Streaming On-Device Speech Recognition. ICASSP 2022: 7277-7281 - [c134]Bing Han, Zhengyang Chen, Bei Liu, Yanmin Qian:
MLP-SVNET: A Multi-Layer Perceptrons Based Network for Speaker Verification. ICASSP 2022: 7522-7526 - [c133]Bei Liu, Haoyu Wang, Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Knowledge Distillation via Feature Enhancement for Speaker Verification. ICASSP 2022: 7542-7546 - [c132]Wei Wang, Shuo Ren, Yao Qian, Shujie Liu, Yu Shi, Yanmin Qian, Michael Zeng:
Optimizing Alignment of Speech and Language Latent Spaces for End-To-End Speech Recognition and Understanding. ICASSP 2022: 7802-7806 - [c131]Zhikai Zhou, Wei Wang, Wangyou Zhang, Yanmin Qian:
Exploring Effective Data Utilization for Low-Resource Speech Recognition. ICASSP 2022: 8192-8196 - [c130]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160 - [c129]Wei Wang, Xun Gong, Yifei Wu, Zhikai Zhou, Chenda Li, Wangyou Zhang, Bing Han, Yanmin Qian:
The Sjtu System For Multimodal Information Based Speech Processing Challenge 2021. ICASSP 2022: 9261-9265 - [c128]Bei Liu, Zhengyang Chen, Yanmin Qian:
Attentive Feature Fusion for Robust Speaker Verification. INTERSPEECH 2022: 286-290 - [c127]Bei Liu, Zhengyang Chen, Yanmin Qian:
Dual Path Embedding Learning for Speaker Verification with Triplet Attention. INTERSPEECH 2022: 291-295 - [c126]Bei Liu, Zhengyang Chen, Shuai Wang, Haoyu Wang, Bing Han, Yanmin Qian:
DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design. INTERSPEECH 2022: 296-300 - [c125]Leying Zhang, Zhengyang Chen, Yanmin Qian:
Enroll-Aware Attentive Statistics Pooling for Target Speaker Verification. INTERSPEECH 2022: 311-315 - [c124]Tao Liu, Shuai Fan, Xu Xiang
, Hongbo Song, Shaoxiong Lin, Jiaqi Sun, Tianyuan Han, Siyuan Chen, Binwei Yao, Sen Liu, Yifei Wu, Yanmin Qian, Kai Yu:
MSDWild: Multi-modal Speaker Diarization Dataset in the Wild. INTERSPEECH 2022: 1476-1480 - [c123]Xun Gong, Zhikai Zhou, Yanmin Qian:
Knowledge Transfer and Distillation from Autoregressive to Non-Autoregessive Speech Recognition. INTERSPEECH 2022: 2618-2622 - [c122]Bing Han, Zhengyang Chen, Yanmin Qian:
Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label Correction. INTERSPEECH 2022: 4780-4784 - [c121]Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li
, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei:
Separating Long-Form Speech with Group-wise Permutation Invariant Training. INTERSPEECH 2022: 5383-5387 - [c120]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe
:
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. INTERSPEECH 2022: 5458-5462 - [c119]Bowen Qu, Chenda Li, Jinfeng Bai, Yanmin Qian:
Improving Speech Separation with Knowledge Distilled from Self-supervised Pre-trained Models. ISCSLP 2022: 329-333 - [c118]Wei Wang, Wangyou Zhang, Shaoxiong Lin, Yanmin Qian:
Text-Informed Knowledge Distillation for Robust Speech Enhancement and Recognition. ISCSLP 2022: 334-338 - [c117]Zhikai Zhou, Shuang Cao, Zhengyang Chen, Bei Liu, Ming Xia, Hong Jiang, Yanmin Qian:
Medical Difficult Airway Detection using Speech Technology. ISCSLP 2022: 349-353 - [c116]Houjun Huang, Yanmin Qian:
Speaking style compensation on synthetic audio for robust keyword spotting. ISCSLP 2022: 448-452 - [c115]Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan:
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines. ISCSLP 2022: 488-492 - [c114]Tao Liu, Xu Xiang, Zhengyang Chen, Bing Han, Kai Yu, Yanmin Qian:
The X-Lance Speaker Diarization System for the Conversational Short-phrase Speaker Diarization Challenge 2022. ISCSLP 2022: 498-501 - [c113]Robin Scheibler, Wangyou Zhang, Xuankai Chang, Shinji Watanabe
, Yanmin Qian:
End-to-End Multi-Speaker ASR with Independent Vector Analysis. SLT 2022: 496-501 - [c112]Zhengyang Chen, Yao Qian, Bing Han, Yanmin Qian, Michael Zeng:
A Comprehensive Study on Self-Supervised Distillation for Speaker Representation Learning. SLT 2022: 599-604 - [i43]Chenda Li, Lei Yang, Weiqin Wang, Yanmin Qian:
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation. CoRR abs/2201.10800 (2022) - [i42]Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022) - [i41]Robin Scheibler, Wangyou Zhang, Xuankai Chang, Shinji Watanabe, Yanmin Qian:
End-to-End Multi-speaker ASR with Independent Vector Analysis. CoRR abs/2204.00218 (2022) - [i40]Xun Gong, Yizhou Lu, Zhikai Zhou, Yanmin Qian:
Layer-wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition. CoRR abs/2204.09883 (2022) - [i39]Zhengyang Chen, Bei Liu, Bing Han, Leying Zhang, Yanmin Qian:
The SJTU X-LANCE Lab System for CNSRC 2022. CoRR abs/2206.11699 (2022) - [i38]Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe
:
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. CoRR abs/2207.09514 (2022) - [i37]Xun Gong, Zhikai Zhou, Yanmin Qian:
Knowledge Transfer and Distillation from Autoregressive to Non-Autoregressive Speech Recognition. CoRR abs/2207.10600 (2022) - [i36]Bing Han, Zhengyang Chen, Yanmin Qian:
Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label Correction. CoRR abs/2208.01928 (2022) - [i35]Bing Han, Zhengyang Chen, Zhikai Zhou, Yanmin Qian:
The SJTU System for Short-duration Speaker Verification Challenge 2021. CoRR abs/2208.01933 (2022) - [i34]Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan:
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines. CoRR abs/2208.08042 (2022) - [i33]Zhengyang Chen, Bing Han, Xu Xiang
, Houjun Huang, Bei Liu, Yanmin Qian:
SJTU-AISPEECH System for VoxCeleb Speaker Recognition Challenge 2022. CoRR abs/2209.09076 (2022) - [i32]Zhengyang Chen, Yao Qian, Bing Han, Yanmin Qian, Michael Zeng:
A comprehensive study on self-supervised distillation for speaker representation learning. CoRR abs/2210.15936 (2022) - [i31]Hongji Wang, Chengdong Liang, Shuai Wang, Zhengyang Chen, Binbin Zhang, Xu Xiang
, Yanlei Deng, Yanmin Qian:
Wespeaker: A Research and Production oriented Speaker Embedding Learning Toolkit. CoRR abs/2210.17016 (2022) - [i30]Zhengyang Chen, Bing Han, Xu Xiang
, Houjun Huang, Bei Liu, Yanmin Qian:
Build a SRE Challenge System: Lessons from VoxSRC 2022 and CNSRC 2022. CoRR abs/2211.00815 (2022) - [i29]Xun Gong, Yu Wu, Jinyu Li
, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian:
LongFNT: Long-form Speech Recognition with Factorized Neural Transducer. CoRR abs/2211.09412 (2022) - 2021
- [j24]Jichen Yang
, Hongji Wang, Rohan Kumar Das
, Yanmin Qian
:
Modified Magnitude-Phase Spectrum Information for Spoofing Detection. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1065-1078 (2021) - [j23]Yanmin Qian
, Zhengyang Chen
, Shuai Wang
:
Audio-Visual Deep Neural Network for Robust Person Verification. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1079-1092 (2021) - [c111]Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix
, Shinji Watanabe
, Yanmin Qian:
Dual-Path Modeling for Long Recording Speech Separation in Meetings. ICASSP 2021: 5739-5743 - [c110]Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification. ICASSP 2021: 5834-5838 - [c109]Chenpeng Du
, Bing Han, Shuai Wang, Yanmin Qian, Kai Yu:
SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification. ICASSP 2021: 5844-5848 - [c108]Houjun Huang, Xu Xiang
, Fei Zhao, Shuai Wang, Yanmin Qian:
Unit Selection Synthesis Based Data Augmentation for Fixed Phrase Speaker Verification. ICASSP 2021: 5849-5853 - [c107]Houjun Huang, Xu Xiang
, Yexin Yang, Rao Ma, Yanmin Qian:
AISpeech-SJTU Accent Identification System for the Accented English Speech Recognition Challenge. ICASSP 2021: 6254-6258 - [c106]Tian Tan, Yizhou Lu, Rao Ma, Sen Zhu, Jiaqi Guo, Yanmin Qian:
AISpeech-SJTU ASR System for the Accented English Speech Recognition Challenge. ICASSP 2021: 6413-6417 - [c105]Wei Wang, Zhikai Zhou, Yizhou Lu, Hongji Wang, Chenpeng Du
, Yanmin Qian:
Towards Data Selection on TTS Data for Children's Speech Recognition. ICASSP 2021: 6888-6892 - [c104]Wangyou Zhang, Christoph Böddeker, Shinji Watanabe
, Tomohiro Nakatani, Marc Delcroix
, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, Yanmin Qian:
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. ICASSP 2021: 6898-6902 - [c103]Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, Yanmin Qian, Lei Xie:
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods. ICASSP 2021: 6918-6922 - [c102]Christoph Böddeker, Wangyou Zhang, Tomohiro Nakatani, Keisuke Kinoshita, Tsubasa Ochiai, Marc Delcroix
, Naoyuki Kamo, Yanmin Qian, Reinhold Haeb-Umbach:
Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation. ICASSP 2021: 8428-8432 - [c101]Xun Gong, Yizhou Lu, Zhikai Zhou, Yanmin Qian:
Layer-Wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition. Interspeech 2021: 1274-1278 - [c100]Leying Zhang, Zhengyang Chen, Yanmin Qian:
Knowledge Distillation from Multi-Modality to Single-Modality for Person Verification. Interspeech 2021: 1897-1901 - [c99]Zhengxi Liu, Yanmin Qian:
Basis-MelGAN: Efficient Neural Vocoder Based on Audio Decomposition. Interspeech 2021: 2222-2226 - [c98]Bing Han, Zhengyang Chen, Zhikai Zhou, Yanmin Qian:
The SJTU System for Short-Duration Speaker Verification Challenge 2021. Interspeech 2021: 2332-2336 - [c97]Yifei Wu, Chenda Li, Song Yang, Zhongqin Wu, Yanmin Qian:
Audio-Visual Multi-Talker Speech Recognition in a Cocktail Party. Interspeech 2021: 3021-3025 - [c96]Xun Gong, Zhengyang Chen, Yexin Yang, Shuai Wang, Lan Wang, Yanmin Qian:
Speaker Embedding Augmentation with Noise Distribution Matching. ISCSLP 2021: 1-5 - [c95]Shuai Wang, Yexin Yang, Yanmin Qian, Kai Yu:
Revisiting the Statistics Pooling Layer in Deep Speaker Embedding Learning. ISCSLP 2021: 1-5 - [c94]Chenpeng Du
, Hao Li, Yizhou Lu, Lan Wang, Yanmin Qian:
Data Augmentation for end-to-end Code-Switching Speech Recognition. SLT 2021: 194-200 - [c93]Chenda Li, Yi Luo, Cong Han, Jinyu Li
, Takuya Yoshioka, Tianyan Zhou, Marc Delcroix
, Keisuke Kinoshita, Christoph Böddeker, Yanmin Qian, Shinji Watanabe
, Zhuo Chen:
Dual-Path RNN for Long Recording Speech Separation. SLT 2021: 865-872 - [c92]Wangyou Zhang, Jing Shi, Chenda Li, Shinji Watanabe
, Yanmin Qian:
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions. WASPAA 2021: 146-150 - [i28]Houjun Huang, Xu Xiang, Fei Zhao, Shuai Wang, Yanmin Qian:
Unit selection synthesis based data augmentation for fixed phrase speaker verification. CoRR abs/2102.09817 (2021) - [i27]Houjun Huang, Xu Xiang, Yexin Yang, Rao Ma, Yanmin Qian:
AISPEECH-SJTU accent identification system for the Accented English Speech Recognition Challenge. CoRR abs/2102.09828 (2021) - [i26]Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, Yanmin Qian, Lei Xie:
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods. CoRR abs/2102.10233 (2021) - [i25]Wangyou Zhang, Christoph Böddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, Yanmin Qian:
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. CoRR abs/2102.11525 (2021) - [i24]Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian:
Dual-Path Modeling for Long Recording Speech Separation in Meetings. CoRR abs/2102.11634 (2021) - [i23]Zhengxi Liu, Yanmin Qian:
Basis-MelGAN: Efficient Neural Vocoder Based on Audio Decomposition. CoRR abs/2106.13419 (2021) - [i22]Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification. CoRR abs/2108.13843 (2021) - [i21]Zhengyang Chen, Sanyuan Chen, Yu Wu, Yao Qian, Chengyi Wang, Shujie Liu, Yanmin Qian, Michael Zeng:
Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification. CoRR abs/2110.05777 (2021) - [i20]Wei Wang, Shuo Ren, Yao Qian, Shujie Liu, Yu Shi, Yanmin Qian, Michael Zeng:
Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding. CoRR abs/2110.12138 (2021) - [i19]Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Furu Wei:
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. CoRR abs/2110.13900 (2021) - [i18]Wangyou Zhang, Jing Shi, Chenda Li, Shinji Watanabe, Yanmin Qian:
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions. CoRR abs/2110.14139 (2021) - [i17]Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei:
Separating Long-Form Speech with Group-Wise Permutation Invariant Training. CoRR abs/2110.14142 (2021) - 2020
- [j22]Wangyou Zhang
, Xuankai Chang
, Yanmin Qian
, Shinji Watanabe
:
Improving End-to-End Single-Channel Multi-Talker Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1385-1394 (2020) - [j21]Shuai Wang
, Yexin Yang
, Zhanghao Wu
, Yanmin Qian
, Kai Yu
:
Data Augmentation Using Deep Generative Models for Embedding Based Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2598-2609 (2020) - [c91]Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe
:
End-To-End Multi-Speaker Speech Recognition With Transformer. ICASSP 2020: 6134-6138 - [c90]Yexin Yang, Shuai Wang, Xun Gong, Yanmin Qian, Kai Yu:
Text Adaptation for Speaker Verification with Speaker-Text Factorized Embeddings. ICASSP 2020: 6454-6458 - [c89]Zhengyang Chen, Shuai Wang, Yanmin Qian, Kai Yu:
Channel Invariant Speaker Embedding Learning with Joint Multi-Task and Adversarial Training. ICASSP 2020: 6574-6578 - [c88]Chenda Li, Yanmin Qian:
Deep Audio-Visual Speech Separation with Attention Mechanism. ICASSP 2020: 7314-7318 - [c87]Wangyou Zhang, Yanmin Qian:
Learning Contextual Language Embeddings for Monaural Multi-Talker Speech Recognition. INTERSPEECH 2020: 304-308 - [c86]Wangyou Zhang, Aswin Shanmugam Subramanian
, Xuankai Chang, Shinji Watanabe
, Yanmin Qian:
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming. INTERSPEECH 2020: 324-328 - [c85]Hongji Wang, Heinrich Dinkel, Shuai Wang, Yanmin Qian, Kai Yu:
Dual-Adversarial Domain Adaptation for Generalized Replay Attack Detection. INTERSPEECH 2020: 1086-1090 - [c84]Chenda Li, Yanmin Qian:
Listen, Watch and Understand at the Cocktail Party: Audio-Visual-Contextual Speech Separation. INTERSPEECH 2020: 1426-1430 - [c83]Zhengyang Chen, Shuai Wang, Yanmin Qian:
Multi-Modality Matters: A Performance Leap on VoxCeleb. INTERSPEECH 2020: 2252-2256 - [c82]Zhengyang Chen, Shuai Wang, Yanmin Qian:
Adversarial Domain Adaptation for Speaker Verification Using Partially Shared Network. INTERSPEECH 2020: 3017-3021 - [c81]Yizhou Lu, Mingkun Huang, Hao Li, Jiaqi Guo, Yanmin Qian:
Bi-Encoder Transformer Network for Mandarin-English Code-Switching Speech Recognition Using Mixture of Experts. INTERSPEECH 2020: 4766-4770 - [i16]Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe:
End-to-End Multi-speaker Speech Recognition with Transformer. CoRR abs/2002.03921 (2020) - [i15]Wangyou Zhang, Aswin Shanmugam Subramanian, Xuankai Chang, Shinji Watanabe, Yanmin Qian:
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming. CoRR abs/2005.10479 (2020) - [i14]Heinrich Dinkel, Nanxin Chen, Yanmin Qian, Kai Yu:
End-to-end spoofing detection with raw waveform CLDNNs. CoRR abs/2007.13060 (2020) - [i13]Qi Liu, Yanmin Qian, Kai Yu:
Future Vector Enhanced LSTM Language Model for LVCSR. CoRR abs/2008.01832 (2020) - [i12]Yefei Chen, Shuai Wang, Yanmin Qian, Kai Yu:
End-to-End Speaker-Dependent Voice Activity Detection. CoRR abs/2009.09906 (2020) - [i11]Chenpeng Du, Hao Li, Yizhou Lu, Lan Wang, Yanmin Qian:
Data Augmentation for End-to-end Code-switching Speech Recognition. CoRR abs/2011.02160 (2020) - [i10]Christoph Böddeker, Wangyou Zhang, Tomohiro Nakatani, Keisuke Kinoshita, Tsubasa Ochiai, Marc Delcroix, Naoyuki Kamo, Yanmin Qian, Shinji Watanabe, Reinhold Haeb-Umbach:
Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation. CoRR abs/2011.15003 (2020)
2010 – 2019
- 2019
- [j20]Yanmin Qian
, Chao Weng, Xuankai Chang, Shuai Wang, Dong Yu:
Erratum to: Past review, current progress, and challenges ahead on the cocktail party problem. Frontiers Inf. Technol. Electron. Eng. 20(3): 438 (2019) - [j19]Yanmin Qian
, Xu Xiang
:
Binary neural networks for speech recognition. Frontiers Inf. Technol. Electron. Eng. 20(5): 701-715 (2019) - [j18]Yanmin Qian, Hu Hu, Tian Tan:
Data augmentation using generative adversarial networks for robust speech recognition. Speech Commun. 114: 1-9 (2019) - [j17]Shuai Wang
, Zili Huang, Yanmin Qian
, Kai Yu
:
Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 27(11): 1686-1696 (2019) - [c80]Xu Xiang
, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu:
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition. APSIPA 2019: 1652-1656 - [c79]Peiyao Sheng, Zhuolin Yang, Yanmin Qian:
GANs for Children: A Generative Data Augmentation Strategy for Children Speech Recognition. ASRU 2019: 129-135 - [c78]