default search action

combined dblp search
author search
venue search
publication search

ask others

Yanmin Qian

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2026
[j41]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/MasuyamaCZCWOQW26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/MasuyamaCZCWOQW26
Yoshiki Masuyama, Xuankai Chang, Wangyou Zhang, Samuele Cornell, Zhong-Qiu Wang, Nobutaka Ono, Yanmin Qian, Shinji Watanabe:
An end-to-end integration of speech separation and recognition with self-supervised learning representation. Comput. Speech Lang. 95: 101813 (2026)
2025
[c213]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/0004GWZQ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/0004GWZQ25
Wen Huang, Yanmei Gu, Zhiming Wang, Huijia Zhu, Yanmin Qian:
SpeechFake: A Large-Scale Multilingual Speech Deepfake Dataset Incorporating Cutting-Edge Generation Methods. ACL (1) 2025: 9985-9998
[c212]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0004GWZQ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0004GWZQ25
Wen Huang, Yanmei Gu, Zhiming Wang, Huijia Zhu, Yanmin Qian:
Generalizable Audio Deepfake Detection via Latent Space Refinement and Augmentation. ICASSP 2025: 1-5
[c211]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0374ZQ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0374ZQ25
Wei Wang, Siyi Zhao, Yanmin Qian:
Advancing Non-intrusive Suppression on Enhancement Distortion for Noise Robust ASR. ICASSP 2025: 1-5
[c210]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenHWJQ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenHWJQ25
Zhengyang Chen, Bing Han, Shuai Wang, Yidi Jiang, Yanmin Qian:
Flow-TSVAD: Target-Speaker Voice Activity Detection via Latent Flow Matching for Speaker Diarization. ICASSP 2025: 1-5
[c209]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Gu0Q25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Gu0Q25
Tianteng Gu, Bei Liu, Yanmin Qian:
Efficient Pruning for Large-Scale Seq2Seq Speech Models without Back-Propagation. ICASSP 2025: 1-5
[c208]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HanHCJFLLL0Q25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HanHCJFLLL0Q25
Bing Han, Wen Huang, Zhengyang Chen, Anbai Jiang, Pingyi Fan, Cheng Lu, Zhiqiang Lv, Jia Liu, Wei-Qiang Zhang, Yanmin Qian:
Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning. ICASSP 2025: 1-5
[c207]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuCLZQZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuCLZQZ25
Haitian Lu, Gaofeng Cheng, Liuping Luo, Leying Zhang, Yanmin Qian, Pengyuan Zhang:
SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation. ICASSP 2025: 1-5
[c206]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangZCQ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangZCQ25
Leying Zhang, Wangyou Zhang, Zhengyang Chen, Yanmin Qian:
Advanced Zero-Shot Text-to-Speech for Background Removal and Preservation with Controllable Masked Speech Prediction. ICASSP 2025: 1-5
[i97]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-00805
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-00805
Haitian Lu, Gaofeng Cheng, Liuping Luo, Leying Zhang, Yanmin Qian, Pengyuan Zhang:
SLIDE: Integrating Speech Language Model with LLM for Spontaneous Spoken Dialogue Generation. CoRR abs/2501.00805 (2025)
[i96]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2501-14240
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-14240
Wen Huang, Yanmei Gu, Zhiming Wang, Huijia Zhu, Yanmin Qian:
Generalizable Audio Deepfake Detection via Latent Space Refinement and Augmentation. CoRR abs/2501.14240 (2025)
[i95]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-07345
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-07345
Leying Zhang, Wangyou Zhang, Zhengyang Chen, Yanmin Qian:
Advanced Zero-Shot Text-to-Speech for Background Removal and Preservation with Controllable Masked Speech Prediction. CoRR abs/2502.07345 (2025)
[i94]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-19179
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-19179
Xun Gong, Anqi Lv, Zhiming Wang, Huijia Zhu, Yanmin Qian:
BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM. CoRR abs/2505.19179 (2025)
[i93]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-19669
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-19669
Haiyang Sun, Shujie Hu, Shujie Liu, Lingwei Meng, Hui Wang, Bing Han, Yifan Yang, Yanqing Liu, Sheng Zhao, Yan Lu, Yanmin Qian:
Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling. CoRR abs/2505.19669 (2025)
[i92]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-23049
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-23049
Tianteng Gu, Bei Liu, Bo Xiao, Ke Zeng, Jiacheng Liu, Yanmin Qian:
DenoiseRotator: Enhance Pruning Robustness for LLMs via Importance Concentration. CoRR abs/2505.23049 (2025)
[i91]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-00885
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-00885
Leying Zhang, Yao Qian, Xiaofei Wang, Manthan Thakker, Dongmei Wang, Jianwei Yu, Haibin Wu, Yuxuan Hu, Jinyu Li, Yanmin Qian, Sheng Zhao:
CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching. CoRR abs/2506.00885 (2025)
[i90]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-01611
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-01611
Wangyou Zhang, Kohei Saijo, Samuele Cornell, Robin Scheibler, Chenda Li, Zhaoheng Ni, Anurag Kumar, Marvin Sach, Wei Wang, Yihui Fu, Shinji Watanabe, Tim Fingscheidt, Yanmin Qian:
Lessons Learned from the URGENT 2024 Speech Enhancement Challenge. CoRR abs/2506.01611 (2025)
[i89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-03722
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-03722
Yinfeng Xia, Huiyan Li, Chenyang Le, Manhong Wang, Yutao Sun, Xingyang Ma, Yanmin Qian:
MFLA: Monotonic Finite Look-ahead Attention for Streaming Speech Recognition. CoRR abs/2506.03722 (2025)
[i88]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-11532
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-11532
Wen Huang, Xuechen Liu, Xin Wang, Junichi Yamagishi, Yanmin Qian:
From Sharpness to Better Generalization for Speech Deepfake Detection. CoRR abs/2506.11532 (2025)
[i87]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-12260
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-12260
Wei Wang, Wangyou Zhang, Chenda Li, Jiatong Shi, Shinji Watanabe, Yanmin Qian:
Improving Speech Enhancement with Multi-Metric Supervision from Learned Quality Assessment. CoRR abs/2506.12260 (2025)
[i86]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-21555
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-21555
Jiahong Li, Yiwen Shao, Jianheng Zhuo, Chenda Li, Liliang Tang, Dong Yu, Yanmin Qian:
Efficient Multilingual ASR Finetuning via LoRA Language Experts. CoRR abs/2506.21555 (2025)
[i85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-23859
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-23859
Chenda Li, Wangyou Zhang, Wei Wang, Robin Scheibler, Kohei Saijo, Samuele Cornell, Yihui Fu, Marvin Sach, Zhaoheng Ni, Anurag Kumar, Tim Fingscheidt, Shinji Watanabe, Yanmin Qian:
Less is More: Data Curation Matters in Scaling Speech Enhancement. CoRR abs/2506.23859 (2025)
[i84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-23874
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-23874
Jiahe Wang, Chenda Li, Wei Wang, Wangyou Zhang, Samuele Cornell, Marvin Sach, Robin Scheibler, Kohei Saijo, Yihui Fu, Zhaoheng Ni, Anurag Kumar, Tim Fingscheidt, Shinji Watanabe, Yanmin Qian:
URGENT-PK: Perceptually-Aligned Ranking Model Designed for Speech Enhancement Competition. CoRR abs/2506.23874 (2025)
2024
[j40]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/WangCHWLZXDRSQL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/WangCHWLZXDRSQL24
Shuai Wang, Zhengyang Chen, Bing Han, Hongji Wang, Chengdong Liang, Binbin Zhang, Xu Xiang, Wen Ding, Johan Rohdin, Anna Silnova, Yanmin Qian, Haizhou Li:
Advancing speaker embedding learning: Wespeaker toolkit for research and production. Speech Commun. 162: 103104 (2024)
[j39]
- view
  authority control:
- export record
  dblp key:
  - journals/spm/ChangWDOZQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spm/ChangWDOZQ24
Xuankai Chang, Shinji Watanabe, Marc Delcroix, Tsubasa Ochiai, Wangyou Zhang, Yanmin Qian:
Module-Based End-to-End Distant Speech Processing: A case study of far-field automatic speech recognition [Special Issue On Model-Based and Data-Driven Audio Signal Processing]. IEEE Signal Process. Mag. 41(6): 39-50 (2024)
[j38]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/HanCQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HanCQ24
Bing Han, Zhengyang Chen, Yanmin Qian:
Self-Supervised Learning With Cluster-Aware-DINO for High-Performance Robust Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 529-541 (2024)
[j37]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangQ24a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangQ24a
Wei Wang, Yanmin Qian:
Universal Cross-Lingual Data Generation for Low Resource ASR. IEEE ACM Trans. Audio Speech Lang. Process. 32: 973-983 (2024)
[j36]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ChenHWQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ChenHWQ24
Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1636-1649 (2024)
[j35]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/GongWLLZCQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GongWLLZCQ24
Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian:
Advanced Long-Content Speech Recognition With Factorized Neural Transducer. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1803-1815 (2024)
[j34]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiLWQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiLWQ24
Jiahong Li, Chenda Li, Yifei Wu, Yanmin Qian:
Unified Cross-Modal Attention: Robust Audio-Visual Speech Recognition and Beyond. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1941-1953 (2024)
[j33]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiuWQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiuWQ24
Bei Liu, Haoyu Wang, Yanmin Qian:
Towards Lightweight Speaker Verification via Adaptive Neural Network Quantization. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3771-3784 (2024)
[j32]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangCLQL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangCLQL24
Shuai Wang, Zhengyang Chen, Kong Aik Lee, Yanmin Qian, Haizhou Li:
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4971-4998 (2024)
[c205]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HanLJHCDD00F0Q24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HanLJHCDD00F0Q24
Bing Han, Zhiqiang Lv, Anbai Jiang, Wen Huang, Zhengyang Chen, Yufeng Deng, Jiawei Ding, Cheng Lu, Wei-Qiang Zhang, Pingyi Fan, Jia Liu, Yanmin Qian:
Exploring Large Scale Pre-Trained Models for Robust Machine Anomalous Sound Detection. ICASSP 2024: 1326-1330
[c204]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangJQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangJQ24
Wangyou Zhang, Jee-weon Jung, Yanmin Qian:
Improving Design of Input Condition Invariant Speech Enhancement. ICASSP 2024: 10696-10700
[c203]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangBLYCHQ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangBLYCHQ024
Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li:
Leveraging in-the-wild Data for Effective Self-supervised Pretraining in Speaker Recognition. ICASSP 2024: 10901-10905
[c202]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/JiangCTDQ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/JiangCTDQ024
Yidi Jiang, Zhengyang Chen, Ruijie Tao, Liqun Deng, Yanmin Qian, Haizhou Li:
Prompt-Driven Target Speech Diarization. ICASSP 2024: 11086-11090
[c201]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShaoLQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShaoLQ24
Hang Shao, Bei Liu, Yanmin Qian:
One-Shot Sensitivity-Aware Mixed Sparsity Pruning for Large Language Models. ICASSP 2024: 11296-11300
[c200]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangHWCQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangHWCQ24
Wen Huang, Bing Han, Shuai Wang, Zhengyang Chen, Yanmin Qian:
Robust Cross-Domain Speaker Verification with Multi-Level Domain Adapters. ICASSP 2024: 11781-11785
[c199]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuZDZLQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuZDZLQ24
Linfeng Yu, Wangyou Zhang, Chenpeng Du, Leying Zhang, Zheng Liang, Yanmin Qian:
Generation-Based Target Speech Extraction with Speech Discretization and Vocoder. ICASSP 2024: 12612-12616
[c198]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/HuangJHZQCLFZLCLQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/HuangJHZQCLFZLCLQ24
Wen Huang, Anbai Jiang, Bing Han, Xinhu Zheng, Yihong Qiu, Wenxi Chen, Yuzhe Liang, Pingyi Fan, Wei-Qiang Zhang, Cheng Lu, Xie Chen, Jia Liu, Yanmin Qian:
Semi-Supervised Acoustic Scene Classification with Test-Time Adaptation. ICME Workshops 2024: 1-5
[c197]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/LiangCJQZHHQFZCLC24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/LiangCJQZHHQFZCLC24
Yuzhe Liang, Wenxi Chen, Anbai Jiang, Yihong Qiu, Xinhu Zheng, Wen Huang, Bing Han, Yanmin Qian, Pingyi Fan, Wei-Qiang Zhang, L. Cheng, Jia Liu, Xie Chen:
Improving Acoustic Scene Classification via Self-Supervised and Semi-Supervised Learning with Efficient Audio Transformer. ICME Workshops 2024: 1-6
[c196]
- view
  - electronic edition @ ijcai.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/ijcai/HanDHHGC0QS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/HanDHHGC0QS24
Bing Han, Junyu Dai, Weituo Hao, Xinyan He, Dong Guo, Jitong Chen, Yuxuan Wang, Yanmin Qian, Xuchen Song:
InstructME: An Instruction Guided Music Edit Framework with Latent Diffusion Models. IJCAI 2024: 5835-5843
[c195]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0005LWQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0005LWQ24
Xun Gong, Anqi Lv, Zhiming Wang, Yanmin Qian:
Contextual Biasing Speech Recognition in Speech-enhanced Large Language Model. INTERSPEECH 2024
[c194]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenLCYQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenLCYQ24
Zhengyang Chen, Xuechen Liu, Erica Cooper, Junichi Yamagishi, Yanmin Qian:
Generating Speakers by Prompting Listener Impressions for Pre-trained Multi-Speaker Text-to-Speech Systems. INTERSPEECH 2024
[c193]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuL0Q24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuL0Q24
Tianteng Gu, Bei Liu, Hang Shao, Yanmin Qian:
SparseWAV: Fast and Accurate One-Shot Unstructured Pruning for Large Speech Foundation Models. INTERSPEECH 2024
[c192]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JiangHLDZ0Q0F24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiangHLDZ0Q0F24
Anbai Jiang, Bing Han, Zhiqiang Lv, Yufeng Deng, Wei-Qiang Zhang, Xie Chen, Yanmin Qian, Jia Liu, Pingyi Fan:
AnoPatch: Towards Better Consistency in Machine Anomalous Sound Detection. INTERSPEECH 2024
[c191]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangZLLWG0Q024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangZLLWG0Q024
Shuai Wang, Ke Zhang, Shaoxiong Lin, Junjie Li, Xuefei Wang, Meng Ge, Jianwei Yu, Yanmin Qian, Haizhou Li:
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction. INTERSPEECH 2024
[c190]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangSJL0Q24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangSJL0Q24
Wangyou Zhang, Kohei Saijo, Jee-weon Jung, Chenda Li, Shinji Watanabe, Yanmin Qian:
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement. INTERSPEECH 2024
[c189]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangSSCLNPS0FQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangSSCLNPS0FQ24
Wangyou Zhang, Robin Scheibler, Kohei Saijo, Samuele Cornell, Chenda Li, Zhaoheng Ni, Jan Pirklbauer, Marvin Sach, Shinji Watanabe, Tim Fingscheidt, Yanmin Qian:
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement. INTERSPEECH 2024
[c188]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ZhouZQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZhouZQ24
Tingxiao Zhou, Leying Zhang, Yanmin Qian:
Knowledge Distillation from Discriminative Model to Generative Model with Parallel Architecture for Speech Enhancement. ISCSLP 2024: 179-183
[c187]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/HuangHCWQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/HuangHCWQ24
Wen Huang, Bing Han, Zhengyang Chen, Shuai Wang, Yanmin Qian:
Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification. ISCSLP 2024: 383-387
[c186]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ZhouZLQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZhouZLQ24
Xin Zhou, Wangyou Zhang, Chenda Li, Yanmin Qian:
Insights from Hyperparameter Scaling of Online Speech Separation. ISCSLP 2024: 561-565
[c185]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/Zhao0Q24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/Zhao0Q24
Siyi Zhao, Wei Wang, Yanmin Qian:
Band-Wise Front-End Distortion Suppression for Robust Speech Recognition. ISCSLP 2024: 681-685
[c184]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ChenWHQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ChenWHQ24
Zhengyang Chen, Shuai Wang, Bing Han, Yanmin Qian:
Combining Self-Supervised Learning and Adversarial Training Based Domain Adaptation for Speaker Verification. ISCSLP 2024: 701-705
[c183]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/Hou0Q24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/Hou0Q24
Haoxiang Hou, Xun Gong, Yanmin Qian:
ConMamba: A Convolution-Augmented Mamba Encoder Model for Efficient End-to-End ASR Systems. ISCSLP 2024: 711-715
[c182]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LeQWZ00YQ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LeQWZ00YQ0024
Chenyang Le, Yao Qian, Dongmei Wang, Long Zhou, Shujie Liu, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Sheng Zhao, Michael Zeng:
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation. NeurIPS 2024
[c181]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZhangQZ0WWYQ00Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhangQZ0WWYQ00Z24
Leying Zhang, Yao Qian, Long Zhou, Shujie Liu, Dongmei Wang, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Lei He, Sheng Zhao, Michael Zeng:
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations. NeurIPS 2024
[c180]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ShaoLWGQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ShaoLWGQ24
Hang Shao, Bei Liu, Wei Wang, Xun Gong, Yanmin Qian:
DQ-Whisper: Joint Distillation and Quantization for Efficient Multilingual Speech Recognition. SLT 2024: 240-246
[c179]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ZhangQYWYLZQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ZhangQYWYLZQ24
Leying Zhang, Yao Qian, Linfeng Yu, Heming Wang, Hemin Yang, Shujie Liu, Long Zhou, Yanmin Qian:
DDTSE: Discriminative Diffusion Model for Target Speech Extraction. SLT 2024: 294-301
[c178]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/LiCWQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/LiCWQ24
Chenda Li, Samuele Cornell, Shinji Watanabe, Yanmin Qian:
Diffusion-Based Generative Modeling With Discriminative Guidance for Streamable Speech Enhancement. SLT 2024: 333-340
[c177]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/WangWLZQL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/WangWLZQL24
Jiahe Wang, Shuai Wang, Junjie Li, Ke Zhang, Yanmin Qian, Haizhou Li:
Enhancing Speaker Extraction Through Rectifying Target Confusion. SLT 2024: 349-356
[c176]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ChenWZLYQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ChenWZLYQ24
Zhengyang Chen, Shuai Wang, Mingyang Zhang, Xuechen Liu, Junichi Yamagishi, Yanmin Qian:
Disentangling The Prosody And Semantic Information With Pre-Trained Model For In-Context Learning Based Zero-Shot Voice Conversion. SLT 2024: 698-704
[c175]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ZhengJHQFLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ZhengJHQFLZ24
Xinhu Zheng, Anbai Jiang, Bing Han, Yanmin Qian, Pingyi Fan, Jia Liu, Wei-Qiang Zhang:
Improving Anomalous Sound Detection Via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models. SLT 2024: 969-974
[e1]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/2024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/2024
Yanmin Qian, Qin Jin, Zhijian Ou, Zhenhua Ling, Zhiyong Wu, Ya Li, Lei Xie, Jianhua Tao:
14th IEEE International Symposium on Chinese Spoken Language Processing, ISCSLP 2024, Beijing, China, November 7-10, 2024. IEEE 2024, ISBN 979-8-3315-1682-6 [contents]
[i83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-14271
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-14271
Wangyou Zhang, Jee-weon Jung, Shinji Watanabe, Yanmin Qian:
Improving Design of Input Condition Invariant Speech Enhancement. CoRR abs/2401.14271 (2024)
[i82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-13423
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-13423
Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian:
Advanced Long-Content Speech Recognition With Factorized Neural Transducer. CoRR abs/2403.13423 (2024)
[i81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-06690
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-06690
Leying Zhang, Yao Qian, Long Zhou, Shujie Liu, Dongmei Wang, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Lei He, Sheng Zhao, Michael Zeng:
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations. CoRR abs/2404.06690 (2024)
[i80]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-19040
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-19040
Bo Chen, Shoukang Hu, Qi Chen, Chenpeng Du, Ran Yi, Yanmin Qian, Xie Chen:
GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting. CoRR abs/2404.19040 (2024)
[i79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-17233
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17233
Haoyu Wang, Bei Liu, Hang Shao, Bo Xiao, Ke Zeng, Guanglu Wan, Yanmin Qian:
CLAQ: Pushing the Limits of Low-Bit Post-Training Quantization for LLMs. CoRR abs/2405.17233 (2024)
[i78]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-17809
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17809
Chenyang Le, Yao Qian, Dongmei Wang, Long Zhou, Shujie Liu, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Sheng Zhao, Michael Zeng:
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation. CoRR abs/2405.17809 (2024)
[i77]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-04269
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-04269
Wangyou Zhang, Kohei Saijo, Jee-weon Jung, Chenda Li, Shinji Watanabe, Yanmin Qian:
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement. CoRR abs/2406.04269 (2024)
[i76]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-04660
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-04660
Wangyou Zhang, Robin Scheibler, Kohei Saijo, Samuele Cornell, Chenda Li, Zhaoheng Ni, Anurag Kumar, Jan Pirklbauer, Marvin Sach, Shinji Watanabe, Tim Fingscheidt, Yanmin Qian:
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement. CoRR abs/2406.04660 (2024)
[i75]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-05359
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-05359
Bei Liu, Haoyu Wang, Yanmin Qian:
Towards Lightweight Speaker Verification via Adaptive Neural Network Quantization. CoRR abs/2406.05359 (2024)
[i74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07198
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07198
Yidi Jiang, Ruijie Tao, Zhengyang Chen, Yanmin Qian, Haizhou Li:
Target Speech Diarization with Multimodal Prompts. CoRR abs/2406.07198 (2024)
[i73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-08812
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-08812
Zhengyang Chen, Xuechen Liu, Erica Cooper, Junichi Yamagishi, Yanmin Qian:
Generating Speakers by Prompting Listener Impressions for Pre-trained Multi-Speaker Text-to-Speech Systems. CoRR abs/2406.08812 (2024)
[i72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-11364
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-11364
Anbai Jiang, Bing Han, Zhiqiang Lv, Yufeng Deng, Wei-Qiang Zhang, Xie Chen, Yanmin Qian, Jia Liu, Pingyi Fan:
AnoPatch: Towards Better Consistency in Machine Anomalous Sound Detection. CoRR abs/2406.11364 (2024)
[i71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-13471
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-13471
Chenda Li, Samuele Cornell, Shinji Watanabe, Yanmin Qian:
Diffusion-based Generative Modeling with Discriminative Guidance for Streamable Speech Enhancement. CoRR abs/2406.13471 (2024)
[i70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-15188
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-15188
Shuai Wang, Zhengyang Chen, Kong Aik Lee, Yanmin Qian, Haizhou Li:
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning. CoRR abs/2407.15188 (2024)
[i69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-04859
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-04859
Zhengyang Chen, Bing Han, Shuai Wang, Yidi Jiang, Yanmin Qian:
Flow-TSVAD: Target-Speaker Voice Activity Detection via Latent Flow Matching. CoRR abs/2409.04859 (2024)
[i68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-05004
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-05004
Zhengyang Chen, Shuai Wang, Mingyang Zhang, Xuechen Liu, Junichi Yamagishi, Yanmin Qian:
Disentangling the Prosody and Semantic Information with Pre-trained Model for In-Context Learning based Zero-Shot Voice Conversion. CoRR abs/2409.05004 (2024)
[i67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-07016
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-07016
Xinhu Zheng, Anbai Jiang, Bing Han, Yanmin Qian, Pingyi Fan, Jia Liu, Wei-Qiang Zhang:
Improving Anomalous Sound Detection via Low-Rank Adaptation Fine-Tuning of Pre-Trained Audio Models. CoRR abs/2409.07016 (2024)
[i66]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-15799
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-15799
Shuai Wang, Ke Zhang, Shaoxiong Lin, Junjie Li, Xuefei Wang, Meng Ge, Jianwei Yu, Yanmin Qian, Haizhou Li:
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction. CoRR abs/2409.15799 (2024)
[i65]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-17033
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-17033
Wen Huang, Bing Han, Zhengyang Chen, Shuai Wang, Yanmin Qian:
Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification. CoRR abs/2410.17033 (2024)
[i64]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-20775
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-20775
Bing Han, Wen Huang, Zhengyang Chen, Anbai Jiang, Pingyi Fan, Cheng Lu, Zhiqiang Lv, Jia Liu, Wei-Qiang Zhang, Yanmin Qian:
Data-Efficient Low-Complexity Acoustic Scene Classification via Distilling and Progressive Pruning. CoRR abs/2410.20775 (2024)
[i63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-01195
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-01195
Bei Liu, Yanmin Qian:
Memory-Efficient Training for Deep Speaker Embedding Learning in Speaker Verification. CoRR abs/2412.01195 (2024)
[i62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-14890
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-14890
Leying Zhang, Wangyou Zhang, Chenda Li, Yanmin Qian:
Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling. CoRR abs/2412.14890 (2024)
2023
[j31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/jossw/LuCLZCNMYSWTQW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jossw/LuCLZCNMYSWTQW23
Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing. J. Open Source Softw. 8(91): 5403 (2023)
[j30]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiuCQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiuCQ23
Bei Liu, Zhengyang Chen, Yanmin Qian:
Depth-First Neural Architecture With Attentive Feature Fusion for Efficient Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1825-1838 (2023)
[c174]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ChenGQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ChenGQ23
Chang Chen, Xun Gong, Yanmin Qian:
Efficient Text-Only Domain Adaptation For CTC-Based ASR. ASRU 2023: 1-7
[c173]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiangSYLZDCXQWCLYB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiangSYLZDCXQWCLYB23
Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu:
The Second Multi-Channel Multi-Party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark for Speaker-Attributed ASR. ASRU 2023: 1-8
[c172]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LinZQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LinZQ23
Shaoxiong Lin, Chao Zhang, Yanmin Qian:
Improving Speech Enhancement Using Audio Tagging Knowledge From Pre-Trained Representations and Multi-Task Learning. ASRU 2023: 1-7
[c171]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/YangWQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/YangWQ23
Dongning Yang, Wei Wang, Yanmin Qian:
FAT-HuBERT: Front-End Adaptive Training of Hidden-Unit BERT For Distortion-Invariant Robust Speech Recognition. ASRU 2023: 1-8
[c170]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ZhangSWWQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ZhangSWWQ23
Wangyou Zhang, Kohei Saijo, Zhong-Qiu Wang, Shinji Watanabe, Yanmin Qian:
Toward Universal Speech Enhancement For Diverse Input Conditions. ASRU 2023: 1-6
[c169]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ZhangYQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ZhangYQ23
Wangyou Zhang, Lei Yang, Yanmin Qian:
Exploring Time-Frequency Domain Target Speaker Extraction For Causal and Non-Causal Processing. ASRU 2023: 1-6
[c168]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GongWLLZCQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GongWLLZCQ23
Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian:
LongFNT: Long-Form Speech Recognition with Factorized Neural Transducer. ICASSP 2023: 1-5
[c167]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GongWSCQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GongWSCQ23
Xun Gong, Wei Wang, Hang Shao, Xie Chen, Yanmin Qian:
Factorized AED: Factorized Attention-Based Encoder-Decoder for Text-Only Domain Adaptive ASR. ICASSP 2023: 1-5
[c166]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HanCQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HanCQ23
Bing Han, Zhengyang Chen, Yanmin Qian:
Exploring Binary Classification Loss for Speaker Verification. ICASSP 2023: 1-5
[c165]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HanHCQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HanHCQ23
Bing Han, Wen Huang, Zhengyang Chen, Yanmin Qian:
Improving Dino-Based Self-Supervised Speaker Verification with Progressive Cluster-Aware Training. ICASSP Workshops 2023: 1-5
[c164]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiLWQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiLWQ23
Jiahong Li, Chenda Li, Yifei Wu, Yanmin Qian:
Robust Audio-Visual ASR with Unified Cross-Modal Attention. ICASSP 2023: 1-5
[c163]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiQCWYLQZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiQCWYLQZ23
Chenda Li, Yao Qian, Zhuo Chen, Dongmei Wang, Takuya Yoshioka, Shujie Liu, Yanmin Qian, Michael Zeng:
Target Sound Extraction with Variable Cross-Modality Clues. ICASSP 2023: 1-5
[c162]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiWQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiWQ23
Chenda Li, Yifei Wu, Yanmin Qian:
Predictive Skim: Contrastive Predictive Coding for Low-Latency Online Speech Separation. ICASSP 2023: 1-5
[c161]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuCQY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuCQY23
Tao Liu, Zhengyang Chen, Yanmin Qian, Kai Yu:
Multi-Speaker End-to-End Multi-Modal Speaker Diarization System for the MISP 2022 Challenge. ICASSP 2023: 1-2
[c160]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShaoTWGQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShaoTWGQ23
Hang Shao, Tian Tan, Wei Wang, Xun Gong, Yanmin Qian:
Joint Discriminator and Transfer Based Fast Domain Adaptation For End-To-End Speech Recognition. ICASSP 2023: 1-5
[c159]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangLWCQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangLWCQ23
Haoyu Wang, Bei Liu, Yifei Wu, Zhengyang Chen, Yanmin Qian:
Lowbit Neural Network Quantization for Speaker Verification. ICASSP Workshops 2023: 1-5
[c158]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangLWCZXDQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangLWCZXDQ23
Hongji Wang, Chengdong Liang, Shuai Wang, Zhengyang Chen, Binbin Zhang, Xu Xiang, Yanlei Deng, Yanmin Qian:
Wespeaker: A Research and Production Oriented Speaker Embedding Learning Toolkit. ICASSP 2023: 1-5
[c157]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangQ23
Wei Wang, Yanmin Qian:
HuBERT-AGG: Aggregated Representation Distillation of Hidden-Unit Bert for Robust Speech Recognition. ICASSP 2023: 1-5
[c156]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuLQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuLQ23
Yifei Wu, Chenda Li, Yanmin Qian:
Light-Weight Visualvoice: Neural Network Quantization On Audio Visual Speech Separation. ICASSP Workshops 2023: 1-5
[c155]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuHQJLLSQLZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuHQJLLSQLZ23
Haibin Yu, Yuxuan Hu, Yao Qian, Ma Jin, Linquan Liu, Shujie Liu, Yu Shi, Yanmin Qian, Edward Lin, Michael Zeng:
Code-Switching Text Generation and Injection in Mandarin-English ASR. ICASSP 2023: 1-5
[c154]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangCQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangCQ23
Leying Zhang, Zhengyang Chen, Yanmin Qian:
Adaptive Large Margin Fine-Tuning For Robust Speaker Verification. ICASSP 2023: 1-5
[c153]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiQ0KWYQ023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiQ0KWYQ023
Chenda Li, Yao Qian, Zhuo Chen, Naoyuki Kanda, Dongmei Wang, Takuya Yoshioka, Yanmin Qian, Michael Zeng:
Adapting Multi-Lingual ASR Models for Handling Multiple Talkers. INTERSPEECH 2023: 1314-1318
[c152]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuWQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuWQ23
Bei Liu, Haoyu Wang, Yanmin Qian:
Extremely Low Bit Quantization for Mobile Speaker Verification Systems Under 1MB Memory. INTERSPEECH 2023: 1973-1977
[c151]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangWQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangWQ23
Zhilong Zhang, Wei Wang, Yanmin Qian:
Fast and Efficient Multilingual Self-Supervised Pre-training for Low-Resource Speech Recognition. INTERSPEECH 2023: 2248-2252
[c150]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangQ23
Wei Wang, Yanmin Qian:
UniSplice: Universal Cross-Lingual Data Splicing for Low-Resource ASR. INTERSPEECH 2023: 2253-2257
[c149]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuQ23
Bei Liu, Yanmin Qian:
Reversible Neural Networks for Memory-Efficient Speaker Verification. INTERSPEECH 2023: 3127-3131
[c148]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuQ23a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuQ23a
Bei Liu, Yanmin Qian:
ECAPA++: Fine-grained Deep Embedding Learning for TDNN Based Speaker Verification. INTERSPEECH 2023: 3132-3136
[c147]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenHXHLQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenHXHLQ23
Zhengyang Chen, Bing Han, Xu Xiang, Houjun Huang, Bei Liu, Yanmin Qian:
Build a SRE Challenge System: Lessons from VoxSRC 2022 and CNSRC 2022. INTERSPEECH 2023: 3202-3206
[c146]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Wang0SYQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Wang0SYQ23
Wei Wang, Xun Gong, Hang Shao, Dongning Yang, Yanmin Qian:
Text Only Domain Adaptation with Phoneme Guided Data Splicing for End-to-End Speech Recognition. INTERSPEECH 2023: 3347-3351
[c145]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuZLQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuZLQ23
Linfeng Yu, Wangyou Zhang, Chenda Li, Yanmin Qian:
Overlap Aware Continuous Speech Separation without Permutation Invariant Training. INTERSPEECH 2023: 3512-3516
[c144]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangQ23
Wangyou Zhang, Yanmin Qian:
Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition. INTERSPEECH 2023: 3517-3521
[c143]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenHWQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenHWQ23
Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor. INTERSPEECH 2023: 3552-3556
[c142]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLWQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLWQ23
Haoyu Wang, Bei Liu, Yifei Wu, Yanmin Qian:
Adaptive Neural Network Quantization For Lightweight Speaker Verification. INTERSPEECH 2023: 5331-5335
[c141]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LeQZLQ0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LeQZLQ0023
Chenyang Le, Yao Qian, Long Zhou, Shujie Liu, Yanmin Qian, Michael Zeng, Xuedong Huang:
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation. NeurIPS 2023
[c140]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/MasuyamaCZCWOQW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/MasuyamaCZCWOQW23
Yoshiki Masuyama, Xuankai Chang, Wangyou Zhang, Samuele Cornell, Zhong-Qiu Wang, Nobutaka Ono, Yanmin Qian, Shinji Watanabe:
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation. WASPAA 2023: 1-5
[d1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - data/10/LuCLZCNMYSWTQW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/data/10/LuCLZCNMYSWTQW23
Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing (espnet-v.202310). Zenodo, 2023
[i61]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-08372
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-08372
Chenda Li, Yao Qian, Zhuo Chen, Dongmei Wang, Takuya Yoshioka, Shujie Liu, Yanmin Qian, Michael Zeng:
Target Sound Extraction with Variable Cross-modality Clues. CoRR abs/2303.08372 (2023)
[i60]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-10949
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-10949
Haibin Yu, Yuxuan Hu, Yao Qian, Ma Jin, Linquan Liu, Shujie Liu, Yu Shi, Yanmin Qian, Edward Lin, Michael Zeng:
Code-Switching Text Generation and Injection in Mandarin-English ASR. CoRR abs/2303.10949 (2023)
[i59]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-05754
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-05754
Bing Han, Zhengyang Chen, Yanmin Qian:
Self-Supervised Learning with Cluster-Aware-DINO for High-Performance Robust Speaker Verification. CoRR abs/2304.05754 (2023)
[i58]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-10704
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-10704
Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor. CoRR abs/2305.10704 (2023)
[i57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-10788
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-10788
Hang Shao, Wei Wang, Bei Liu, Xun Gong, Haoyu Wang, Yanmin Qian:
Whisper-KDQ: A Lightweight Whisper via Guided Knowledge Distillation and Quantization for Efficient ASR. CoRR abs/2305.10788 (2023)
[i56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-16286
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-16286
Wangyou Zhang, Yanmin Qian:
Weakly-Supervised Speech Pre-training: A Case Study on Target Speech Recognition. CoRR abs/2305.16286 (2023)
[i55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-18747
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18747
Chenda Li, Yao Qian, Zhuo Chen, Naoyuki Kanda, Dongmei Wang, Takuya Yoshioka, Yanmin Qian, Michael Zeng:
Adapting Multi-Lingual ASR Models for Handling Multiple Talkers. CoRR abs/2305.18747 (2023)
[i54]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-08205
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-08205
Bing Han, Zhengyang Chen, Yanmin Qian:
Exploring Binary Classification Loss For Speaker Verification. CoRR abs/2307.08205 (2023)
[i53]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-12231
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-12231
Yoshiki Masuyama, Xuankai Chang, Wangyou Zhang, Samuele Cornell, Zhong-Qiu Wang, Nobutaka Ono, Yanmin Qian, Shinji Watanabe:
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation. CoRR abs/2307.12231 (2023)
[i52]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-14360
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-14360
Bing Han, Junyu Dai, Xuchen Song, Weituo Hao, Xinyan He, Dong Guo, Jitong Chen, Yuxuan Wang, Yanmin Qian:
InstructME: An Instruction Guided Music Edit And Remix Framework with Latent Diffusion Models. CoRR abs/2308.14360 (2023)
[i51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-06672
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-06672
Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer. CoRR abs/2309.06672 (2023)
[i50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-10674
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-10674
Junyi Ao, Mehmet Sinan Yildirim, Meng Ge, Shuai Wang, Ruijie Tao, Yanmin Qian, Liqun Deng, Longshuai Xiao, Haizhou Li:
USED: Universal Speaker Extraction and Diarization. CoRR abs/2309.10674 (2023)
[i49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-11730
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-11730
Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li:
Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition. CoRR abs/2309.11730 (2023)
[i48]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-13573
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-13573
Yuhao Liang, Mohan Shi, Fan Yu, Yangze Li, Shiliang Zhang, Zhihao Du, Qian Chen, Lei Xie, Yanmin Qian, Jian Wu, Zhuo Chen, Kong Aik Lee, Zhijie Yan, Hui Bu:
The second multi-channel multi-party meeting transcription challenge (M2MeT) 2.0): A benchmark for speaker-attributed ASR. CoRR abs/2309.13573 (2023)
[i47]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-13874
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-13874
Leying Zhang, Yao Qian, Linfeng Yu, Heming Wang, Xinkai Wang, Hemin Yang, Long Zhou, Shujie Liu, Yanmin Qian, Michael Zeng:
Diffusion Conditional Expectation Model for Efficient and Robust Target Speech Extraction. CoRR abs/2309.13874 (2023)
[i46]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-17384
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-17384
Wangyou Zhang, Kohei Saijo, Zhong-Qiu Wang, Shinji Watanabe, Yanmin Qian:
Toward Universal Speech Enhancement for Diverse Input Conditions. CoRR abs/2309.17384 (2023)
[i45]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-09499
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-09499
Hang Shao, Bei Liu, Yanmin Qian:
One-Shot Sensitivity-Aware Mixed Sparsity Pruning for Large Language Models. CoRR abs/2310.09499 (2023)
[i44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-17790
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-17790
Dongning Yang, Wei Wang, Yanmin Qian:
FAT-HuBERT: Front-end Adaptive Training of Hidden-unit BERT for Distortion-Invariant Robust Speech Recognition. CoRR abs/2311.17790 (2023)
2022
[j29]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/ChenWCWLCLKYXWZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/ChenWCWLCLKYXWZ22
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Xiangzhan Yu, Furu Wei:
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. IEEE J. Sel. Top. Signal Process. 16(6): 1505-1518 (2022)
[j28]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/QianZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/QianZ22
Yanmin Qian, Zhikai Zhou:
Optimizing Data Usage for Low-Resource Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 30: 394-403 (2022)
[j27]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiCQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiCQ22
Chenda Li, Zhuo Chen, Yanmin Qian:
Dual-Path Modeling With Memory Embedding Model for Continuous Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1508-1520 (2022)
[j26]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/QianGH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/QianGH22
Yanmin Qian, Xun Gong, Houjun Huang:
Layer-Wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2842-2853 (2022)
[j25]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangCBNWQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangCBNWQ22
Wangyou Zhang, Xuankai Chang, Christoph Böddeker, Tomohiro Nakatani, Shinji Watanabe, Yanmin Qian:
End-to-End Dereverberation, Beamforming, and Speech Recognition in a Cocktail Party. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3173-3188 (2022)
[c139]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuLBWQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuLBWQ22
Yifei Wu, Chenda Li, Jinfeng Bai, Zhongqin Wu, Yanmin Qian:
Time-Domain Audio-Visual Speech Separation on Low Quality Videos. ICASSP 2022: 256-260
[c138]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiYWQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiYWQ22
Chenda Li, Lei Yang, Weiqin Wang, Yanmin Qian:
Skim: Skipping Memory Lstm for Low-Latency Real-Time Continuous Speech Separation. ICASSP 2022: 681-685
[c137]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenCWQWLQZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenCWQWLQZ22
Zhengyang Chen, Sanyuan Chen, Yu Wu, Yao Qian, Chengyi Wang, Shujie Liu, Yanmin Qian, Michael Zeng:
Large-Scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification. ICASSP 2022: 6147-6151
[c136]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HanCQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HanCQ22
Bing Han, Zhengyang Chen, Yanmin Qian:
Local Information Modeling with Self-Attention for Speaker Verification. ICASSP 2022: 6727-6731
[c135]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouTQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouTQ22
Zhikai Zhou, Tian Tan, Yanmin Qian:
Punctuation Prediction for Streaming On-Device Speech Recognition. ICASSP 2022: 7277-7281
[c134]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HanCLQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HanCLQ22
Bing Han, Zhengyang Chen, Bei Liu, Yanmin Qian:
MLP-SVNET: A Multi-Layer Perceptrons Based Network for Speaker Verification. ICASSP 2022: 7522-7526
[c133]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuWCWQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuWCWQ22
Bei Liu, Haoyu Wang, Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Knowledge Distillation via Feature Enhancement for Speaker Verification. ICASSP 2022: 7542-7546
[c132]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangRQLSQZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangRQLSQZ22
Wei Wang, Shuo Ren, Yao Qian, Shujie Liu, Yu Shi, Yanmin Qian, Michael Zeng:
Optimizing Alignment of Speech and Language Latent Spaces for End-To-End Speech Recognition and Understanding. ICASSP 2022: 7802-7806
[c131]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouWZQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouWZQ22
Zhikai Zhou, Wei Wang, Wangyou Zhang, Yanmin Qian:
Exploring Effective Data Utilization for Low-Resource Speech Recognition. ICASSP 2022: 8192-8196
[c130]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuZGFDZHXTWQLYM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuZGFDZHXTWQLYM22
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary on the ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. ICASSP 2022: 9156-9160
[c129]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangGWZLZHQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangGWZLZHQ22
Wei Wang, Xun Gong, Yifei Wu, Zhikai Zhou, Chenda Li, Wangyou Zhang, Bing Han, Yanmin Qian:
The Sjtu System For Multimodal Information Based Speech Processing Challenge 2021. ICASSP 2022: 9261-9265
[c128]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuCQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuCQ22
Bei Liu, Zhengyang Chen, Yanmin Qian:
Attentive Feature Fusion for Robust Speaker Verification. INTERSPEECH 2022: 286-290
[c127]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuCQ22a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuCQ22a
Bei Liu, Zhengyang Chen, Yanmin Qian:
Dual Path Embedding Learning for Speaker Verification with Triplet Attention. INTERSPEECH 2022: 291-295
[c126]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuCWWHQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuCWWHQ22
Bei Liu, Zhengyang Chen, Shuai Wang, Haoyu Wang, Bing Han, Yanmin Qian:
DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design. INTERSPEECH 2022: 296-300
[c125]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangCQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangCQ22
Leying Zhang, Zhengyang Chen, Yanmin Qian:
Enroll-Aware Attentive Statistics Pooling for Target Speaker Verification. INTERSPEECH 2022: 311-315
[c124]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Liu0XSLSHCYLWQ022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Liu0XSLSHCYLWQ022
Tao Liu, Shuai Fan, Xu Xiang, Hongbo Song, Shaoxiong Lin, Jiaqi Sun, Tianyuan Han, Siyuan Chen, Binwei Yao, Sen Liu, Yifei Wu, Yanmin Qian, Kai Yu:
MSDWild: Multi-modal Speaker Diarization Dataset in the Wild. INTERSPEECH 2022: 1476-1480
[c123]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0005ZQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0005ZQ22
Xun Gong, Zhikai Zhou, Yanmin Qian:
Knowledge Transfer and Distillation from Autoregressive to Non-Autoregessive Speech Recognition. INTERSPEECH 2022: 2618-2622
[c122]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HanCQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HanCQ22
Bing Han, Zhengyang Chen, Yanmin Qian:
Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label Correction. INTERSPEECH 2022: 4780-4784
[c121]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Zhang0K00EYXMQW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Zhang0K00EYXMQW22
Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei:
Separating Long-Form Speech with Group-wise Permutation Invariant Training. INTERSPEECH 2022: 5383-5387
[c120]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuCLZCNMYSW0Q022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuCLZCNMYSW0Q022
Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. INTERSPEECH 2022: 5458-5462
[c119]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/QuLBQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/QuLBQ22
Bowen Qu, Chenda Li, Jinfeng Bai, Yanmin Qian:
Improving Speech Separation with Knowledge Distilled from Self-supervised Pre-trained Models. ISCSLP 2022: 329-333
[c118]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangZLQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangZLQ22
Wei Wang, Wangyou Zhang, Shaoxiong Lin, Yanmin Qian:
Text-Informed Knowledge Distillation for Robust Speech Enhancement and Recognition. ISCSLP 2022: 334-338
[c117]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ZhouCCLXJQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZhouCCLXJQ22
Zhikai Zhou, Shuang Cao, Zhengyang Chen, Bei Liu, Ming Xia, Hong Jiang, Yanmin Qian:
Medical Difficult Airway Detection using Speech Technology. ISCSLP 2022: 349-353
[c116]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/HuangQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/HuangQ22
Houjun Huang, Yanmin Qian:
Speaking style compensation on synthetic audio for robust keyword spotting. ISCSLP 2022: 448-452
[c115]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ChengCYLYYZZXQLY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ChengCYLYYZZXQLY22
Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan:
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines. ISCSLP 2022: 488-492
[c114]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LiuXCHYQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LiuXCHYQ22
Tao Liu, Xu Xiang, Zhengyang Chen, Bing Han, Kai Yu, Yanmin Qian:
The X-Lance Speaker Diarization System for the Conversational Short-phrase Speaker Diarization Challenge 2022. ISCSLP 2022: 498-501
[c113]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ScheiblerZCWQ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ScheiblerZCWQ22
Robin Scheibler, Wangyou Zhang, Xuankai Chang, Shinji Watanabe, Yanmin Qian:
End-to-End Multi-Speaker ASR with Independent Vector Analysis. SLT 2022: 496-501
[c112]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ChenQHQZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ChenQHQZ22
Zhengyang Chen, Yao Qian, Bing Han, Yanmin Qian, Michael Zeng:
A Comprehensive Study on Self-Supervised Distillation for Speaker Representation Learning. SLT 2022: 599-604
[i43]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-10800
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-10800
Chenda Li, Lei Yang, Weiqin Wang, Yanmin Qian:
SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation. CoRR abs/2201.10800 (2022)
[i42]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-03647
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-03647
Fan Yu, Shiliang Zhang, Pengcheng Guo, Yihui Fu, Zhihao Du, Siqi Zheng, Weilong Huang, Lei Xie, Zheng-Hua Tan, DeLiang Wang, Yanmin Qian, Kong Aik Lee, Zhijie Yan, Bin Ma, Xin Xu, Hui Bu:
Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge. CoRR abs/2202.03647 (2022)
[i41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-00218
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-00218
Robin Scheibler, Wangyou Zhang, Xuankai Chang, Shinji Watanabe, Yanmin Qian:
End-to-End Multi-speaker ASR with Independent Vector Analysis. CoRR abs/2204.00218 (2022)
[i40]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-09883
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-09883
Xun Gong, Yizhou Lu, Zhikai Zhou, Yanmin Qian:
Layer-wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition. CoRR abs/2204.09883 (2022)
[i39]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-11699
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-11699
Zhengyang Chen, Bei Liu, Bing Han, Leying Zhang, Yanmin Qian:
The SJTU X-LANCE Lab System for CNSRC 2022. CoRR abs/2206.11699 (2022)
[i38]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-09514
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-09514
Yen-Ju Lu, Xuankai Chang, Chenda Li, Wangyou Zhang, Samuele Cornell, Zhaoheng Ni, Yoshiki Masuyama, Brian Yan, Robin Scheibler, Zhong-Qiu Wang, Yu Tsao, Yanmin Qian, Shinji Watanabe:
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding. CoRR abs/2207.09514 (2022)
[i37]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-10600
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-10600
Xun Gong, Zhikai Zhou, Yanmin Qian:
Knowledge Transfer and Distillation from Autoregressive to Non-Autoregressive Speech Recognition. CoRR abs/2207.10600 (2022)
[i36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-01928
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-01928
Bing Han, Zhengyang Chen, Yanmin Qian:
Self-Supervised Speaker Verification Using Dynamic Loss-Gate and Label Correction. CoRR abs/2208.01928 (2022)
[i35]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-01933
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-01933
Bing Han, Zhengyang Chen, Zhikai Zhou, Yanmin Qian:
The SJTU System for Short-duration Speaker Verification Challenge 2021. CoRR abs/2208.01933 (2022)
[i34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-08042
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-08042
Gaofeng Cheng, Yifan Chen, Runyan Yang, Qingxuan Li, Zehui Yang, Lingxuan Ye, Pengyuan Zhang, Qingqing Zhang, Lei Xie, Yanmin Qian, Kong Aik Lee, Yonghong Yan:
The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines. CoRR abs/2208.08042 (2022)
[i33]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-09076
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-09076
Zhengyang Chen, Bing Han, Xu Xiang, Houjun Huang, Bei Liu, Yanmin Qian:
SJTU-AISPEECH System for VoxCeleb Speaker Recognition Challenge 2022. CoRR abs/2209.09076 (2022)
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15936
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15936
Zhengyang Chen, Yao Qian, Bing Han, Yanmin Qian, Michael Zeng:
A comprehensive study on self-supervised distillation for speaker representation learning. CoRR abs/2210.15936 (2022)
[i31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-17016
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-17016
Hongji Wang, Chengdong Liang, Shuai Wang, Zhengyang Chen, Binbin Zhang, Xu Xiang, Yanlei Deng, Yanmin Qian:
Wespeaker: A Research and Production oriented Speaker Embedding Learning Toolkit. CoRR abs/2210.17016 (2022)
[i30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-00815
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00815
Zhengyang Chen, Bing Han, Xu Xiang, Houjun Huang, Bei Liu, Yanmin Qian:
Build a SRE Challenge System: Lessons from VoxSRC 2022 and CNSRC 2022. CoRR abs/2211.00815 (2022)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-09412
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-09412
Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian:
LongFNT: Long-form Speech Recognition with Factorized Neural Transducer. CoRR abs/2211.09412 (2022)
2021
[j24]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/YangWDQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/YangWDQ21
Jichen Yang, Hongji Wang, Rohan Kumar Das, Yanmin Qian:
Modified Magnitude-Phase Spectrum Information for Spoofing Detection. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1065-1078 (2021)
[j23]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/QianCW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/QianCW21
Yanmin Qian, Zhengyang Chen, Shuai Wang:
Audio-Visual Deep Neural Network for Robust Person Verification. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1079-1092 (2021)
[c111]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiCLHZKD0Q21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiCLHZKD0Q21
Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian:
Dual-Path Modeling for Long Recording Speech Separation in Meetings. ICASSP 2021: 5739-5743
[c110]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenWQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenWQ21
Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification. ICASSP 2021: 5834-5838
[c109]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DuHWQ021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DuHWQ021
Chenpeng Du, Bing Han, Shuai Wang, Yanmin Qian, Kai Yu:
SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification. ICASSP 2021: 5844-5848
[c108]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangXZWQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangXZWQ21
Houjun Huang, Xu Xiang, Fei Zhao, Shuai Wang, Yanmin Qian:
Unit Selection Synthesis Based Data Augmentation for Fixed Phrase Speaker Verification. ICASSP 2021: 5849-5853
[c107]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangXYMQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangXYMQ21
Houjun Huang, Xu Xiang, Yexin Yang, Rao Ma, Yanmin Qian:
AISpeech-SJTU Accent Identification System for the Accented English Speech Recognition Challenge. ICASSP 2021: 6254-6258
[c106]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0002LMZGQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0002LMZGQ21
Tian Tan, Yizhou Lu, Rao Ma, Sen Zhu, Jiaqi Guo, Yanmin Qian:
AISpeech-SJTU ASR System for the Accented English Speech Recognition Challenge. ICASSP 2021: 6413-6417
[c105]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangZLWDQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangZLWDQ21
Wei Wang, Zhikai Zhou, Yizhou Lu, Hongji Wang, Chenpeng Du, Yanmin Qian:
Towards Data Selection on TTS Data for Children's Speech Recognition. ICASSP 2021: 6888-6892
[c104]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangB0NDKOKHQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangB0NDKOKHQ21
Wangyou Zhang, Christoph Böddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, Yanmin Qian:
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. ICASSP 2021: 6898-6902
[c103]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShiYLLFWQX21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShiYLLFWQX21
Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, Yanmin Qian, Lei Xie:
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods. ICASSP 2021: 6918-6922
[c102]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BoddekerZNKODKQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BoddekerZNKODKQ21
Christoph Böddeker, Wangyou Zhang, Tomohiro Nakatani, Keisuke Kinoshita, Tsubasa Ochiai, Marc Delcroix, Naoyuki Kamo, Yanmin Qian, Reinhold Haeb-Umbach:
Convolutive Transfer Function Invariant SDR Training Criteria for Multi-Channel Reverberant Speech Separation. ICASSP 2021: 8428-8432
[c101]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GongLZQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GongLZQ21
Xun Gong, Yizhou Lu, Zhikai Zhou, Yanmin Qian:
Layer-Wise Fast Adaptation for End-to-End Multi-Accent Speech Recognition. Interspeech 2021: 1274-1278
[c100]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangCQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangCQ21
Leying Zhang, Zhengyang Chen, Yanmin Qian:
Knowledge Distillation from Multi-Modality to Single-Modality for Person Verification. Interspeech 2021: 1897-1901
[c99]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuQ21
Zhengxi Liu, Yanmin Qian:
Basis-MelGAN: Efficient Neural Vocoder Based on Audio Decomposition. Interspeech 2021: 2222-2226
[c98]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HanCZQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HanCZQ21
Bing Han, Zhengyang Chen, Zhikai Zhou, Yanmin Qian:
The SJTU System for Short-Duration Speaker Verification Challenge 2021. Interspeech 2021: 2332-2336
[c97]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuLYWQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuLYWQ21
Yifei Wu, Chenda Li, Song Yang, Zhongqin Wu, Yanmin Qian:
Audio-Visual Multi-Talker Speech Recognition in a Cocktail Party. Interspeech 2021: 3021-3025
[c96]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/GongCYWWQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/GongCYWWQ21
Xun Gong, Zhengyang Chen, Yexin Yang, Shuai Wang, Lan Wang, Yanmin Qian:
Speaker Embedding Augmentation with Noise Distribution Matching. ISCSLP 2021: 1-5
[c95]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangYQ021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangYQ021
Shuai Wang, Yexin Yang, Yanmin Qian, Kai Yu:
Revisiting the Statistics Pooling Layer in Deep Speaker Embedding Learning. ISCSLP 2021: 1-5
[c94]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/DuLLWQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/DuLLWQ21
Chenpeng Du, Hao Li, Yizhou Lu, Lan Wang, Yanmin Qian:
Data Augmentation for end-to-end Code-Switching Speech Recognition. SLT 2021: 194-200
[c93]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/LiLHLYZDKBQ0C21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/LiLHLYZDKBQ0C21
Chenda Li, Yi Luo, Cong Han, Jinyu Li, Takuya Yoshioka, Tianyan Zhou, Marc Delcroix, Keisuke Kinoshita, Christoph Böddeker, Yanmin Qian, Shinji Watanabe, Zhuo Chen:
Dual-Path RNN for Long Recording Speech Separation. SLT 2021: 865-872
[c92]
- view
  authority control:
- export record
  dblp key:
  - conf/waspaa/ZhangSLWQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/waspaa/ZhangSLWQ21
Wangyou Zhang, Jing Shi, Chenda Li, Shinji Watanabe, Yanmin Qian:
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions. WASPAA 2021: 146-150
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-09817
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-09817
Houjun Huang, Xu Xiang, Fei Zhao, Shuai Wang, Yanmin Qian:
Unit selection synthesis based data augmentation for fixed phrase speaker verification. CoRR abs/2102.09817 (2021)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-09828
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-09828
Houjun Huang, Xu Xiang, Yexin Yang, Rao Ma, Yanmin Qian:
AISPEECH-SJTU accent identification system for the Accented English Speech Recognition Challenge. CoRR abs/2102.09828 (2021)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-10233
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-10233
Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, Yanmin Qian, Lei Xie:
The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods. CoRR abs/2102.10233 (2021)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-11525
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-11525
Wangyou Zhang, Christoph Böddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, Yanmin Qian:
End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend. CoRR abs/2102.11525 (2021)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-11634
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-11634
Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian:
Dual-Path Modeling for Long Recording Speech Separation in Meetings. CoRR abs/2102.11634 (2021)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-13419
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-13419
Zhengxi Liu, Yanmin Qian:
Basis-MelGAN: Efficient Neural Vocoder Based on Audio Decomposition. CoRR abs/2106.13419 (2021)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-13843
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-13843
Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification. CoRR abs/2108.13843 (2021)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-05777
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05777
Zhengyang Chen, Sanyuan Chen, Yu Wu, Yao Qian, Chengyi Wang, Shujie Liu, Yanmin Qian, Michael Zeng:
Large-scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification. CoRR abs/2110.05777 (2021)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-12138
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-12138
Wei Wang, Shuo Ren, Yao Qian, Shujie Liu, Yu Shi, Yanmin Qian, Michael Zeng:
Optimizing Alignment of Speech and Language Latent Spaces for End-to-End Speech Recognition and Understanding. CoRR abs/2110.12138 (2021)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-13900
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-13900
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Furu Wei:
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. CoRR abs/2110.13900 (2021)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-14139
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-14139
Wangyou Zhang, Jing Shi, Chenda Li, Shinji Watanabe, Yanmin Qian:
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions. CoRR abs/2110.14139 (2021)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-14142
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-14142
Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei:
Separating Long-Form Speech with Group-Wise Permutation Invariant Training. CoRR abs/2110.14142 (2021)
2020
[j22]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangCQW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangCQW20
Wangyou Zhang, Xuankai Chang, Yanmin Qian, Shinji Watanabe:
Improving End-to-End Single-Channel Multi-Talker Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1385-1394 (2020)
[j21]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangYWQY20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangYWQY20
Shuai Wang, Yexin Yang, Zhanghao Wu, Yanmin Qian, Kai Yu:
Data Augmentation Using Deep Generative Models for Embedding Based Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2598-2609 (2020)
[c91]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChangZQRW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChangZQRW20
Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe:
End-To-End Multi-Speaker Speech Recognition With Transformer. ICASSP 2020: 6134-6138
[c90]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Yang0GQ020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Yang0GQ020
Yexin Yang, Shuai Wang, Xun Gong, Yanmin Qian, Kai Yu:
Text Adaptation for Speaker Verification with Speaker-Text Factorized Embeddings. ICASSP 2020: 6454-6458
[c89]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Chen0Q020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Chen0Q020
Zhengyang Chen, Shuai Wang, Yanmin Qian, Kai Yu:
Channel Invariant Speaker Embedding Learning with Joint Multi-Task and Adversarial Training. ICASSP 2020: 6574-6578
[c88]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiQ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiQ20
Chenda Li, Yanmin Qian:
Deep Audio-Visual Speech Separation with Attention Mechanism. ICASSP 2020: 7314-7318
[c87]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangQ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangQ20
Wangyou Zhang, Yanmin Qian:
Learning Contextual Language Embeddings for Monaural Multi-Talker Speech Recognition. INTERSPEECH 2020: 304-308
[c86]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangSC0Q20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangSC0Q20
Wangyou Zhang, Aswin Shanmugam Subramanian, Xuankai Chang, Shinji Watanabe, Yanmin Qian:
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming. INTERSPEECH 2020: 324-328
[c85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangD0Q020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangD0Q020
Hongji Wang, Heinrich Dinkel, Shuai Wang, Yanmin Qian, Kai Yu:
Dual-Adversarial Domain Adaptation for Generalized Replay Attack Detection. INTERSPEECH 2020: 1086-1090
[c84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiQ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiQ20
Chenda Li, Yanmin Qian:
Listen, Watch and Understand at the Cocktail Party: Audio-Visual-Contextual Speech Separation. INTERSPEECH 2020: 1426-1430
[c83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenWQ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenWQ20
Zhengyang Chen, Shuai Wang, Yanmin Qian:
Multi-Modality Matters: A Performance Leap on VoxCeleb. INTERSPEECH 2020: 2252-2256
[c82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenWQ20a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenWQ20a
Zhengyang Chen, Shuai Wang, Yanmin Qian:
Adversarial Domain Adaptation for Speaker Verification Using Partially Shared Network. INTERSPEECH 2020: 3017-3021
[c81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuHLGQ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuHLGQ20
Yizhou Lu, Mingkun Huang, Hao Li, Jiaqi Guo, Yanmin Qian:
Bi-Encoder Transformer Network for Mandarin-English Code-Switching Speech Recognition Using Mixture of Experts. INTERSPEECH 2020: 4766-4770
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-03921
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-03921
Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe:
End-to-End Multi-speaker Speech Recognition with Transformer. CoRR abs/2002.03921 (2020)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-10479
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-10479
Wangyou Zhang, Aswin Shanmugam Subramanian, Xuankai Chang, Shinji Watanabe, Yanmin Qian:
End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming. CoRR abs/2005.10479 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-13060
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-13060
Heinrich Dinkel, Nanxin Chen, Yanmin Qian, Kai Yu:
End-to-end spoofing detection with raw waveform CLDNNs. CoRR abs/2007.13060 (2020)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-01832
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-01832
Qi Liu, Yanmin Qian, Kai Yu:
Future Vector Enhanced LSTM Language Model for LVCSR. CoRR abs/2008.01832 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-09906
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-09906
Yefei Chen, Shuai Wang, Yanmin Qian, Kai Yu:
End-to-End Speaker-Dependent Voice Activity Detection. CoRR abs/2009.09906 (2020)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-02160
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-02160
Chenpeng Du, Hao Li, Yizhou Lu, Lan Wang, Yanmin Qian:
Data Augmentation for End-to-end Code-switching Speech Recognition. CoRR abs/2011.02160 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-15003
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-15003
Christoph Böddeker, Wangyou Zhang, Tomohiro Nakatani, Keisuke Kinoshita, Tsubasa Ochiai, Marc Delcroix, Naoyuki Kamo, Yanmin Qian, Shinji Watanabe, Reinhold Haeb-Umbach:
Convolutive Transfer Function Invariant SDR training criteria for Multi-Channel Reverberant Speech Separation. CoRR abs/2011.15003 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/jzusc/QianWCWY19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jzusc/QianWCWY19
Yanmin Qian, Chao Weng, Xuankai Chang, Shuai Wang, Dong Yu:
Erratum to: Past review, current progress, and challenges ahead on the cocktail party problem. Frontiers Inf. Technol. Electron. Eng. 20(3): 438 (2019)
[j19]
- view
  authority control:
- export record
  dblp key:
  - journals/jzusc/QianX19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jzusc/QianX19
Yanmin Qian, Xu Xiang:
Binary neural networks for speech recognition. Frontiers Inf. Technol. Electron. Eng. 20(5): 701-715 (2019)
[j18]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/QianHT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/QianHT19
Yanmin Qian, Hu Hu, Tian Tan:
Data augmentation using generative adversarial networks for robust speech recognition. Speech Commun. 114: 1-9 (2019)
[j17]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangHQY19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangHQY19
Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu:
Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 27(11): 1686-1696 (2019)
[c80]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/XiangWHQ019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/XiangWHQ019
Xu Xiang, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu:
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition. APSIPA 2019: 1652-1656
[c79]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ShengYQ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ShengYQ19
Peiyao Sheng, Zhuolin Yang, Yanmin Qian:
GANs for Children: A Generative Data Augmentation Strategy for Children Speech Recognition. ASRU 2019: 129-135
[c78]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ChangZQRW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ChangZQRW19
Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe:
MIMO-Speech: End-to-End Multi-Channel Multi-Speaker Speech Recognition. ASRU 2019: 237-244
[c77]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/HuangLWQY19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/HuangLWQY19
Mingkun Huang, Yizhou Lu, Lan Wang, Yanmin Qian, Kai Yu:
Exploring Model Units and Training Strategies for End-to-End Speech Recognition. ASRU 2019: 524-531
[c76]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ZhangSWQ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ZhangSWQ19
Wangyou Zhang, Man Sun, Lan Wang, Yanmin Qian:
End-to-End Overlapped Speech Detection and Speaker Counting with Raw Waveform. ASRU 2019: 660-666
[c75]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangYWQ019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangYWQ019
Shuai Wang, Yexin Yang, Tianzhe Wang, Yanmin Qian, Kai Yu:
Knowledge Distillation for Small Foot-print Deep Speaker Embedding. ICASSP 2019: 6021-6025
[c74]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChangQ0W19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChangQ0W19
Xuankai Chang, Yanmin Qian, Kai Yu, Shinji Watanabe:
End-to-end Monaural Multi-speaker ASR System without Pretraining. ICASSP 2019: 6256-6260
[c73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YangWDCWQ019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YangWDCWQ019
Yexin Yang, Hongji Wang, Heinrich Dinkel, Zhengyang Chen, Shuai Wang, Yanmin Qian, Kai Yu:
The SJTU Robust Anti-Spoofing System for the ASVspoof 2019 Challenge. INTERSPEECH 2019: 1038-1042
[c72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangRBPQ0C19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangRBPQ0C19
Shuai Wang, Johan Rohdin, Lukás Burget, Oldrich Plchot, Yanmin Qian, Kai Yu, Jan Cernocký:
On the Usage of Phonetic Information for Text-Independent Speaker Embedding Extraction. INTERSPEECH 2019: 1148-1152
[c71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuWQ019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuWQ019
Zhanghao Wu, Shuai Wang, Yanmin Qian, Kai Yu:
Data Augmentation Using Variational Autoencoder for Embedding Based Speaker Verification. INTERSPEECH 2019: 1163-1167
[c70]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuoYQ019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuoYQ019
Jiaqi Guo, Yongbin You, Yanmin Qian, Kai Yu:
Joint Decoding of CTC Based Systems for Speech Recognition. INTERSPEECH 2019: 2205-2209
[c69]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangCQ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangCQ19
Wangyou Zhang, Xuankai Chang, Yanmin Qian:
Knowledge Distillation for End-to-End Monaural Multi-Talker ASR System. INTERSPEECH 2019: 2633-2637
[c68]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangZQ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangZQ19
Wangyou Zhang, Ying Zhou, Yanmin Qian:
Robust DOA Estimation Based on Convolutional Neural Network and Time-Frequency Masking. INTERSPEECH 2019: 2703-2707
[c67]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangD0Q019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangD0Q019
Hongji Wang, Heinrich Dinkel, Shuai Wang, Yanmin Qian, Kai Yu:
Cross-Domain Replay Spoofing Attack Detection Using Domain Adversarial Training. INTERSPEECH 2019: 2938-2942
[c66]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiQ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiQ19
Chenda Li, Yanmin Qian:
Prosody Usage Optimization for Children Speech Recognition with Zero Resource Children Speech. INTERSPEECH 2019: 3446-3450
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-07317
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-07317
Xu Xiang, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu:
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition. CoRR abs/1906.07317 (2019)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-06522
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-06522
Xuankai Chang, Wangyou Zhang, Yanmin Qian, Jonathan Le Roux, Shinji Watanabe:
MIMO-SPEECH: End-to-End Multi-Channel Multi-Speaker Speech Recognition. CoRR abs/1910.06522 (2019)
2018
[j16]
- view
  authority control:
- export record
  dblp key:
  - journals/jzusc/QianWCWY18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jzusc/QianWCWY18
Yanmin Qian, Chao Weng, Xuankai Chang, Shuai Wang, Dong Yu:
Past review, current progress, and challenges ahead on the cocktail party problem. Frontiers Inf. Technol. Electron. Eng. 19(1): 40-63 (2018)
[j15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/jzusc/QianWCWY18a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jzusc/QianWCWY18a
Yanmin Qian, Chao Weng, Xuankai Chang, Shuai Wang, Dong Yu:
Erratum to: Past review, current progress, and challenges ahead on the cocktail party problem. Frontiers Inf. Technol. Electron. Eng. 19(4): 582 (2018)
[j14]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/ChenQ018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/ChenQ018
Zhehuai Chen, Yanmin Qian, Kai Yu:
Sequence discriminative training for deep learning based acoustic keyword spotting. Speech Commun. 102: 100-111 (2018)
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/QianCY18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/QianCY18
Yanmin Qian, Xuankai Chang, Dong Yu:
Single-channel multi-talker speech recognition with permutation invariant training. Speech Commun. 104: 1-11 (2018)
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/TanQHZDY18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TanQHZDY18
Tian Tan, Yanmin Qian, Hu Hu, Ying Zhou, Wen Ding, Kai Yu:
Adaptive Very Deep Convolutional Residual Network for Noise Robust Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 26(8): 1393-1405 (2018)
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/DinkelQY18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/DinkelQY18
Heinrich Dinkel, Yanmin Qian, Kai Yu:
Investigating Raw Wave Deep Neural Networks for End-to-End Speaker Spoofing Detection. IEEE ACM Trans. Audio Speech Lang. Process. 26(11): 2002-2014 (2018)
[c65]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouQ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouQ18
Ying Zhou, Yanmin Qian:
Robust Mask Estimation By Integrating Neural Network-Based and Clustering-Based Approaches for Adaptive Acoustic Beamforming. ICASSP 2018: 536-540
[c64]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0002Q018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0002Q018
Tian Tan, Yanmin Qian, Dong Yu:
Knowledge Transfer in Permutation Invariant Training for Single-Channel Multi-Talker Speech Recognition. ICASSP 2018: 571-5718
[c63]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangWQ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangWQ18
Zili Huang, Shuai Wang, Yanmin Qian:
Joint I-Vector with End-to-End System for Short Duration Text-Independent Speaker Verification. ICASSP 2018: 4869-4873
[c62]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Hu0Q18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Hu0Q18
Hu Hu, Tian Tan, Yanmin Qian:
Generative Adversarial Networks Based Data Augmentation for Noise Robust Speech Recognition. ICASSP 2018: 5044-5048
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangQ018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangQ018
Shuai Wang, Yanmin Qian, Kai Yu:
Focal Kl-Divergence Based Dilated Convolutional Neural Networks for Co-Channel Speaker Identification. ICASSP 2018: 5339-5343
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Qian0HL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Qian0HL18
Yanmin Qian, Tian Tan, Hu Hu, Qi Liu:
Noise Robust Speech Recognition on Aurora4 by Humans and Machines. ICASSP 2018: 5604-5608
[c59]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Ding0Q18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Ding0Q18
Wen Ding, Tian Tan, Yanmin Qian:
Fast Adaptation on Deepmixture Generative Network Based Acoustic Modeling. ICASSP 2018: 5944-5948
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChangQY18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChangQY18
Xuankai Chang, Yanmin Qian, Dong Yu:
Adaptive Permutation Invariant Training with Auxiliary Information for Monaural Multi-Talker Speech Recognition. ICASSP 2018: 5974-5978
[c57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenYQSY18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenYQSY18
Lianwu Chen, Meng Yu, Yanmin Qian, Dan Su, Dong Yu:
Permutation Invariant Training of Generative Adversarial Network for Monaural Speech Separation. INTERSPEECH 2018: 302-306
[c56]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangCSCYQY18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangCSCYQY18
Jun Wang, Jie Chen, Dan Su, Lianwu Chen, Meng Yu, Yanmin Qian, Dong Yu:
Deep Extractor Network for Target Speaker Recovery from Single Channel Speech Mixtures. INTERSPEECH 2018: 307-311
[c55]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChangQ018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChangQ018
Xuankai Chang, Yanmin Qian, Dong Yu:
Monaural Multi-Talker Speech Recognition with Attention Mechanism and Gated Convolutional Networks. INTERSPEECH 2018: 1586-1590
[c54]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangYCQ018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangYCQ018
Mingkun Huang, Yongbin You, Zhehuai Chen, Yanmin Qian, Kai Yu:
Knowledge Distillation for Sequence Model. INTERSPEECH 2018: 3703-3707
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/iscide/WangDQ018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscide/WangDQ018
Shuai Wang, Heinrich Dinkel, Yanmin Qian, Kai Yu:
Covariance Based Deep Feature for Text-Dependent Speaker Verification. IScIDE 2018: 231-242
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ShengYH0Q18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ShengYH0Q18
Peiyao Sheng, Zhuolin Yang, Hu Hu, Tian Tan, Yanmin Qian:
Data Augmentation using Conditional Generative Adversarial Networks for Robust Speech Recognition. ISCSLP 2018: 121-125
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WangHQ018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WangHQ018
Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu:
Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition. ISCSLP 2018: 195-199
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/YangWSQ018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/YangWSQ018
Yexin Yang, Shuai Wang, Man Sun, Yanmin Qian, Kai Yu:
Generative Adversarial Networks based X-vector Augmentation for Robust Probabilistic Linear Discriminant Analysis in Speaker Verification. ISCSLP 2018: 205-209
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1805-01344
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-01344
Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu:
Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition. CoRR abs/1805.01344 (2018)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1807-08974
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-08974
Jun Wang, Jie Chen, Dan Su, Lianwu Chen, Meng Yu, Yanmin Qian, Dong Yu:
Deep Extractor Network for Target Speaker Recovery From Single Channel Speech Mixtures. CoRR abs/1807.08974 (2018)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1808-00639
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-00639
Zhehuai Chen, Yanmin Qian, Kai Yu:
Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting. CoRR abs/1808.00639 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-02062
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-02062
Xuankai Chang, Yanmin Qian, Kai Yu, Shinji Watanabe:
End-to-End Monaural Multi-speaker ASR System without Pretraining. CoRR abs/1811.02062 (2018)
2017
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ChenZQY17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ChenZQY17
Zhehuai Chen, Yimeng Zhuang, Yanmin Qian, Kai Yu:
Phone Synchronous Speech Recognition With CTC Lattices. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 86-97 (2017)
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/QianCDW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/QianCDW17
Yanmin Qian, Nanxin Chen, Heinrich Dinkel, Zhizheng Wu:
Deep Feature Engineering for Noise Robust Spoofing Detection. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1942-1955 (2017)
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/JiangWXQ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/JiangWXQ17
Xiaowei Jiang, Shuai Wang, Xu Xiang, Yanmin Qian:
Integrating online i-vector into GMM-UBM for text-dependent speaker verification. APSIPA 2017: 1628-1632
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiuQ017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiuQ017
Qi Liu, Yanmin Qian, Kai Yu:
Future vector enhanced LSTM language model for LVCSR. ASRU 2017: 104-110
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/cncl/WuHCQY17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cncl/WuHCQY17
Yue Wu, Tianxing He, Zhehuai Chen, Yanmin Qian, Kai Yu:
Multi-view LSTM Language Model with Word-Synchronized Auxiliary Feature for LVCSR. CCL 2017: 398-410
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DinkelCQY17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DinkelCQY17
Heinrich Dinkel, Nanxin Chen, Yanmin Qian, Kai Yu:
End-to-end spoofing detection with raw waveform CLDNNS. ICASSP 2017: 4860-4864
[c45]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/DinkelQY17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/DinkelQY17
Heinrich Dinkel, Yanmin Qian, Kai Yu:
Small-footprint convolutional neural network for spoofing detection. IJCNN 2017: 3086-3091
[c44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XiangQ017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XiangQ017
Xu Xiang, Yanmin Qian, Kai Yu:
Binary Deep Neural Networks for Speech Recognition. INTERSPEECH 2017: 533-537
[c43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangQ017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangQ017
Shuai Wang, Yanmin Qian, Kai Yu:
What Does the Speaker Embedding Encode? INTERSPEECH 2017: 1497-1501
[c42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuCQ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuCQ17
Dong Yu, Xuankai Chang, Yanmin Qian:
Recognizing Multi-Talker Speech with Permutation Invariant Training. INTERSPEECH 2017: 2456-2460
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/iscide/ChenQY17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscide/ChenQY17
Zhehuai Chen, Yanmin Qian, Kai Yu:
A Unified Confidence Measure Framework Using Auxiliary Normalization Graph. IScIDE 2017: 123-133
[p1]
- view
  authority control:
- export record
  dblp key:
  - books/sp/17/SimQMSKT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/sp/17/SimQMSKT17
Khe Chai Sim, Yanmin Qian, Gautam Mantena, Lahiru Samarakoon, Souvik Kundu, Tian Tan:
Adaptation of Deep Neural Network Acoustic Models for Robust Automatic Speech Recognition. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 219-243
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/YuCQ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/YuCQ17
Dong Yu, Xuankai Chang, Yanmin Qian:
Recognizing Multi-talker Speech with Permutation Invariant Training. CoRR abs/1704.01985 (2017)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/QianCY17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/QianCY17
Yanmin Qian, Xuankai Chang, Dong Yu:
Single-Channel Multi-talker Speech Recognition with Permutation Invariant Training. CoRR abs/1707.06527 (2017)
2016
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/QianCY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/QianCY16
Yanmin Qian, Nanxin Chen, Kai Yu:
Deep features for automatic spoofing detection. Speech Commun. 85: 43-52 (2016)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/TanQY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TanQY16
Tian Tan, Yanmin Qian, Kai Yu:
Cluster Adaptive Training for Deep Neural Network Based Acoustic Model. IEEE ACM Trans. Audio Speech Lang. Process. 24(3): 459-468 (2016)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/QianTY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/QianTY16
Yanmin Qian, Tian Tan, Dong Yu:
Neural Network Based Multi-Factor Aware Joint Training for Robust Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(12): 2231-2240 (2016)
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/QianBTY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/QianBTY16
Yanmin Qian, Mengxiao Bi, Tian Tan, Kai Yu:
Very Deep Convolutional Neural Networks for Noise Robust Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(12): 2263-2276 (2016)
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/btas/KorshunovMMGMVS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/btas/KorshunovMMGMVS16
Pavel Korshunov, Sébastien Marcel, Hannah Muckenhirn, André R. Gonçalves, A. G. Souza Mello, Ricardo Paranhos Velloso Violato, Flávio Olmos Simões, Mário Uliani Neto, Marcus de Assis Angeloni, José Augusto Stuchi, Heinrich Dinkel, Nanxin Chen, Yanmin Qian, Dipjyoti Paul, Goutam Saha, Md. Sahidullah:
Overview of BTAS 2016 speaker anti-spoofing competition. BTAS 2016: 1-6
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KunduMQTDS16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KunduMQTDS16
Souvik Kundu, Gautam Mantena, Yanmin Qian, Tian Tan, Marc Delcroix, Khe Chai Sim:
Joint acoustic factor learning for robust deep neural network based automatic speech recognition. ICASSP 2016: 5025-5029
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TanQYKLSXZ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TanQYKLSXZ16
Tian Tan, Yanmin Qian, Dong Yu, Souvik Kundu, Liang Lu, Khe Chai Sim, Xiong Xiao, Yu Zhang:
Speaker-aware training of LSTM-RNNS for acoustic modelling. ICASSP 2016: 5280-5284
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangZWGKLLQ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangZWGKLLQ16
Linlin Wang, Chao Zhang, Philip C. Woodland, Mark J. F. Gales, Panagiota Karanasou, Pierre Lanchantin, Xunying Liu, Yanmin Qian:
Improved DNN-based segmentation for multi-genre broadcast audio. ICASSP 2016: 5700-5704
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/QianTY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/QianTY16
Yanmin Qian, Tian Tan, Dong Yu:
An investigation into using parallel data for far-field speech recognition. ICASSP 2016: 5725-5729
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/QianTYZ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/QianTYZ16
Yanmin Qian, Tian Tan, Dong Yu, Yu Zhang:
Integrated adaptation with multi-factor joint-learning for far-field speech recognition. ICASSP 2016: 5770-5774
[c34]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhuangCQY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhuangCQY16
Yimeng Zhuang, Xuankai Chang, Yanmin Qian, Kai Yu:
Unrestricted Vocabulary Keyword Spotting Using LSTM-CTC. INTERSPEECH 2016: 938-942
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ZhuangTYQY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZhuangTYQY16
Yimeng Zhuang, Sibo Tong, Maofan Yin, Yanmin Qian, Kai Yu:
Multi-task joint-learning for robust voice activity detection. ISCSLP 2016: 1-5
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/QianW16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/QianW16
Yanmin Qian, Philip C. Woodland:
Very deep convolutional neural networks for robust speech recognition. SLT 2016: 481-488
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/QianW16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/QianW16
Yanmin Qian, Philip C. Woodland:
Very Deep Convolutional Neural Networks for Robust Speech Recognition. CoRR abs/1610.00277 (2016)
2015
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/LiuQCFZY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/LiuQCFZY15
Yuan Liu, Yanmin Qian, Nanxin Chen, Tianfan Fu, Ya Zhang, Kai Yu:
Deep feature for text-dependent speaker verification. Speech Commun. 73: 1-13 (2015)
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/QianYYY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/QianYYY15
Yanmin Qian, Maofan Yin, Yongbin You, Kai Yu:
Multi-task joint-learning of deep neural networks for robust speech recognition. ASRU 2015: 310-316
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/WoodlandLQZGKLW15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/WoodlandLQZGKLW15
Philip C. Woodland, Xunying Liu, Yanmin Qian, Chao Zhang, Mark J. F. Gales, Penny Karanasou, Pierre Lanchantin, Linlin Wang:
Cambridge university transcription systems for the multi-genre broadcast challenge. ASRU 2015: 639-646
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LanchantinGKLQW15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LanchantinGKLQW15
Pierre Lanchantin, Mark J. F. Gales, Penny Karanasou, Xunying Liu, Yanmin Qian, Linlin Wang, Philip C. Woodland, Chao Zhang:
The development of the cambridge university alignment systems for the multi-genre broadcast challenge. ASRU 2015: 647-653
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/KaranasouGLLQWW15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/KaranasouGLLQWW15
Penny Karanasou, Mark J. F. Gales, Pierre Lanchantin, Xunying Liu, Yanmin Qian, Linlin Wang, Philip C. Woodland, Chao Zhang:
Speaker diarisation and longitudinal linking in multi-genre broadcast data. ASRU 2015: 660-666
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/YouQY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/YouQY15
Yongbin You, Yanmin Qian, Kai Yu:
Local trajectory based speech enhancement for robust speech recognition with deep neural network. ChinaSIP 2015: 5-9
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/YouQHY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/YouQHY15
Yongbin You, Yanmin Qian, Tianxing He, Kai Yu:
An investigation on DNN-derived bottleneck features for GMM-HMM based robust speech recognition. ChinaSIP 2015: 30-34
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TanQYZY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TanQYZY15
Tian Tan, Yanmin Qian, Maofan Yin, Yimeng Zhuang, Kai Yu:
Cluster adaptive training for deep neural network. ICASSP 2015: 4325-4329
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BuZQY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BuZQY15
Suliang Bu, Yunxin Zhao, Yanmin Qian, Kai Yu:
A novel static parameter calculation method for model compensation. ICASSP 2015: 4510-4514
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HeXQY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HeXQY15
Tianxing He, Xu Xiang, Yanmin Qian, Kai Yu:
Recurrent neural network language model with structured word embeddings for speech recognition. ICASSP 2015: 5396-5400
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/QianHDY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/QianHDY15
Yanmin Qian, Tianxing He, Wei Deng, Kai Yu:
Automatic model redundancy reduction for fast back-propagation for deep neural networks in speech recognition. IJCNN 2015: 1-6
[c21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenQY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenQY15
Nanxin Chen, Yanmin Qian, Kai Yu:
Multi-task learning for text-dependent speaker verification. INTERSPEECH 2015: 185-189
[c20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenQDCY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenQDCY15
Nanxin Chen, Yanmin Qian, Heinrich Dinkel, Bo Chen, Kai Yu:
Robust deep feature for spoofing detection - the SJTU system for ASVspoof 2015 challenge. INTERSPEECH 2015: 2097-2101
[c19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BiQY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BiQY15
Mengxiao Bi, Yanmin Qian, Kai Yu:
Very deep convolutional neural networks for LVCSR. INTERSPEECH 2015: 3259-3263
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JinHQY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JinHQY15
Wengong Jin, Tianxing He, Yanmin Qian, Kai Yu:
Paragraph vector based topic model for language model adaptation. INTERSPEECH 2015: 3516-3520
2014
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DengQFFY14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DengQFFY14
Wei Deng, Yanmin Qian, Yuchen Fan, Tianfan Fu, Kai Yu:
Stochastic data sweeping for fast DNN training. ICASSP 2014: 240-244
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HeFQTY14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HeFQTY14
Tianxing He, Yuchen Fan, Yanmin Qian, Tian Tan, Kai Yu:
Reshaping deep neural network for fast decoding by node-pruning. ICASSP 2014: 245-249
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BuQSYY14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BuQSYY14
Suliang Bu, Yanmin Qian, Khe Chai Sim, Yongbin You, Kai Yu:
Second order vector taylor series based robust speech recognition. ICASSP 2014: 1769-1773
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/LiuFFQY14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/LiuFFQY14
Yuan Liu, Tianfan Fu, Yuchen Fan, Yanmin Qian, Kai Yu:
Speaker verification with deep features. IJCNN 2014: 747-753
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FuQLY14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FuQLY14
Tianfan Fu, Yanmin Qian, Yuan Liu, Kai Yu:
Tandem deep features for text-dependent speaker verification. INTERSPEECH 2014: 1327-1331
[c12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BuQY14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BuQY14
Suliang Bu, Yanmin Qian, Kai Yu:
A novel dynamic parameters calculation approach for model compensation. INTERSPEECH 2014: 2744-2748
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/NiuQY14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/NiuQY14
Jianwei Niu, Yanmin Qian, Kai Yu:
Acoustic emotion recognition using deep neural network. ISCSLP 2014: 128-132
2013
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/QianYL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/QianYL13
Yanmin Qian, Kai Yu, Jia Liu:
Combination of data borrowing strategies for low-resource LVCSR. ASRU 2013: 404-409
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/QianL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QianL13
Yanmin Qian, Jia Liu:
MLP-HMM two-stage unsupervised training for low-resource languages on conversational telephone speech recognition. INTERSPEECH 2013: 1816-1820
2012
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PoveyHBBGJKKMQRVV12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PoveyHBBGJKKMQRVV12
Daniel Povey, Mirko Hannemann, Gilles Boulianne, Lukás Burget, Arnab Ghoshal, Milos Janda, Martin Karafiát, Stefan Kombrink, Petr Motlícek, Yanmin Qian, Korbinian Riedhammer, Karel Veselý, Ngoc Thang Vu:
Generating exact lattices in the WFST framework. ICASSP 2012: 4213-4216
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/QianL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QianL12
Yanmin Qian, Jia Liu:
Cross-Lingual and Ensemble MLPs Strategies for Low-Resource Speech Recognition. INTERSPEECH 2012: 2582-2585
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/QianL12a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QianL12a
Yanmin Qian, Jia Liu:
Articulatory Feature based Multilingual MLPs for Low-Resource Speech Recognition. INTERSPEECH 2012: 2602-2605
2011
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/DengZQL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/DengZQL11
Yan Deng, Weiqiang Zhang, Yanmin Qian, Jia Liu:
Language Recognition Based on Acoustic Diversified Phone Recognizers and Phonotactic Feature Fusion. IEICE Trans. Inf. Syst. 94-D(3): 679-689 (2011)
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/jcp/DengZQL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jcp/DengZQL11
Yan Deng, Weiqiang Zhang, Yanmin Qian, Jia Liu:
Time-Frequency Cepstral Features and Combining Discriminative Training for Phonotactic Language Recognition. J. Comput. 6(2): 178-183 (2011)
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/QianXPL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/QianXPL11
Yanmin Qian, Ji Xu, Daniel Povey, Jia Liu:
Strategies for using MLP based features with limited target-language training data. ASRU 2011: 354-358
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/QianPL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QianPL11
Yanmin Qian, Daniel Povey, Jia Liu:
State-Level Data Borrowing for Low-Resource Speech Recognition Based on Subspace GMMs. INTERSPEECH 2011: 553-560
2010
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/QianL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/QianL10
Yanmin Qian, Jia Liu:
Phone modeling and combining discriminative training for mandarinenglish bilingual speech recognition. ICASSP 2010: 4918-4921
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icica/DengZQL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icica/DengZQL10
Yan Deng, Weiqiang Zhang, Yanmin Qian, Jia Liu:
Integration of Complementary Phone Recognizers for Phonotactic Language Recognition. ICICA (LNCS) 2010: 237-244
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/QianL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/QianL10
Yanmin Qian, Jia Liu:
Mandarin-English bilingual phone modeling and combining MPE based Discriminative training for cross-language speech recognition. ISCSLP 2010: 103-108

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tce/QianLJ09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tce/QianLJ09
Yanmin Qian, Jia Liu, Michael T. Johnson:
Efficient embedded speech recognition for very large vocabulary Mandarin car-navigation systems. IEEE Trans. Consumer Electron. 55(3): 1496-1500 (2009)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.