


default search action
Shuai Wang 0016
Person information
- affiliation: Chinese University of Hong Kong-Shenzhen (CUKH-SZ), Shenzhen Research Institute of Big Data, Shenzhen, China
- affiliation (PhD 2020): Shanghai Jiao Tong University, Department of Computer Science and Engineering, China
Other persons with the same name
- Shuai Wang — disambiguation page
- Shuai Wang 0001
— Simula Research Laboratory, Oslo, Norway (and 1 more)
- Shuai Wang 0002
— Chinese Academy of Sciences, Academy of Mathematics and Systems Science, NCMIS, Beijing, China
- Shuai Wang 0003
— Hangzhou Dianzi University, School of Cyberspace, Lishui Institute, China (and 3 more)
- Shuai Wang 0004
— Chinese Academy of Sciences, Shenzhen Institute of Advanced Technology, China (and 2 more)
- Shuai Wang 0005
— Chinese Academy of Sciences, Institute of Automation, State Key Laboratory of Management and Control for Complex Systems, Beijing, China
- Shuai Wang 0006 — Nanjing University, Department of Computer Science and Technology, China (and 1 more)
- Shuai Wang 0007
— Tencent Robotics X, Shenzhen, China (and 1 more)
- Shuai Wang 0008
— Southeast University, School of Computer Science and Engineering, Nanjing, China (and 1 more)
- Shuai Wang 0009
— Sun Yat-sen University, Shenzhen, China (and 1 more)
- Shuai Wang 0010
— Ryerson University, Department of Electrical and Computer Engineering, Toronto, ON, Canada (and 1 more)
- Shuai Wang 0011
— Hong Kong University of Science and Technology, Hong Kong (and 2 more)
- Shuai Wang 0012 — Hong Kong Polytechnic University, Department of Computing, Hong Kong
- Shuai Wang 0013
— Beijing Institute of Technology, School of Information and Electronics, China
- Shuai Wang 0014
— Vrije Universiteit Amsterdam, The Netherlands (and 2 more)
- Shuai Wang 0015
— Wuhan University, School of Resource and Environmental Science, China
- Shuai Wang 0017
— Tianjin University, Tianjin Key Laboratory of Imaging and Sensing Microelectronic Technology, China
- Shuai Wang 0018
— University of Science and Technology of China, Department of Automation, Hefei, China
- Shuai Wang 0019
— Xidian University, State Key Laboratory of Integrated Service Networks, Xian, China
- Shuai Wang 0020 — University of Illinois at Chicago, Department of Computer Science, USA
- Shuai Wang 0021 — George Mason University, Department of Computer Science, Fairfax, VA, USA (and 1 more)
- Shuai Wang 0022
— SRI International, Center for Technology in Learning, Menlo Park, CA, USA (and 1 more)
- Shuai Wang 0023
— Changchun University of Science and Technology, School of Computer Science and Technology, China (and 1 more)
- Shuai Wang 0024
— Shenyang Agricultural University, College of Land and Environment, China (and 2 more)
- Shuai Wang 0025
— Changchun University of Science and Technology, School of Science, China
- Shuai Wang 0026
— Chinese Academy of Sciences, Aerospace Information Research Institute, Beijing, China (and 1 more)
- Shuai Wang 0027
— Beihang University, School of Computer Science and Engineering, Beijing, China (and 1 more)
- Shuai Wang 0028
— Tsinghua University, Department of Computer Science and Technology, Beijing, China
- Shuai Wang 0029 — Boston University, Division of Systems Engineering, Boston, MA, USA
- Shuai Wang 0030
— JOYY Inc, Beijing, China (and 2 more)
- Shuai Wang 0031
— China Three Gorges University, Key Laboratory of Intelligent Vision Based Monitoring for Hydroelectric Engineering, Yichang, China (and 1 more)
- Shuai Wang 0032
— University of Queensland, QLD, Australia
- Shuai Wang 0033
— Singapore University of Technology and Design, Information Systems Technology and Design Pillar, Tampines, Singapore (and 2 more)
- Shuai Wang 0034
— Yuncheng University, Shanxi Province Optoelectronic Information Science and Technology Laboratory, China (and 1 more)
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j12]Shuai Wang
, Zhengyang Chen, Bing Han, Hongji Wang, Chengdong Liang, Binbin Zhang, Xu Xiang, Wen Ding, Johan Rohdin, Anna Silnova, Yanmin Qian, Haizhou Li:
Advancing speaker embedding learning: Wespeaker toolkit for research and production. Speech Commun. 162: 103104 (2024) - [j11]Zhengyang Chen
, Bing Han
, Shuai Wang
, Yanmin Qian
:
Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1636-1649 (2024) - [j10]Wupeng Wang
, Zexu Pan
, Xinke Li, Shuai Wang
, Haizhou Li
:
Speech Separation With Pretrained Frontend to Minimize Domain Mismatch. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4184-4198 (2024) - [j9]Shuai Wang
, Zhengyang Chen, Kong Aik Lee
, Yanmin Qian
, Haizhou Li
:
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4971-4998 (2024) - [c45]Chenpeng Du
, Yiwei Guo, Feiyu Shen, Zhijun Liu, Zheng Liang, Xie Chen, Shuai Wang, Hui Zhang, Kai Yu:
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding. AAAI 2024: 17924-17932 - [c44]Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang:
AutoPrep: An Automatic Preprocessing Framework for In-The-Wild Speech Data. ICASSP 2024: 1136-1140 - [c43]Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis. ICASSP 2024: 10601-10605 - [c42]Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li:
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-Talker Speech. ICASSP 2024: 10666-10670 - [c41]Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li:
Leveraging in-the-wild Data for Effective Self-supervised Pretraining in Speaker Recognition. ICASSP 2024: 10901-10905 - [c40]Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Shuai Wang, Jixun Yao, Lei Xie, Mengxiao Bi:
Dualvc 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion. ICASSP 2024: 11106-11110 - [c39]Wen Huang
, Bing Han, Shuai Wang, Zhengyang Chen, Yanmin Qian:
Robust Cross-Domain Speaker Verification with Multi-Level Domain Adapters. ICASSP 2024: 11781-11785 - [i45]Chenpeng Du, Yiwei Guo, Hankun Wang, Yifan Yang, Zhikang Niu, Shuai Wang, Hui Zhang, Xie Chen, Kai Yu:
VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech. CoRR abs/2401.14321 (2024) - [i44]Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Fine-Grained Quantitative Emotion Editing for Speech Generation. CoRR abs/2403.02002 (2024) - [i43]Yiwei Guo, Chenrun Wang, Yifan Yang, Hankun Wang, Ziyang Ma, Chenpeng Du, Shuai Wang, Hanzheng Li, Shuai Fan, Hui Zhang, Xie Chen, Kai Yu:
The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge. CoRR abs/2404.06079 (2024) - [i42]Hankun Wang, Chenpeng Du, Yiwei Guo, Shuai Wang, Xie Chen, Kai Yu:
Attention-Constrained Inference for Robust Decoder-Only Text-to-Speech. CoRR abs/2404.19723 (2024) - [i41]Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis. CoRR abs/2405.09171 (2024) - [i40]Zhijun Liu, Shuai Wang, Sho Inoue, Qibing Bai, Haizhou Li:
Autoregressive Diffusion Transformer for Text-to-Speech Synthesis. CoRR abs/2406.05551 (2024) - [i39]Bohan Li, Feiyu Shen, Yiwei Guo, Shuai Wang, Xie Chen, Kai Yu:
On the Effectiveness of Acoustic BPE in Decoder-Only TTS. CoRR abs/2407.03892 (2024) - [i38]Shuai Wang, Zhengyang Chen, Kong Aik Lee, Yanmin Qian, Haizhou Li:
Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning. CoRR abs/2407.15188 (2024) - [i37]Ziqian Ning, Shuai Wang, Yuepeng Jiang, Jixun Yao, Lei He, Shifeng Pan, Jie Ding, Lei Xie:
Drop the beat! Freestyler for Accompaniment Conditioned Rapping Voice Generation. CoRR abs/2408.15474 (2024) - [i36]Yiyang Zhao, Shuai Wang, Guangzhi Sun, Zehua Chen, Chao Zhang, Mingxing Xu, Thomas Fang Zheng:
Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models. CoRR abs/2408.15585 (2024) - [i35]Yiwei Guo, Zhihan Li, Junjie Li, Chenpeng Du, Hankun Wang, Shuai Wang, Xie Chen, Kai Yu:
vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders. CoRR abs/2409.01995 (2024) - [i34]Zhengyang Chen, Bing Han, Shuai Wang, Yidi Jiang, Yanmin Qian:
Flow-TSVAD: Target-Speaker Voice Activity Detection via Latent Flow Matching. CoRR abs/2409.04859 (2024) - [i33]Zhengyang Chen, Shuai Wang, Mingyang Zhang, Xuechen Liu, Junichi Yamagishi, Yanmin Qian:
Disentangling the Prosody and Semantic Information with Pre-trained Model for In-Context Learning based Zero-Shot Voice Conversion. CoRR abs/2409.05004 (2024) - [i32]Zhijun Liu, Shuai Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li:
E1 TTS: Simple and Fast Non-Autoregressive TTS. CoRR abs/2409.09351 (2024) - [i31]Sho Inoue, Shuai Wang, Wanxing Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li:
MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion. CoRR abs/2409.09352 (2024) - [i30]Junjie Li, Ke Zhang, Shuai Wang, Haizhou Li, Man-Wai Mak, Kong Aik Lee:
On the effectiveness of enrollment speech augmentation for Target Speaker Extraction. CoRR abs/2409.09589 (2024) - [i29]Shuai Wang, Pengcheng Zhu, Haizhou Li:
M-Vec: Matryoshka Speaker Embeddings with Flexible Dimensions. CoRR abs/2409.15782 (2024) - [i28]Shuai Wang, Ke Zhang, Shaoxiong Lin, Junjie Li, Xuefei Wang, Meng Ge, Jianwei Yu, Yanmin Qian, Haizhou Li:
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction. CoRR abs/2409.15799 (2024) - [i27]Ke Zhang, Junjie Li, Shuai Wang, Yangjie Wei, Yi Wang, Yannan Wang, Haizhou Li:
Multi-Level Speaker Representation for Target Speaker Extraction. CoRR abs/2410.16059 (2024) - [i26]Wen Huang, Bing Han, Zhengyang Chen, Shuai Wang, Yanmin Qian:
Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification. CoRR abs/2410.17033 (2024) - [i25]Kangxiang Xia, Dake Guo, Jixun Yao, Liumeng Xue, Hanzhao Li, Shuai Wang, Zhao Guo, Lei Xie, Qingqing Zhang, Lei Luo, Minghui Dong, Peng Sun:
The ISCSLP 2024 Conversational Voice Clone (CoVoC) Challenge: Tasks, Results and Findings. CoRR abs/2411.00064 (2024) - [i24]Wupeng Wang, Zexu Pan, Xinke Li, Shuai Wang, Haizhou Li:
Speech Separation with Pretrained Frontend to Minimize Domain Mismatch. CoRR abs/2411.03085 (2024) - [i23]Junjie Li, Ke Zhang, Shuai Wang, Kong Aik Lee, Haizhou Li:
MoMuSE: Momentum Multi-modal Target Speaker Extraction for Real-time Scenarios with Impaired Visual Cues. CoRR abs/2412.08247 (2024) - [i22]Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Hierarchical Control of Emotion Rendering in Speech Synthesis. CoRR abs/2412.12498 (2024) - [i21]Chenyu Yang, Shuai Wang, Hangting Chen, Jianwei Yu, Wei Tan, Rongzhi Gu, Yaoxun Xu, Yizhi Zhou, Haina Zhu, Haizhou Li:
SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor. CoRR abs/2412.13786 (2024) - 2023
- [c38]Hongji Wang, Chengdong Liang, Shuai Wang, Zhengyang Chen, Binbin Zhang, Xu Xiang, Yanlei Deng, Yanmin Qian:
Wespeaker: A Research and Production Oriented Speaker Embedding Learning Toolkit. ICASSP 2023: 1-5 - [c37]Xintao Zhao, Shuai Wang, Yang Chao, Zhiyong Wu, Helen Meng:
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation-based Voice Conversion. ICME 2023: 1691-1696 - [c36]Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, Mengxiao Bi:
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding. INTERSPEECH 2023: 2063-2067 - [c35]Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor. INTERSPEECH 2023: 3552-3556 - [i20]Xintao Zhao, Shuai Wang, Yang Chao, Zhiyong Wu, Helen Meng:
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion. CoRR abs/2305.09167 (2023) - [i19]Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor. CoRR abs/2305.10704 (2023) - [i18]Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, Mengxiao Bi:
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding. CoRR abs/2305.12425 (2023) - [i17]Chenpeng Du, Yiwei Guo, Feiyu Shen, Zhijun Liu, Zheng Liang, Xie Chen, Shuai Wang, Hui Zhang, Kai Yu:
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding. CoRR abs/2306.07547 (2023) - [i16]Shuai Wang, Chengdong Liang, Xu Xiang, Bing Han, Zhengyang Chen, Hongji Wang, Wen Ding:
Wespeaker baselines for VoxSRC2023. CoRR abs/2306.15161 (2023) - [i15]Zhengyang Chen, Bing Han, Shuai Wang, Yanmin Qian:
Attention-based Encoder-Decoder End-to-End Neural Diarization with Embedding Enhancer. CoRR abs/2309.06672 (2023) - [i14]Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li:
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech. CoRR abs/2309.08408 (2023) - [i13]Junyi Ao, Mehmet Sinan Yildirim, Meng Ge, Shuai Wang, Ruijie Tao, Yanmin Qian, Liqun Deng, Longshuai Xiao, Haizhou Li:
USED: Universal Speaker Extraction and Diarization. CoRR abs/2309.10674 (2023) - [i12]Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li:
Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition. CoRR abs/2309.11730 (2023) - [i11]Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang:
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data. CoRR abs/2309.13905 (2023) - [i10]Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Shuai Wang, Jixun Yao, Lei Xie, Mengxiao Bi:
DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion. CoRR abs/2309.15496 (2023) - [i9]Meng Ge, Yizhou Peng, Yidi Jiang, Jingru Lin, Junyi Ao, Mehmet Sinan Yildirim, Shuai Wang, Haizhou Li, Mengling Feng:
The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge. CoRR abs/2312.16002 (2023) - 2022
- [c34]Aiwen Deng, Shuai Wang, Wenxiong Kang, Feiqi Deng:
On the Importance of Different Frequency Bins for Speaker Verification. ICASSP 2022: 7537-7541 - [c33]Bei Liu, Haoyu Wang, Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Knowledge Distillation via Feature Enhancement for Speaker Verification. ICASSP 2022: 7542-7546 - [c32]Bei Liu, Zhengyang Chen, Shuai Wang, Haoyu Wang, Bing Han, Yanmin Qian:
DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design. INTERSPEECH 2022: 296-300 - [c31]Jinchao Li, Shuai Wang, Yang Chao, Xunying Liu, Helen Meng:
Context-aware Multimodal Fusion for Emotion Recognition. INTERSPEECH 2022: 2013-2017 - [i8]Hongji Wang, Chengdong Liang, Shuai Wang, Zhengyang Chen, Binbin Zhang, Xu Xiang
, Yanlei Deng, Yanmin Qian:
Wespeaker: A Research and Production oriented Speaker Embedding Learning Toolkit. CoRR abs/2210.17016 (2022) - 2021
- [j8]Yanmin Qian
, Zhengyang Chen
, Shuai Wang
:
Audio-Visual Deep Neural Network for Robust Person Verification. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1079-1092 (2021) - [j7]Heinrich Dinkel
, Shuai Wang
, Xuenan Xu, Mengyue Wu
, Kai Yu
:
Voice Activity Detection in the Wild: A Data-Driven Approach Using Teacher-Student Training. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1542-1555 (2021) - [c30]Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification. ICASSP 2021: 5834-5838 - [c29]Chenpeng Du
, Bing Han, Shuai Wang, Yanmin Qian, Kai Yu:
SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification. ICASSP 2021: 5844-5848 - [c28]Houjun Huang, Xu Xiang
, Fei Zhao, Shuai Wang, Yanmin Qian:
Unit Selection Synthesis Based Data Augmentation for Fixed Phrase Speaker Verification. ICASSP 2021: 5849-5853 - [c27]Yufei Liu, Chengzhu Yu, Shuai Wang, Zhenchuan Yang, Yang Chao, Weibin Zhang:
Non-Parallel Any-to-Many Voice Conversion by Replacing Speaker Statistics. Interspeech 2021: 1369-1373 - [c26]Xun Gong, Zhengyang Chen, Yexin Yang, Shuai Wang, Lan Wang, Yanmin Qian:
Speaker Embedding Augmentation with Noise Distribution Matching. ISCSLP 2021: 1-5 - [c25]Shuai Wang, Yexin Yang, Yanmin Qian, Kai Yu:
Revisiting the Statistics Pooling Layer in Deep Speaker Embedding Learning. ISCSLP 2021: 1-5 - [i7]Houjun Huang, Xu Xiang, Fei Zhao, Shuai Wang, Yanmin Qian:
Unit selection synthesis based data augmentation for fixed phrase speaker verification. CoRR abs/2102.09817 (2021) - [i6]Heinrich Dinkel, Shuai Wang, Xuenan Xu, Mengyue Wu, Kai Yu:
Voice activity detection in the wild: A data-driven approach using teacher-student training. CoRR abs/2105.04065 (2021) - [i5]Zhengyang Chen, Shuai Wang, Yanmin Qian:
Self-Supervised Learning Based Domain Adaptation for Robust Speaker Verification. CoRR abs/2108.13843 (2021) - 2020
- [j6]Shuai Wang
, Yexin Yang
, Zhanghao Wu
, Yanmin Qian
, Kai Yu
:
Data Augmentation Using Deep Generative Models for Embedding Based Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2598-2609 (2020) - [c24]Yexin Yang, Shuai Wang, Xun Gong, Yanmin Qian, Kai Yu:
Text Adaptation for Speaker Verification with Speaker-Text Factorized Embeddings. ICASSP 2020: 6454-6458 - [c23]Mireia Díez, Lukás Burget
, Federico Landini, Shuai Wang, Honza Cernocký:
Optimizing Bayesian Hmm Based X-Vector Clustering for the Second Dihard Speech Diarization Challenge. ICASSP 2020: 6519-6523 - [c22]Federico Landini
, Shuai Wang, Mireia Díez, Lukás Burget
, Pavel Matejka, Katerina Zmolíková
, Ladislav Mosner, Anna Silnova, Oldrich Plchot, Ondrej Novotný, Hossein Zeinali, Johan Rohdin
:
But System for the Second Dihard Speech Diarization Challenge. ICASSP 2020: 6529-6533 - [c21]Zhengyang Chen, Shuai Wang, Yanmin Qian, Kai Yu:
Channel Invariant Speaker Embedding Learning with Joint Multi-Task and Adversarial Training. ICASSP 2020: 6574-6578 - [c20]Shuai Wang, Johan Rohdin
, Oldrich Plchot, Lukás Burget
, Kai Yu, Jan Cernocký
:
Investigation of Specaugment for Deep Speaker Embedding Learning. ICASSP 2020: 7139-7143 - [c19]Hongji Wang, Heinrich Dinkel, Shuai Wang, Yanmin Qian, Kai Yu:
Dual-Adversarial Domain Adaptation for Generalized Replay Attack Detection. INTERSPEECH 2020: 1086-1090 - [c18]Zhengyang Chen, Shuai Wang, Yanmin Qian:
Multi-Modality Matters: A Performance Leap on VoxCeleb. INTERSPEECH 2020: 2252-2256 - [c17]Zhengyang Chen, Shuai Wang, Yanmin Qian:
Adversarial Domain Adaptation for Speaker Verification Using Partially Shared Network. INTERSPEECH 2020: 3017-3021 - [c16]Jahangir Alam, Gilles Boulianne, Lukás Burget, Mohamed Dahmane, Mireia Díez Sánchez, Alicia Lozano-Diez, Ondrej Glembek, Pierre-Luc St-Charles, Marc Lalonde, Pavel Matejka, Petr Mizera, João Monteiro, Ladislav Mosner, Cedric Noiseux, Ondrej Novotný, Oldrich Plchot, Johan Rohdin, Anna Silnova, Josef Slavícek, Themos Stafylakis, Shuai Wang, Hossein Zeinali:
Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. Odyssey 2020: 289-295 - [i4]Yefei Chen, Shuai Wang, Yanmin Qian, Kai Yu:
End-to-End Speaker-Dependent Voice Activity Detection. CoRR abs/2009.09906 (2020)
2010 – 2019
- 2019
- [j5]Yanmin Qian
, Chao Weng, Xuankai Chang, Shuai Wang, Dong Yu:
Erratum to: Past review, current progress, and challenges ahead on the cocktail party problem. Frontiers Inf. Technol. Electron. Eng. 20(3): 438 (2019) - [j4]Shuai Wang
, Zili Huang, Yanmin Qian
, Kai Yu
:
Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 27(11): 1686-1696 (2019) - [c15]Xu Xiang
, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu:
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition. APSIPA 2019: 1652-1656 - [c14]Shuai Wang, Yexin Yang, Tianzhe Wang, Yanmin Qian, Kai Yu:
Knowledge Distillation for Small Foot-print Deep Speaker Embedding. ICASSP 2019: 6021-6025 - [c13]Mireia Díez, Lukás Burget
, Shuai Wang, Johan Rohdin, Jan Cernocký:
Bayesian HMM Based x-Vector Clustering for Speaker Diarization. INTERSPEECH 2019: 346-350 - [c12]Yexin Yang, Hongji Wang, Heinrich Dinkel, Zhengyang Chen, Shuai Wang, Yanmin Qian, Kai Yu:
The SJTU Robust Anti-Spoofing System for the ASVspoof 2019 Challenge. INTERSPEECH 2019: 1038-1042 - [c11]Shuai Wang, Johan Rohdin, Lukás Burget
, Oldrich Plchot, Yanmin Qian, Kai Yu, Jan Cernocký
:
On the Usage of Phonetic Information for Text-Independent Speaker Embedding Extraction. INTERSPEECH 2019: 1148-1152 - [c10]Zhanghao Wu, Shuai Wang, Yanmin Qian, Kai Yu:
Data Augmentation Using Variational Autoencoder for Embedding Based Speaker Verification. INTERSPEECH 2019: 1163-1167 - [c9]Hongji Wang, Heinrich Dinkel, Shuai Wang, Yanmin Qian, Kai Yu:
Cross-Domain Replay Spoofing Attack Detection Using Domain Adversarial Training. INTERSPEECH 2019: 2938-2942 - [i3]Xu Xiang, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu:
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition. CoRR abs/1906.07317 (2019) - [i2]Hossein Zeinali, Shuai Wang, Anna Silnova, Pavel Matejka, Oldrich Plchot:
BUT System Description to VoxCeleb Speaker Recognition Challenge 2019. CoRR abs/1910.12592 (2019) - 2018
- [j3]Yanmin Qian
, Chao Weng, Xuankai Chang, Shuai Wang, Dong Yu:
Past review, current progress, and challenges ahead on the cocktail party problem. Frontiers Inf. Technol. Electron. Eng. 19(1): 40-63 (2018) - [j2]Yanmin Qian
, Chao Weng, Xuankai Chang, Shuai Wang, Dong Yu:
Erratum to: Past review, current progress, and challenges ahead on the cocktail party problem. Frontiers Inf. Technol. Electron. Eng. 19(4): 582 (2018) - [c8]Zili Huang, Shuai Wang, Yanmin Qian:
Joint I-Vector with End-to-End System for Short Duration Text-Independent Speaker Verification. ICASSP 2018: 4869-4873 - [c7]Shuai Wang, Yanmin Qian, Kai Yu:
Focal Kl-Divergence Based Dilated Convolutional Neural Networks for Co-Channel Speaker Identification. ICASSP 2018: 5339-5343 - [c6]Zili Huang, Shuai Wang, Kai Yu:
Angular Softmax for Short-Duration Text-independent Speaker Verification. INTERSPEECH 2018: 3623-3627 - [c5]Shuai Wang, Heinrich Dinkel, Yanmin Qian, Kai Yu:
Covariance Based Deep Feature for Text-Dependent Speaker Verification. IScIDE 2018: 231-242 - [c4]Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu:
Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition. ISCSLP 2018: 195-199 - [c3]Yexin Yang, Shuai Wang, Man Sun, Yanmin Qian, Kai Yu:
Generative Adversarial Networks based X-vector Augmentation for Robust Probabilistic Linear Discriminant Analysis in Speaker Verification. ISCSLP 2018: 205-209 - [i1]Shuai Wang, Zili Huang, Yanmin Qian, Kai Yu:
Deep Discriminant Analysis for i-vector Based Robust Speaker Recognition. CoRR abs/1805.01344 (2018) - 2017
- [c2]Xiaowei Jiang, Shuai Wang, Xu Xiang
, Yanmin Qian:
Integrating online i-vector into GMM-UBM for text-dependent speaker verification. APSIPA 2017: 1628-1632 - [c1]Shuai Wang, Yanmin Qian, Kai Yu:
What Does the Speaker Embedding Encode? INTERSPEECH 2017: 1497-1501 - 2012
- [j1]Yizhong Zhang, Huamin Wang
, Shuai Wang, Yiying Tong, Kun Zhou:
A Deformable Surface Model for Real-Time Water Drop Animation. IEEE Trans. Vis. Comput. Graph. 18(8): 1281-1289 (2012)