- Zhiyuan Peng, Xuanji He, Ke Ding, Tan Lee
, Guanglu Wan:
Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition. ISCSLP 2022: 324-328 - Chunyu Qiang, Peng Yang, Hao Che, Xiaorui Wang, Zhongyuan Wang:
Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis. ISCSLP 2022: 61-65 - Ying Qin, Tan Lee
, Anthony Pak-Hin Kong, Feng Lin:
Aphasia Detection for Cantonese-Speaking and Mandarin-Speaking Patients Using Pre-Trained Language Models. ISCSLP 2022: 359-363 - Bowen Qu, Chenda Li, Jinfeng Bai, Yanmin Qian:
Improving Speech Separation with Knowledge Distilled from Self-supervised Pre-trained Models. ISCSLP 2022: 329-333 - Binbin Shen, Jian Luan, Shengyan Zhang, Quanbo Shen, Yujun Wang:
J-TranPSP: A Joint Transition-based Model for Prosodic Structure Prediction, Word Segmentation and PoS Tagging. ISCSLP 2022: 280-284 - Peiyang Shi, Zengqiang Shang, Pengyuan Zhang:
A Mandarin Prosodic Boundary Prediction Model Based on Multi-Source Semi-Supervision. ISCSLP 2022: 285-289 - Kun Song, Jian Cong, Xinsheng Wang, Yongmao Zhang, Lei Xie, Ning Jiang, Haiying Wu:
Robust MelGAN: A robust universal neural vocoder for high-fidelity TTS. ISCSLP 2022: 71-75 - Kun Song, Heyang Xue, Xinsheng Wang, Jian Cong, Yongmao Zhang, Lei Xie, Bing Yang, Xiong Zhang, Dan Su:
AdaVITS: Tiny VITS for Low Computing Resource Speaker Adaptation. ISCSLP 2022: 319-323 - Yujia Sun, Bing Ge, Bo Chen, Zhen Fu, Jinxin He, Hongwei Gao, Xue Wang:
The FawAI ASR System for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge. ISCSLP 2022: 512-516 - Daxin Tan, Liqun Deng, Nianzu Zheng, Yu Ting Yeung, Xin Jiang, Xiao Chen, Tan Lee
:
CorrectSpeech: A Fully Automated System for Speech Correction and Accent Reduction. ISCSLP 2022: 81-85 - Jian Tang, Shaofei Xue:
Multi-Resolution Stacked 1D-CNN for Small-Footprint keyword Spotting with Two-Stage Detection. ISCSLP 2022: 310-314 - Dehua Tao, Harold Chui, Sarah Luk, Tan Lee
:
CUEMPATHY: A Counseling Speech Dataset for Psychotherapy Research. ISCSLP 2022: 354-358 - Wen-Yuan Ting, Syu-Siang Wang
, Hsin-Li Chang, Borching Su, Yu Tsao:
Speech Enhancement Based on CycleGAN with Noise-informed Training. ISCSLP 2022: 155-159 - Shu-Fen Tsai, Shih-Chan Kuo, Ren-Yuan Lyu, Jyh-Shing Roger Jang:
Ensemble And Re-Ranking Based On Language Models To Improve ASR. ISCSLP 2022: 185-189 - Kimiko Tsukada, Yurong Yurong, Badmaavanchin Munguntsetseg:
Bilingual Advantage? Perception of the Japanese Consonant Length Contrast by Monolingual vs Bilingual Speakers of Mongolian. ISCSLP 2022: 200-204 - Chenxi Wang, Hang Chen, Jun Du, Baocai Yin, Jia Pan:
Multi-Task Joint Learning for Embedding Aware Audio-Visual Speech Enhancement. ISCSLP 2022: 255-259 - Qing Wang
, Hang Chen, Ya Jiang, Zhe Wang, Yuyang Wang, Jun Du, Chin-Hui Lee:
Deep Learning Based Audio-Visual Multi-Speaker DOA Estimation Using Permutation-Free Loss Function. ISCSLP 2022: 250-254 - Qing Wang
, Jun Du, Siyuan Zheng, Yunqing Li, Yajian Wang, Yuzhong Wu, Hu Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Yannan Wang, Chin-Hui Lee:
A Study on Joint Modeling and Data Augmentation of Multi-Modalities for Audio-Visual Scene Classification. ISCSLP 2022: 453-457 - Yue Wang, Wen Liu:
Acoustic and Perceptual Study of Tones in Jin Chinese (Togtoh variety). ISCSLP 2022: 190-194 - Yao-Ting Wang, Yi-Xing Lin, Kai-Wen Liang, Tzu-Chiang Tai, Jia-Ching Wang:
Lightweight End-To-End Deep Learning Model For Music Source Separation. ISCSLP 2022: 315-318 - Yikang Wang, Xingming Wang, Hiromitsu Nishizaki, Ming Li:
Low Pass Filtering and Bandwidth Extension for Robust Anti-spoofing Countermeasure Against Codec Variabilities. ISCSLP 2022: 438-442 - Lei Wang, Benedict Yeoh, Jun Wah Ng:
Synthetic Voice Detection and Audio Splicing Detection using SE-Res2Net-Conformer Architecture. ISCSLP 2022: 115-119 - Wei Wang, Wangyou Zhang, Shaoxiong Lin, Yanmin Qian:
Text-Informed Knowledge Distillation for Robust Speech Enhancement and Recognition. ISCSLP 2022: 334-338 - Haoyu Wang, Wei-Qiang Zhang, Hongbin Suo, Yulong Wan:
Multilingual Zero Resource Speech Recognition Base on Self-Supervise Pre-Trained Acoustic Models. ISCSLP 2022: 11-15 - Qicong Xie, Tao Li, Xinsheng Wang, Zhichao Wang, Lei Xie, Guoqiao Yu, Guanglu Wan:
Multi-speaker Multi-style Text-to-speech Synthesis with Single-speaker Single-style Training Data Scenarios. ISCSLP 2022: 66-70 - Qicong Xie, Shan Yang, Yi Lei, Lei Xie, Dan Su:
End-to-End Voice Conversion with Information Perturbation. ISCSLP 2022: 91-95 - Min Xu, Jing Shao
, Hongwei Ding, Lan Wang:
Acoustic-perceptual correlates of whispered Mandarin consonants. ISCSLP 2022: 195-199 - Jinlong Xue, Yayue Deng, Yichen Han, Ya Li, Jianqing Sun, Jiaen Liang:
ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis. ISCSLP 2022: 230-234 - Shaofei Xue, Jian Tang, Yazhu Liu:
Improving Speech Recognition with Augmented Synthesized Data and Conditional Model Training. ISCSLP 2022: 443-447 - Yuhan Yan, Shanpeng Li
, Ying Chen:
In-group Advantage for Chinese and English Emotional Prosody in Quiet and Noise Conditions. ISCSLP 2022: 305-309