Yingming Gao
2020 – today
- 2024
- [j4] Yingming Gao, Peter Birkholz, Ya Li:
Articulatory Copy Synthesis Based on the Speech Synthesizer VocalTractLab and Convolutional Recurrent Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1845-1858 (2024)
- [c25] Yayue Deng, Jinlong Xue, Yukang Jia, Qifei Li, Yichen Han, Fengping Wang, Yingming Gao, Dengfeng Ke, Ya Li:
Concss: Contrastive-based Context Comprehension for Dialogue-Appropriate Prosody in Conversational Speech Synthesis. ICASSP 2024: 10706-10710
- [c24] Qifei Li, Yingming Gao, Cong Wang, Yayue Deng, Jinlong Xue, Yichen Han, Ya Li:
Frame-Level Emotional State Alignment Method for Speech Emotion Recognition. ICASSP 2024: 11486-11490
- [i13] Jinlong Xue, Yayue Deng, Yingming Gao, Ya Li:
Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation. CoRR abs/2401.01044 (2024)
- [i12] Jinlong Xue, Yayue Deng, Yichen Han, Yingming Gao, Ya Li:
Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model. CoRR abs/2406.03706 (2024)
- [i11] Jinlong Xue, Yayue Deng, Yingming Gao, Ya Li:
Retrieval Augmented Generation in Prompt-based Text-to-Speech Synthesis with Context-Aware Contrastive Language-Audio Pretraining. CoRR abs/2406.03714 (2024)
- [i10] Bingsong Bai, Fengping Wang, Yingming Gao, Ya Li:
SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion. CoRR abs/2406.05692 (2024)
- [i9] Ruibo Fu, Rui Liu, Chunyu Qiang, Yingming Gao, Yi Lu, Shuchen Shi, Tao Wang, Ya Li, Zhengqi Wen, Chen Zhang, Hui Bu, Yukun Liu, Xin Qi, Guanjun Li:
ICAGC 2024: Inspirational and Convincing Audio Generation Challenge 2024. CoRR abs/2407.12038 (2024)
- [i8] Qifei Li, Yingming Gao, Yuhua Wen, Cong Wang, Ya Li:
Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition. CoRR abs/2408.09438 (2024)
- 2023
- [c23] Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, Jianhua Tao, Jianqing Sun, Jiaen Liang:
M2-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech Synthesis. ICASSP 2023: 1-5
- [c22] Cong Wang, Yingming Gao, Ya Li, Man Zhang:
GaitParse: Gait Parsing Algorithm with Self-Supervised Fine-Tuning for Gait Recognition. ICCIP 2023: 85-92
- [c21] Dong Wang, Qifei Li, Yingming Gao, Yong Liu, Ya Li:
Exploring the interpretability in speech-based adolescent depression detection by SHAP. ICCIP 2023: 562-567
- [c20] Qifei Li, Dong Wang, Yiming Ren, Yingming Gao, Ya Li:
FTA-net: A Frequency and Time Attention Network for Speech Depression Detection. INTERSPEECH 2023: 1723-1727
- [c19] Ruishan Li, Yingming Gao, Yanlu Xie, Dengfeng Ke, Jinsong Zhang:
Dual Audio Encoders Based Mandarin Prosodic Boundary Prediction by Using Multi-Granularity Prosodic Representations. INTERSPEECH 2023: 4793-4797
- [c18] Yayue Deng, Jinlong Xue, Fengping Wang, Yingming Gao, Ya Li:
CMCU-CSS: Enhancing Naturalness via Commonsense-based Multi-modal Context Understanding in Conversational Speech Synthesis. ACM Multimedia 2023: 6081-6089
- [c17] Qifei Li, Yingming Gao, Ya Li:
Mining High-quality Samples from Raw Data and Majority Voting Method for Multimodal Emotion Recognition. ACM Multimedia 2023: 9546-9550
- [i7] Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, Jianhua Tao, Jianqing Sun, Jiaen Liang:
M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis. CoRR abs/2305.02269 (2023)
- [i6] Linkai Peng, Baorian Nuchged, Yingming Gao:
Spoken Language Intelligence of Large Language Models for Language Learning. CoRR abs/2308.14536 (2023)
- [i5] Yayue Deng, Jinlong Xue, Yukang Jia, Qifei Li, Yichen Han, Fengping Wang, Yingming Gao, Dengfeng Ke, Ya Li:
CONCSS: Contrastive-based Context Comprehension for Dialogue-appropriate Prosody in Conversational Speech Synthesis. CoRR abs/2312.10358 (2023)
- [i4] Qifei Li, Yingming Gao, Cong Wang, Yayue Deng, Jinlong Xue, Yichen Han, Ya Li:
Frame-level emotional state alignment method for speech emotion recognition. CoRR abs/2312.16383 (2023)
- 2022
- [b1] Yingming Gao:
Articulatory Copy Synthesis Based on the Speech Synthesizer VocalTractLab. Dresden University of Technology, Germany, 2022
- [j3] Simon Stone, Yingming Gao, Peter Birkholz:
Articulatory Synthesis of Vocalized /r/ Allophones in German. IEEE ACM Trans. Audio Speech Lang. Process. 30: 879-889 (2022)
- [c16] Rian Bao, Linkai Peng, Yingming Gao, Jinsong Zhang:
The Importance of Lexical Tone for Sentence Understanding: Utilizing Functional Load Principle to Simulate Comprehension Process. IALP 2022: 379-383
- [c15] Jingwen Cheng, Yuchen Yan, Yingming Gao, Xiaoli Feng, Yannan Wang, Jinsong Zhang:
A study of production error analysis for Mandarin-speaking Children with Hearing Impairment. INTERSPEECH 2022: 4840-4844
- [c14] Jingwen Cheng, Yingming Gao, Yuchen Yan, Xiaoli Feng, Binghuai Lin, Jinsong Zhang:
The Disyllabic Tone Production and Tone Context Effect in Mandarin-speaking Children with Cochlear Implants. ISCSLP 2022: 51-55
- [c13] Rian Bao, Linkai Peng, Yingming Gao, Jinsong Zhang:
The Contribution of Phonological and Fluency Factors to Chinese L2 Comprehensibility Ratings: A Case Study of Urdu-speaking Learners. ISCSLP 2022: 394-398
- [c12] Xiaoli Feng, Yingming Gao, Jinsong Zhang, Yanchun Cao:
An Entropy-based Study on the Acquisition of Mandarin Initial Consonants by Korean Learners. ISCSLP 2022: 414-418
- [c11] Yichen Han, Ya Li, Yingming Gao, Jinlong Xue, Songpo Wang, Lei Yang:
A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis. MMSP 2022: 1-6
- [i3] Linkai Peng, Yingming Gao, Binghuai Lin, Dengfeng Ke, Yanlu Xie, Jinsong Zhang:
Text-Aware End-to-end Mispronunciation Detection and Diagnosis. CoRR abs/2206.07289 (2022)
- [i2] Yichen Han, Ya Li, Yingming Gao, Jinlong Xue, Songpo Wang, Lei Yang:
A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis. CoRR abs/2210.03335 (2022)
- 2021
- [c10] Wenjie Peng, Yingming Gao, Binghuai Lin, Jinsong Zhang:
A Practical Way to Improve Automatic Phonetic Segmentation Performance. ISCSLP 2021: 1-5
- 2020
- [j2] Ju Lin, Yingming Gao, Wei Zhang, Linxuan Wei, Yanlu Xie, Jinsong Zhang:
Improving Pronunciation Erroneous Tendency Detection with Multi-Model Soft Targets. J. Signal Process. Syst. 92(8): 793-803 (2020)
- [c9] Wang Dai, Jinsong Zhang, Yingming Gao, Wei Wei, Dengfeng Ke, Binghuai Lin, Yanlu Xie:
Formant Tracking Using Dilated Convolutional Networks Through Dense Connection with Gating Mechanism. INTERSPEECH 2020: 150-154
- [c8] Yingming Gao, Xinyu Zhang, Yi Xu, Jinsong Zhang, Peter Birkholz:
An Investigation of the Target Approximation Model for Tone Modeling and Recognition in Continuous Mandarin Speech. INTERSPEECH 2020: 1913-1917
- [i1] Wang Dai, Jinsong Zhang, Yingming Gao, Wei Wei, Dengfeng Ke, Binghuai Lin, Yanlu Xie:
Formant Tracking Using Dilated Convolutional Networks Through Dense Connection with Gating Mechanism. CoRR abs/2005.10803 (2020)
2010 – 2019
- 2019
- [c7] Yuanqi Li, Yingming Gao, Ling Yu, Bao Liu, Long Huang, Yingjie Zhang, Juqian Li, Xiaoyang He:
Research on Illumination Estimation Based on Data Fitting. GreeNets 2019: 76-82
- [c6] Yingming Gao, Simon Stone, Peter Birkholz:
Articulatory Copy Synthesis Based on a Genetic Algorithm. INTERSPEECH 2019: 3770-3774
- 2018
- [j1] Ju Lin, Wei Li, Yingming Gao, Yanlu Xie, Nancy F. Chen, Sabato Marco Siniscalchi, Jinsong Zhang, Chin-Hui Lee:
Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks. J. Signal Process. Syst. 90(7): 1077-1087 (2018)
- [c5] Yingming Gao, Peter Birkholz:
Speaking Rate Changes Affect Phone Durations Differently for Neutral and Emotional Speech. EUSIPCO 2018: 2070-2074
- 2017
- [c4] Longfei Yang, Yanlu Xie, Yingming Gao, Jinsong Zhang:
Improving pronunciation erroneous tendency detection with convolutional long short-term memory. IALP 2017: 52-56
- 2016
- [c3] Yingming Gao, Yanlu Xie, Ju Lin, Jinsong Zhang:
DNN based detection of pronunciation erroneous tendency in data sparse condition. APSIPA 2016: 1-5
- [c2] Ju Lin, Yanlu Xie, Yingming Gao, Jinsong Zhang:
Improving Mandarin tone recognition based on DNN by combining acoustic and articulatory features. ISCSLP 2016: 1-5
- 2015
- [c1] Yingming Gao, Yanlu Xie, Wen Cao, Jinsong Zhang:
A study on robust detection of pronunciation erroneous tendency based on deep neural network. INTERSPEECH 2015: 693-696
last updated on 2024-10-07 22:20 CEST by the dblp team
all metadata released as open data under CC0 1.0 license