default search action

combined dblp search
author search
venue search
publication search

ask others

Yingming Gao

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/GaoBL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GaoBL24
Yingming Gao, Peter Birkholz, Ya Li:
Articulatory Copy Synthesis Based on the Speech Synthesizer VocalTractLab and Convolutional Recurrent Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1845-1858 (2024)
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DengXJLHWGKL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DengXJLHWGKL24
Yayue Deng, Jinlong Xue, Yukang Jia, Qifei Li, Yichen Han, Fengping Wang, Yingming Gao, Dengfeng Ke, Ya Li:
Concss: Contrastive-based Context Comprehension for Dialogue-Appropriate Prosody in Conversational Speech Synthesis. ICASSP 2024: 10706-10710
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiGWDXHL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiGWDXHL24
Qifei Li, Yingming Gao, Cong Wang, Yayue Deng, Jinlong Xue, Yichen Han, Ya Li:
Frame-Level Emotional State Alignment Method for Speech Emotion Recognition. ICASSP 2024: 11486-11490
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-01044
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-01044
Jinlong Xue, Yayue Deng, Yingming Gao, Ya Li:
Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation. CoRR abs/2401.01044 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-03706
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-03706
Jinlong Xue, Yayue Deng, Yichen Han, Yingming Gao, Ya Li:
Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model. CoRR abs/2406.03706 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-03714
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-03714
Jinlong Xue, Yayue Deng, Yingming Gao, Ya Li:
Retrieval Augmented Generation in Prompt-based Text-to-Speech Synthesis with Context-Aware Contrastive Language-Audio Pretraining. CoRR abs/2406.03714 (2024)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-05692
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-05692
Bingsong Bai, Fengping Wang, Yingming Gao, Ya Li:
SPA-SVC: Self-supervised Pitch Augmentation for Singing Voice Conversion. CoRR abs/2406.05692 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-12038
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-12038
Ruibo Fu, Rui Liu, Chunyu Qiang, Yingming Gao, Yi Lu, Shuchen Shi, Tao Wang, Ya Li, Zhengqi Wen, Chen Zhang, Hui Bu, Yukun Liu, Xin Qi, Guanjun Li:
ICAGC 2024: Inspirational and Convincing Audio Generation Challenge 2024. CoRR abs/2407.12038 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-09438
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-09438
Qifei Li, Yingming Gao, Yuhua Wen, Cong Wang, Ya Li:
Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition. CoRR abs/2408.09438 (2024)
2023
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XueDWLGTSL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XueDWLGTSL23
Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, Jianhua Tao, Jianqing Sun, Jiaen Liang:
M²-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech Synthesis. ICASSP 2023: 1-5
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/iccip/WangGLZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccip/WangGLZ23
Cong Wang, Yingming Gao, Ya Li, Man Zhang:
GaitParse: Gait Parsing Algorithm with Self-Supervised Fine-Tuning for Gait Recognition. ICCIP 2023: 85-92
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/iccip/WangLGLL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccip/WangLGLL23
Dong Wang, Qifei Li, Yingming Gao, Yong Liu, Ya Li:
Exploring the interpretability in speech-based adolescent depression detection by SHAP. ICCIP 2023: 562-567
[c20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiWRGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiWRGL23
Qifei Li, Dong Wang, Yiming Ren, Yingming Gao, Ya Li:
FTA-net: A Frequency and Time Attention Network for Speech Depression Detection. INTERSPEECH 2023: 1723-1727
[c19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiGXK023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiGXK023
Ruishan Li, Yingming Gao, Yanlu Xie, Dengfeng Ke, Jinsong Zhang:
Dual Audio Encoders Based Mandarin Prosodic Boundary Prediction by Using Multi-Granularity Prosodic Representations. INTERSPEECH 2023: 4793-4797
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/DengXWGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/DengXWGL23
Yayue Deng, Jinlong Xue, Fengping Wang, Yingming Gao, Ya Li:
CMCU-CSS: Enhancing Naturalness via Commonsense-based Multi-modal Context Understanding in Conversational Speech Synthesis. ACM Multimedia 2023: 6081-6089
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiGL23
Qifei Li, Yingming Gao, Ya Li:
Mining High-quality Samples from Raw Data and Majority Voting Method for Multimodal Emotion Recognition. ACM Multimedia 2023: 9546-9550
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-02269
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-02269
Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, Jianhua Tao, Jianqing Sun, Jiaen Liang:
M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis. CoRR abs/2305.02269 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-14536
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-14536
Linkai Peng, Baorian Nuchged, Yingming Gao:
Spoken Language Intelligence of Large Language Models for Language Learning. CoRR abs/2308.14536 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-10358
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-10358
Yayue Deng, Jinlong Xue, Yukang Jia, Qifei Li, Yichen Han, Fengping Wang, Yingming Gao, Dengfeng Ke, Ya Li:
CONCSS: Contrastive-based Context Comprehension for Dialogue-appropriate Prosody in Conversational Speech Synthesis. CoRR abs/2312.10358 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-16383
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-16383
Qifei Li, Yingming Gao, Cong Wang, Yayue Deng, Jinlong Xue, Yichen Han, Ya Li:
Frame-level emotional state alignment method for speech emotion recognition. CoRR abs/2312.16383 (2023)
2022
[b1]
- view
  - electronic edition @ nbn-resolving.org
  - details & citations
  authority control:
- export record
  dblp key:
  - phd/dnb/Gao22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/dnb/Gao22
Yingming Gao:
Articulatory Copy Synthesis Based on the Speech Synthesizer VocalTractLab. Dresden University of Technology, Germany, 2022
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/StoneGB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/StoneGB22
Simon Stone, Yingming Gao, Peter Birkholz:
Articulatory Synthesis of Vocalized /r/ Allophones in German. IEEE ACM Trans. Audio Speech Lang. Process. 30: 879-889 (2022)
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/BaoPGZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/BaoPGZ22
Rian Bao, Linkai Peng, Yingming Gao, Jinsong Zhang:
The Importance of Lexical Tone for Sentence Understanding: Utilizing Functional Load Principle to Simulate Comprehension Process. IALP 2022: 379-383
[c15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChengYGFW022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChengYGFW022
Jingwen Cheng, Yuchen Yan, Yingming Gao, Xiaoli Feng, Yannan Wang, Jinsong Zhang:
A study of production error analysis for Mandarin-speaking Children with Hearing Impairment. INTERSPEECH 2022: 4840-4844
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ChengGYFLZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ChengGYFLZ22
Jingwen Cheng, Yingming Gao, Yuchen Yan, Xiaoli Feng, Binghuai Lin, Jinsong Zhang:
The Disyllabic Tone Production and Tone Context Effect in Mandarin-speaking Children with Cochlear Implants. ISCSLP 2022: 51-55
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/BaoPGZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/BaoPGZ22
Rian Bao, Linkai Peng, Yingming Gao, Jinsong Zhang:
The Contribution of Phonological and Fluency Factors to Chinese L2 Comprehensibility Ratings: A Case Study of Urdu-speaking Learners. ISCSLP 2022: 394-398
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/FengGZC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/FengGZC22
Xiaoli Feng, Yingming Gao, Jinsong Zhang, Yanchun Cao:
An Entropy-based Study on the Acquisition of Mandarin Initial Consonants by Korean Learners. ISCSLP 2022: 414-418
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/mmsp/HanLGXWY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmsp/HanLGXWY22
Yichen Han, Ya Li, Yingming Gao, Jinlong Xue, Songpo Wang, Lei Yang:
A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis. MMSP 2022: 1-6
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-07289
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-07289
Linkai Peng, Yingming Gao, Binghuai Lin, Dengfeng Ke, Yanlu Xie, Jinsong Zhang:
Text-Aware End-to-end Mispronunciation Detection and Diagnosis. CoRR abs/2206.07289 (2022)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-03335
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-03335
Yichen Han, Ya Li, Yingming Gao, Jinlong Xue, Songpo Wang, Lei Yang:
A Keypoint Based Enhancement Method for Audio Driven Free View Talking Head Synthesis. CoRR abs/2210.03335 (2022)
2021
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/PengGLZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/PengGLZ21
Wenjie Peng, Yingming Gao, Binghuai Lin, Jinsong Zhang:
A Practical Way to Improve Automatic Phonetic Segmentation Performance. ISCSLP 2021: 1-5
2020
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/vlsisp/LinGZWXZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/vlsisp/LinGZWXZ20
Ju Lin, Yingming Gao, Wei Zhang, Linxuan Wei, Yanlu Xie, Jinsong Zhang:
Improving Pronunciation Erroneous Tendency Detection with Multi-Model Soft Targets. J. Signal Process. Syst. 92(8): 793-803 (2020)
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DaiZGWKLX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DaiZGWKLX20
Wang Dai, Jinsong Zhang, Yingming Gao, Wei Wei, Dengfeng Ke, Binghuai Lin, Yanlu Xie:
Formant Tracking Using Dilated Convolutional Networks Through Dense Connection with Gating Mechanism. INTERSPEECH 2020: 150-154
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaoZXZB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoZXZB20
Yingming Gao, Xinyu Zhang, Yi Xu, Jinsong Zhang, Peter Birkholz:
An Investigation of the Target Approximation Model for Tone Modeling and Recognition in Continuous Mandarin Speech. INTERSPEECH 2020: 1913-1917
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-10803
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-10803
Wang Dai, Jinsong Zhang, Yingming Gao, Wei Wei, Dengfeng Ke, Binghuai Lin, Yanlu Xie:
Formant Tracking Using Dilated Convolutional Networks Through Dense Connection with Gating Mechanism. CoRR abs/2005.10803 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/greenets/LiGYLHZLH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/greenets/LiGYLHZLH19
Yuanqi Li, Yingming Gao, Ling Yu, Bao Liu, Long Huang, Yingjie Zhang, Juqian Li, Xiaoyang He:
Research on Illumination Estimation Based on Data Fitting. GreeNets 2019: 76-82
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaoSB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoSB19
Yingming Gao, Simon Stone, Peter Birkholz:
Articulatory Copy Synthesis Based on a Genetic Algorithm. INTERSPEECH 2019: 3770-3774
2018
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/vlsisp/LinLGXCSZL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/vlsisp/LinLGXCSZL18
Ju Lin, Wei Li, Yingming Gao, Yanlu Xie, Nancy F. Chen, Sabato Marco Siniscalchi, Jinsong Zhang, Chin-Hui Lee:
Improving Mandarin Tone Recognition Based on DNN by Combining Acoustic and Articulatory Features Using Extended Recognition Networks. J. Signal Process. Syst. 90(7): 1077-1087 (2018)
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/GaoB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/GaoB18
Yingming Gao, Peter Birkholz:
Speaking Rate Changes Affect Phone Durations Differently for Neutral and Emotional Speech. EUSIPCO 2018: 2070-2074
2017
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/ialp/YangXGZ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ialp/YangXGZ17
Longfei Yang, Yanlu Xie, Yingming Gao, Jinsong Zhang:
Improving pronunciation erroneous tendency detection with convolutional long short-term memory. IALP 2017: 52-56
2016
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/GaoXLZ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/GaoXLZ16
Yingming Gao, Yanlu Xie, Ju Lin, Jinsong Zhang:
DNN based detection of pronunciation erroneous tendency in data sparse condition. APSIPA 2016: 1-5
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LinXGZ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LinXGZ16
Ju Lin, Yanlu Xie, Yingming Gao, Jinsong Zhang:
Improving Mandarin tone recognition based on DNN by combining acoustic and articulatory features. ISCSLP 2016: 1-5
2015
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaoXCZ15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoXCZ15
Yingming Gao, Yanlu Xie, Wen Cao, Jinsong Zhang:
A study on robust detection of pronunciation erroneous tendency based on deep neural network. INTERSPEECH 2015: 693-696

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.