default search action
Ming Tu
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c27]Zhaoyi Lu, Wenchao Xu, Xin Xie, Ming Tu, Haozhao Wang, Cunqing Hua:
Physical Layer Overshadowing Attack on Semantic Communication System. ICC 2024: 3322-3327 - [i19]Philip Anastassiou, Zhenyu Tang, Kainan Peng, Dongya Jia, Jiaxin Li, Ming Tu, Yuping Wang, Yuxuan Wang, Mingbo Ma:
VoiceShop: A Unified Speech-to-Speech Framework for Identity-Preserving Zero-Shot Voice Editing. CoRR abs/2404.06674 (2024) - [i18]Zhaoyi Lu, Wenchao Xu, Ming Tu, Xin Xie, Cunqing Hua, Nan Cheng:
Erasing Radio Frequency Fingerprints via Active Adversarial Perturbation. CoRR abs/2406.07349 (2024) - [i17]Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li, Xiaoyang Li, Zeyang Li, Zehua Lin, Rui Liu, Shouda Liu, Lu Lu, Yizhou Lu, Jingting Ma, Shengtao Ma, Yulin Pei, Chen Shen, Tian Tan, Xiaogang Tian, Ming Tu, Bo Wang, Hao Wang, Yuping Wang, Yuxuan Wang, Hanzhang Xia, Rui Xia, Shuangyi Xie, Hongmin Xu, Meng Yang, Bihong Zhang, Jun Zhang, Wanyi Zhang, Yang Zhang, Yawei Zhang, Yijie Zheng, Ming Zou:
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition. CoRR abs/2407.04675 (2024) - 2023
- [c26]Yuanzhe Chen, Ming Tu, Tang Li, Xin Li, Qiuqiang Kong, Jiaxin Li, Zhichao Wang, Qiao Tian, Yuping Wang, Yuxuan Wang:
Streaming Voice Conversion via Intermediate Bottleneck Features and Non-Streaming Teacher Guidance. ICASSP 2023: 1-5 - [c25]Yukun Feng, Ming Tu, Rui Xia, Chuanzeng Huang, Yuxuan Wang:
Memory Augmented Lookup Dictionary Based Language Modeling for Automatic Speech Recognition. INTERSPEECH 2023: 481-485 - [c24]Siyuan Feng, Ming Tu, Rui Xia, Chuanzeng Huang, Yuxuan Wang:
Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition. INTERSPEECH 2023: 1384-1388 - [c23]Siyuan Feng, Ming Tu, Rui Xia, Chuanzeng Huang, Yuxuan Wang:
Language-universal Phonetic Encoder for Low-resource Speech Recognition. INTERSPEECH 2023: 1429-1433 - [c22]Max W. Y. Lam, Qiao Tian, Tang Li, Zongyu Yin, Siyuan Feng, Ming Tu, Yuliang Ji, Rui Xia, Mingbo Ma, Xuchen Song, Jitong Chen, Yuping Wang, Yuxuan Wang:
Efficient Neural Music Generation. NeurIPS 2023 - [i16]Yukun Feng, Ming Tu, Rui Xia, Chuanzeng Huang, Yuxuan Wang:
Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition. CoRR abs/2301.00066 (2023) - [i15]Siyuan Feng, Ming Tu, Rui Xia, Chuanzeng Huang, Yuxuan Wang:
Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition. CoRR abs/2305.11569 (2023) - [i14]Siyuan Feng, Ming Tu, Rui Xia, Chuanzeng Huang, Yuxuan Wang:
Language-universal phonetic encoder for low-resource speech recognition. CoRR abs/2305.11576 (2023) - [i13]Max W. Y. Lam, Qiao Tian, Tang Li, Zongyu Yin, Siyuan Feng, Ming Tu, Yuliang Ji, Rui Xia, Mingbo Ma, Xuchen Song, Jitong Chen, Yuping Wang, Yuxuan Wang:
Efficient Neural Music Generation. CoRR abs/2305.15719 (2023) - 2022
- [c21]Dongyang Dai, Yuanzhe Chen, Li Chen, Ming Tu, Lu Liu, Rui Xia, Qiao Tian, Yuping Wang, Yuxuan Wang:
Cloning One's Voice Using Very Limited Data in the Wild. ICASSP 2022: 8322-8326 - [i12]Yuanzhe Chen, Ming Tu, Tang Li, Xin Li, Qiuqiang Kong, Jiaxin Li, Zhichao Wang, Qiao Tian, Yuping Wang, Yuxuan Wang:
Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance. CoRR abs/2210.15158 (2022) - 2021
- [i11]Dongyang Dai, Yuanzhe Chen, Li Chen, Ming Tu, Lu Liu, Rui Xia, Qiao Tian, Yuping Wang, Yuxuan Wang:
Cloning one's voice using very limited data in the wild. CoRR abs/2110.03347 (2021) - 2020
- [j4]Dawei Hu, Gangyan Li, Guoming Zhu, Zihao Liu, Ming Tu:
Linear-Quadratic Tracking Control of a Commercial Vehicle Air Brake System. IEEE Access 8: 149741-149750 (2020) - [c20]Ming Tu, Kevin Huang, Guangtao Wang, Jing Huang, Xiaodong He, Bowen Zhou:
Select, Answer and Explain: Interpretable Multi-Hop Reading Comprehension over Multiple Documents. AAAI 2020: 9073-9080 - [c19]Zishun Feng, Ming Tu, Rui Xia, Yuxuan Wang, Ashok K. Krishnamurthy:
Self-Supervised Audio-Visual Representation Learning for in-the-wild Videos. IEEE BigData 2020: 5671-5672 - [c18]Haoqi Li, Ming Tu, Jing Huang, Shrikanth Narayanan, Panayiotis G. Georgiou:
Speaker-Invariant Affective Representation Learning via Adversarial Training. ICASSP 2020: 7144-7148 - [i10]Ming Tu, Jing Huang, Xiaodong He, Bowen Zhou:
Graph Sequential Network for Reasoning over Sequences. CoRR abs/2004.02001 (2020)
2010 – 2019
- 2019
- [j3]Mohit Shah, Ming Tu, Visar Berisha, Chaitali Chakrabarti, Andreas Spanias:
Articulation constrained learning with application to speech emotion recognition. EURASIP J. Audio Speech Music. Process. 2019: 14 (2019) - [c17]Ming Tu, Guangtao Wang, Jing Huang, Yun Tang, Xiaodong He, Bowen Zhou:
Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs. ACL (1) 2019: 2704-2713 - [i9]Ming Tu, Yun Tang, Jing Huang, Xiaodong He, Bowen Zhou:
Towards adversarial learning of speaker-invariant representation for speech emotion recognition. CoRR abs/1903.09606 (2019) - [i8]Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Héctor Delgado, Jose Patino, Qiongqiong Wang, Ling Guo, Takafumi Koshinaka, Jiacen Zhang, Koichi Shinoda, Trung Ngo Trong, Md. Sahidullah, Fan Lu, Yun Tang, Ming Tu, Kah Kuan Teh, Tran Huy Dat, Kuruvachan K. George, Ivan Kukanov, Florent Desnous, Jichen Yang, Emre Yilmaz, Longting Xu, Jean-François Bonastre, Chenglin Xu, Zhi Hao Lim, Eng Siong Chng, Shivesh Ranjan, John H. L. Hansen, Massimiliano Todisco, Nicholas W. D. Evans:
I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. CoRR abs/1904.07386 (2019) - [i7]Ming Tu, Guangtao Wang, Jing Huang, Yun Tang, Xiaodong He, Bowen Zhou:
Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs. CoRR abs/1905.07374 (2019) - [i6]Ming Tu, Jing Huang, Xiaodong He, Bowen Zhou:
Multiple instance learning with graph neural networks. CoRR abs/1906.04881 (2019) - [i5]Ming Tu, Kevin Huang, Guangtao Wang, Jing Huang, Xiaodong He, Bowen Zhou:
Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents. CoRR abs/1911.00484 (2019) - [i4]Haoqi Li, Ming Tu, Jing Huang, Shrikanth S. Narayanan, Panayiotis G. Georgiou:
Speaker-invariant Affective Representation Learning via Adversarial Training. CoRR abs/1911.01533 (2019) - 2018
- [c16]Yishan Jiao, Ming Tu, Visar Berisha, Julie Liss:
Simulating Dysarthric Speech for Training Data Augmentation in Clinical Speech Applications. ICASSP 2018: 6009-6013 - [c15]Megan M. Willi, Stephanie A. Borrie, Tyson S. Barrett, Ming Tu, Visar Berisha:
A Discriminative Acoustic-Prosodic Approach for Measuring Local Entrainment. INTERSPEECH 2018: 581-585 - [c14]Ming Tu, Anna Grabek, Julie Liss, Visar Berisha:
Investigating the Role of L1 in Automatic Pronunciation Evaluation of L2 Speech. INTERSPEECH 2018: 1636-1640 - [i3]Megan M. Willi, Stephanie A. Borrie, Tyson S. Barrett, Ming Tu, Visar Berisha:
A Discriminative Acoustic-Prosodic Approach for Measuring Local Entrainment. CoRR abs/1804.08663 (2018) - [i2]Ming Tu, Anna Grabek, Julie Liss, Visar Berisha:
Investigating the role of L1 in automatic pronunciation evaluation of L2 speech. CoRR abs/1807.01738 (2018) - 2017
- [j2]Zihan Xu, Steven Skorheim, Ming Tu, Visar Berisha, Shimeng Yu, Jae-sun Seo, Maxim Bazhenov, Yu Cao:
Improving efficiency in sparse learning with the feedforward inhibitory motif. Neurocomputing 267: 141-151 (2017) - [c13]Ming Tu, Visar Berisha, Julie Liss:
Objective assessment of pathological speech using distribution regression. ICASSP 2017: 5050-5054 - [c12]Ming Tu, Xianxian Zhang:
Speech enhancement based on Deep Neural Networks with skip connections. ICASSP 2017: 5565-5569 - [c11]Ming Tu, Visar Berisha, Julie Liss:
Interpretable Objective Assessment of Dysarthric Speech Based on Deep Neural Networks. INTERSPEECH 2017: 1849-1853 - 2016
- [c10]Ming Tu, Yishan Jiao, Visar Berisha, Julie M. Liss:
Models for objective evaluation of dysarthric speech from data annotated by multiple listeners. ACSSC 2016: 827-830 - [c9]Ming Tu, Visar Berisha, Martin Woolf, Jae-sun Seo, Yu Cao:
Ranking the parameters of deep neural networks using the fisher information. ICASSP 2016: 2647-2651 - [c8]Yishan Jiao, Ming Tu, Visar Berisha, Julie M. Liss:
Online speaking rate estimation using recurrent neural networks. ICASSP 2016: 5245-5249 - [c7]Yishan Jiao, Ming Tu, Visar Berisha, Julie M. Liss:
Accent Identification by Combining Deep Neural Networks and Recurrent Neural Networks Trained on Long and Short Term Features. INTERSPEECH 2016: 2388-2392 - [c6]Ming Tu, Visar Berisha, Yu Cao, Jae-sun Seo:
Reducing the Model Order of Deep Neural Networks Using Information Theory. ISVLSI 2016: 93-98 - [i1]Ming Tu, Visar Berisha, Yu Cao, Jae-sun Seo:
Reducing the Model Order of Deep Neural Networks Using Information Theory. CoRR abs/1605.04859 (2016) - 2015
- [j1]Yishan Jiao, Visar Berisha, Ming Tu, Julie Liss:
Convex Weighting Criteria for Speaking Rate Estimation. IEEE ACM Trans. Audio Speech Lang. Process. 23(9): 1421-1430 (2015) - [c5]Yishan Jiao, Visar Berisha, Ming Tu, Timothy Huston, Julie M. Liss:
Estimating speaking rate in spontaneous discourse. ACSSC 2015: 1189-1192 - 2014
- [c4]Yishan Jiao, Xiang Xie, Xingyu Na, Ming Tu:
Improving voice quality of HMM-based speech synthesis using voice conversion method. ICASSP 2014: 7914-7918 - [c3]Ming Tu, Xiang Xie, Xingyu Na:
Computational Auditory Scene Analysis Based Voice Activity Detection. ICPR 2014: 797-802 - [c2]Ming Tu, Xiang Xie, Yishan Jiao:
Towards improving statistical model based voice activity detection. INTERSPEECH 2014: 1549-1552 - 2012
- [c1]Kensaku Kawamoto, David Shields, Ming Tu, Jon Reid, Susan Mottice, Paul Sanders, Catherine J. Staes, Cheri Hunter, Bruce E. Bray:
OpenCDS ePHR: an Open-Source, Standards-Based Decision Support Platform for Electronic Public Health Reporting. AMIA 2012
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-04 01:21 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint