Helen M. Meng
Helen Meng – Helen Mei-Ling Meng – 蒙美玲
Person information
- unicode name: 蒙美玲
- affiliation: The Chinese University of Hong Kong
- affiliation (former): Massachusetts Institute of Technology, Cambridge, MA, USA
2010 – today
- 2019
- [c222]Shoukang Hu, Max W. Y. Lam, Xurong Xie, Shansong Liu, Jianwei Yu, Xixin Wu, Xunying Liu, Helen Meng:
Bayesian and Gaussian Process Neural Networks for Large Vocabulary Continuous Speech Recognition. ICASSP 2019: 6555-6559 - [c221]Runnan Li, Zhiyong Wu, Jia Jia, Sheng Zhao, Helen Meng:
Dilated Residual Network with Multi-head Self-attention for Speech Emotion Recognition. ICASSP 2019: 6675-6679 - [c220]Xixin Wu, Songxiang Liu, Yuewen Cao, Xu Li, Jianwei Yu, Dongyang Dai, Xi Ma, Shoukang Hu, Zhiyong Wu, Xunying Liu, Helen Meng:
Speech Emotion Recognition Using Capsule Networks. ICASSP 2019: 6695-6699 - [c219]Hui Lu, Zhiyong Wu, Runnan Li, Shiyin Kang, Jia Jia, Helen Meng:
A Compact Framework for Voice Conversion Using Wavenet Conditioned on Phonetic Posteriorgrams. ICASSP 2019: 6810-6814 - [c218]Yuewen Cao, Xixin Wu, Songxiang Liu, Jianwei Yu, Xu Li, Zhiyong Wu, Xunying Liu, Helen Meng:
End-to-end Code-switched TTS with Mix of Monolingual Recordings. ICASSP 2019: 6935-6939 - [c217]Mu Wang, Xixin Wu, Zhiyong Wu, Shiyin Kang, Deyi Tuo, Guangzhi Li, Dan Su, Dong Yu, Helen Meng:
Quasi-fully Convolutional Neural Network with Variational Inference for Speech Synthesis. ICASSP 2019: 7060-7064 - [c216]Max W. Y. Lam, Xie Chen, Shoukang Hu, Jianwei Yu, Xunying Liu, Helen Meng:
Gaussian Process LSTM Recurrent Neural Network Language Models for Speech Recognition. ICASSP 2019: 7235-7239 - [c215]Jianwei Yu, Max W. Y. Lam, Xie Chen, Shoukang Hu, Songxiang Liu, Xixin Wu, Xunying Liu, Helen Meng:
Recurrent Neural Network Language Model Training Using Natural Gradient. ICASSP 2019: 7260-7264 - [c214]Dongyang Dai, Zhiyong Wu, Runnan Li, Xixin Wu, Jia Jia, Helen Meng:
Learning Discriminative Features from Spectrograms Using Center Loss for Speech Emotion Recognition. ICASSP 2019: 7405-7409 - [c213]Wai-Kim Leung, Xunying Liu, Helen Meng:
CNN-RNN-CTC Based End-to-end Mispronunciation Detection and Diagnosis. ICASSP 2019: 8132-8136 - [c212]Runnan Li, Zhiyong Wu, Jia Jia, Yaohua Bu, Sheng Zhao, Helen Meng:
Towards Discriminative Representation Learning for Speech Emotion Recognition. IJCAI 2019: 5060-5066 - [c211]Jia Li, Yu Rong, Hong Cheng, Helen Meng, Wen-bing Huang, Junzhou Huang:
Semi-Supervised Graph Classification: A Hierarchical Graph Perspective. WWW 2019: 972-982 - [e6]Wen Gao, Helen Mei-Ling Meng, Matthew Turk, Susan R. Fussell, Björn W. Schuller, Yale Song, Kai Yu:
International Conference on Multimodal Interaction, ICMI 2019, Suzhou, China, October 14-18, 2019. ACM 2019, ISBN 978-1-4503-6860-5 [contents] - [i6]Jia Li, Yu Rong, Hong Cheng, Helen Meng, Wen-bing Huang, Junzhou Huang:
Semi-Supervised Graph Classification: A Hierarchical Graph Perspective. CoRR abs/1904.05003 (2019) - [i5]Songxiang Liu, Haibin Wu, Hung-yi Lee, Helen Meng:
Adversarial Attacks on Spoofing Countermeasures of automatic speaker verification. CoRR abs/1910.08716 (2019) - [i4]Xingcheng Song, Guangsen Wang, Zhiyong Wu, Yiheng Huang, Dan Su, Dong Yu, Helen Meng:
Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks. CoRR abs/1910.10387 (2019) - [i3]Xu Li, Jinghua Zhong, Xixin Wu, Jianwei Yu, Xunying Liu, Helen Meng:
Adversarial Attacks on GMM i-vector based Speaker Verification Systems. CoRR abs/1911.03078 (2019) - 2018
- [j39]Kelvin K. F. Tsoi, Felix C. H. Chan, Hoyee W. Hirai, Gary K. S. Keung, Yong-Hong Kuo, Samson Tai, Helen Mei-Ling Meng:
Data Visualization with IBM Watson Analytics for Global Cancer Trends Comparison from World Health Organization. IJHISI 13(1): 45-54 (2018) - [j38]Kun Li, Shaoguang Mao, Xu Li, Zhiyong Wu, Helen Meng:
Automatic lexical stress and pitch accent detection for L2 English speech using multi-distribution deep neural networks. Speech Communication 96: 28-36 (2018) - [c210]Ziwei Zhu, Zhiyong Wu, Runnan Li, Yishuang Ning, Helen Meng:
Learning Frame-Level Recurrent Neural Networks Representations for Query-by-Example Spoken Term Detection on Mobile Devices. AIMS 2018: 55-66 - [c209]Kelvin K. F. Tsoi, Max W. Y. Lam, Christopher T. K. Chu, Michael P. F. Wong, Helen Mei-Ling Meng:
Machine Learning on Drawing Behavior for Dementia Screening. DH 2018: 131-132 - [c208]Max W. Y. Lam, Xunying Liu, Helen Mei-Ling Meng, Kelvin K. F. Tsoi:
Drawing-Based Automatic Dementia Screening Using Gaussian Process Markov Chains. HICSS 2018: 1-10 - [c207]Kelvin K. F. Tsoi, Lingling Zhang, Nicholas B. Chan, Felix C. H. Chan, Hoyee W. Hirai, Helen Mei-Ling Meng:
Social Media as a Tool to Look for People with Dementia Who Become Lost: Factors That Matter. HICSS 2018: 1-10 - [c206]Runnan Li, Zhiyong Wu, Yuchen Huang, Jia Jia, Helen Meng, Lianhong Cai:
Emphatic Speech Generation with Conditioned Input Layer and Bidirectional LSTMS for Expressive Speech Synthesis. ICASSP 2018: 5129-5133 - [c205]Xixin Wu, Lifa Sun, Shiyin Kang, Songxiang Liu, Zhiyong Wu, Xunying Liu, Helen Meng:
Feature Based Adaptation for Speaking Style Synthesis. ICASSP 2018: 5304-5308 - [c204]Xunying Liu, Shansong Liu, Jinze Sha, Jianwei Yu, Zhiyuan Xu, Xie Chen, Helen Meng:
Limited-Memory BFGS Optimization of Recurrent Neural Network Language Models for Speech Recognition. ICASSP 2018: 6114-6118 - [c203]Shaoguang Mao, Xu Li, Kun Li, Zhiyong Wu, Xunying Liu, Helen Meng:
Unsupervised Discovery of an Extended Phoneme Set in L2 English Speech for Mispronunciation Detection and Diagnosis. ICASSP 2018: 6244-6248 - [c202]Shaoguang Mao, Zhiyong Wu, Runnan Li, Xu Li, Helen Meng, Lianhong Cai:
Applying Multitask Learning to Acoustic-Phonemic Model for Mispronunciation Detection and Diagnosis in L2 English Speech. ICASSP 2018: 6254-6258 - [c201]Shaoguang Mao, Zhiyong Wu, Xu Li, Runnan Li, Xixin Wu, Helen Meng:
Integrating Articulatory Features into Acoustic-Phonemic Model for Mispronunciation Detection and Diagnosis in L2 English Speech. ICME 2018: 1-6 - [c200]Ziwei Zhu, Zhiyong Wu, Runnan Li, Helen Meng, Lianhong Cai:
Siamese Recurrent Auto-Encoder Representation for Query-by-Example Spoken Term Detection. INTERSPEECH 2018: 102-106 - [c199]Shuai Yang, Zhiyong Wu, Binbin Shen, Helen Meng:
Detection of Glottal Closure Instants from Speech Signals: A Convolutional Neural Network Based Method. INTERSPEECH 2018: 317-321 - [c198]Songxiang Liu, Jinghua Zhong, Lifa Sun, Xixin Wu, Xunying Liu, Helen Meng:
Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance. INTERSPEECH 2018: 496-500 - [c197]Max W. Y. Lam, Shoukang Hu, Xurong Xie, Shansong Liu, Jianwei Yu, Rongfeng Su, Xunying Liu, Helen Meng:
Gaussian Process Neural Networks for Speech Recognition. INTERSPEECH 2018: 1778-1782 - [c196]Xu Li, Shaoguang Mao, Xixin Wu, Kun Li, Xunying Liu, Helen Meng:
Unsupervised Discovery of Non-native Phonetic Patterns in L2 English Speech for Mispronunciation Detection and Diagnosis. INTERSPEECH 2018: 2554-2558 - [c195]Jianwei Yu, Xurong Xie, Shansong Liu, Shoukang Hu, Max W. Y. Lam, Xixin Wu, Ka Ho Wong, Xunying Liu, Helen Meng:
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus. INTERSPEECH 2018: 2938-2942 - [c194]Helen Meng:
Speech and Language Processing for Learning and Wellbeing. INTERSPEECH 2018: 3022 - [c193]Xixin Wu, Yuewen Cao, Mu Wang, Songxiang Liu, Shiyin Kang, Zhiyong Wu, Xunying Liu, Dan Su, Dong Yu, Helen Meng:
Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis. INTERSPEECH 2018: 3072-3076 - [c192]Xi Ma, Zhiyong Wu, Jia Jia, Mingxing Xu, Helen Meng, Lianhong Cai:
Emotion Recognition from Variable-Length Speech Segments Using Deep Learning on Spectrograms. INTERSPEECH 2018: 3683-3687 - [c191]Jinghua Zhong, Helen Meng:
DNN i-vector based Fishervoice and PLDA SVM scoring for NIST SRE 2016. ISCSLP 2018: 180-184 - [c190]Mu Wang, Zhiyong Wu, Shiyin Kang, Xixin Wu, Jia Jia, Dan Su, Dong Yu, Helen Meng:
Speech Super-Resolution Using Parallel WaveNet. ISCSLP 2018: 260-264 - [c189]Jia Li, Yu Rong, Helen Meng, Zhihui Lu, Timothy Kwok, Hong Cheng:
TATC: Predicting Alzheimer's Disease with Actigraphy Data. KDD 2018: 509-518 - [c188]Runnan Li, Zhiyong Wu, Jia Jia, Jingbei Li, Wei Chen, Helen Meng:
Inferring User Emotive State Changes in Realistic Human-Computer Conversational Dialogs. ACM Multimedia 2018: 136-144 - [c187]Songxiang Liu, Lifa Sun, Xixin Wu, Xunying Liu, Helen Meng:
The HCCL-CUHK System for the Voice Conversion Challenge 2018. Odyssey 2018: 248-254 - 2017
- [j37]Kun Li, Xixin Wu, Helen M. Meng:
Intonation classification for L2 English speech using multi-distribution deep neural networks. Computer Speech & Language 43: 18-33 (2017) - [j36]Kun Li, Xiaojun Qian, Helen M. Meng:
Mispronunciation Detection and Diagnosis in L2 English Speech Using Multidistribution Deep Neural Networks. IEEE/ACM Trans. Audio, Speech & Language Processing 25(1): 193-207 (2017) - [c186]Yishuang Ning, Jia Jia, Zhiyong Wu, Runnan Li, Yongsheng An, Yanfeng Wang, Helen M. Meng:
Multi-Task Deep Learning for User Intention Understanding in Speech Interaction Systems. AAAI 2017: 161-167 - [c185]King Keung Wu, Yeung Yam, Helen M. Meng, Mehran Mesbahi:
Parallel probabilistic swarm guidance by exploiting Kronecker product structures in discrete-time Markov chains. ACC 2017: 346-351 - [c184]Kelvin K. F. Tsoi, Max W. Y. Lam, Felix C. H. Chan, Hoyee W. Hirai, Baker K. K. Bat, Samuel Y. S. Wong, Helen Mei-Ling Meng:
Classification of Visit-to-Visit Blood Pressure Variability: A Machine Learning Approach for Data Clustering on Systolic Blood Pressure Intervention Trial (SPRINT). DH 2017: 58-59 - [c183]Kelvin K. F. Tsoi, Janet Y. H. Wong, Michael P. F. Wong, Gary K. S. Leung, Baker K. K. Bat, Felix C. H. Chan, Yong-Hong Kuo, Herman H. M. Lo, Helen Mei-Ling Meng:
Personal Wearable Devices to Measure Heart Rate Variability: A Framework of Cloud Platform for Public Health Research. DH 2017: 207-208 - [c182]Kelvin K. F. Tsoi, Felix C. H. Chan, Hoyee W. Hirai, Gary K. S. Leung, Yong-Hong Kuo, Samson Tai, Helen M. Meng:
Data Visualization on Global Trends on Cancer Incidence An Application of IBM Watson Analytics. HICSS 2017: 1-6 - [c181]Runnan Li, Zhiyong Wu, Xunying Liu, Helen M. Meng, Lianhong Cai:
Multi-task learning of structured output layer bidirectional LSTMS for speech synthesis. ICASSP 2017: 5510-5514 - [c180]Yishuang Ning, Zhiyong Wu, Runnan Li, Jia Jia, Mingxing Xu, Helen M. Meng, Lianhong Cai:
Learning cross-lingual knowledge with multilingual BLSTM for emphasis detection with limited training data. ICASSP 2017: 5615-5619 - [c179]Pengfei Liu, King Keung Wu, Helen M. Meng:
A model of extended paragraph vector for document categorization and trend analysis. IJCNN 2017: 2400-2406 - [c178]Yuchen Huang, Zhiyong Wu, Runnan Li, Helen Meng, Lianhong Cai:
Multi-Task Learning for Prosodic Structure Generation Using BLSTM RNN with Structured Output Layer. INTERSPEECH 2017: 779-783 - [c177]Xi Ma, Zhiyong Wu, Jia Jia, Mingxing Xu, Helen Meng, Lianhong Cai:
Speech Emotion Recognition with Emotion-Pair Based Framework Considering Emotion Distribution Information in Dimensional Emotion Space. INTERSPEECH 2017: 1238-1242 - [c176]Jinghua Zhong, Wenping Hu, Frank K. Soong, Helen Meng:
DNN i-Vector Speaker Verification with Short, Text-Constrained Test Utterances. INTERSPEECH 2017: 1507-1511 - [c175]Runnan Li, Zhiyong Wu, Yishuang Ning, Lifa Sun, Helen Meng, Lianhong Cai:
Spectro-Temporal Modelling with Time-Frequency LSTM and Structured Output Layer for Voice Conversion. INTERSPEECH 2017: 3409-3413 - 2016
- [j35]Hao Wang, Peggy Mok, Helen Meng:
Capitalizing on musical rhythm for prosodic training in computer-aided language learning. Computer Speech & Language 37: 67-81 (2016) - [j34]Xiaojun Qian, Helen M. Meng, Frank K. Soong:
A Two-Pass Framework of Mispronunciation Detection and Diagnosis for Computer-Aided Pronunciation Training. IEEE/ACM Trans. Audio, Speech & Language Processing 24(6): 1020-1028 (2016) - [c174]Vincent T. F. Chow, Ka Wing Sung, Helen M. Meng, Ka Ho Wong, Gary K. S. Leung, Yong-Hong Kuo, Kelvin K. F. Tsoi:
Utilizing Real-Time Travel Information, Mobile Applications and Wearable Devices for Smart Public Transportation. CCBD 2016: 138-144 - [c173]Kelvin K. F. Tsoi, Benjamin Yip, Doreen W. H. Au, Yong-Hong Kuo, Samuel Y. S. Wong, Jean Woo, Helen Mei-Ling Meng:
Blood Pressure Monitoring on the Cloud System in Elderly Community Centres: A Data Capturing Platform for Application Research in Public Health. CCBD 2016: 312-315 - [c172]Pengfei Liu, Shoaib Jameel, King Keung Wu, Helen M. Meng:
Learning Track Representation and Trends for Conference Analytics. HICSS 2016: 1671-1680 - [c171]Quanjie Yu, Peng Liu, Zhiyong Wu, Shiyin Kang, Helen Meng, Lianhong Cai:
Learning cross-lingual information with multilingual BLSTM for speech synthesis of low-resource languages. ICASSP 2016: 5545-5549 - [c170]Xinyu Lan, Xu Li, Yishuang Ning, Zhiyong Wu, Helen Meng, Jia Jia, Lianhong Cai:
Low level descriptors based DBLSTM bottleneck feature for speech driven talking avatar. ICASSP 2016: 5550-5554 - [c169]Yaodong Tang, Yuchen Huang, Zhiyong Wu, Helen Meng, Mingxing Xu, Lianhong Cai:
Question detection from acoustic features using recurrent neural network with gated recurrent unit. ICASSP 2016: 6125-6129 - [c168]Ka-Ho Wong, Wing Sum Yeung, Yu Ting Yeung, Helen M. Meng:
Exploring articulatory characteristics of Cantonese dysarthric speech using distinctive features. ICASSP 2016: 6495-6499 - [c167]Linchuan Li, Zhiyong Wu, Mingxing Xu, Helen M. Meng, Lianhong Cai:
Recognizing stances in Mandarin social ideological debates with text and acoustic features. ICME Workshops 2016: 1-6 - [c166]Lifa Sun, Kun Li, Hao Wang, Shiyin Kang, Helen M. Meng:
Phonetic posteriorgrams for many-to-one voice conversion without parallel data training. ICME 2016: 1-6 - [c165]Lifa Sun, Hao Wang, Shiyin Kang, Kun Li, Helen M. Meng:
Personalized, Cross-Lingual TTS Using Phonetic Posteriorgrams. INTERSPEECH 2016: 322-326 - [c164]Yaodong Tang, Zhiyong Wu, Helen M. Meng, Mingxing Xu, Lianhong Cai:
Analysis on Gated Recurrent Unit Based Question Detection Approach. INTERSPEECH 2016: 735-739 - [c163]Linchuan Li, Zhiyong Wu, Mingxing Xu, Helen M. Meng, Lianhong Cai:
Combining CNN and BLSTM to Extract Textual and Acoustic Features for Recognizing Stances in Mandarin Ideological Debate Competition. INTERSPEECH 2016: 1392-1396 - [c162]Xu Li, Zhiyong Wu, Helen M. Meng, Jia Jia, Xiaoyan Lou, Lianhong Cai:
Phoneme Embedding and its Application to Speech Driven Talking Avatar Synthesis. INTERSPEECH 2016: 1472-1476 - [c161]Xu Li, Zhiyong Wu, Helen M. Meng, Jia Jia, Xiaoyan Lou, Lianhong Cai:
Expressive Speech Driven Talking Avatar Synthesis with DBLSTM Using Limited Amount of Emotional Bimodal Data. INTERSPEECH 2016: 1477-1481 - [c160]Runnan Li, Zhiyong Wu, Helen M. Meng, Lianhong Cai:
DBLSTM-based multi-task learning for pitch transformation in voice conversion. ISCSLP 2016: 1-5 - [c159]Ka-Ho Wong, Hoi Kiu Kristy Mok, Helen Meng:
Exploratory data analysis on nuclei in cantonese dysarthric speech. ISCSLP 2016: 1-5 - [c158]King Keung Wu, Pengfei Liu, Helen M. Meng, Yeung Yam:
An embedding approach for context-aware collaborative recommendation and visualization. SMC 2016: 3457-3462 - [c157]King Keung Wu, Yeung Yam, Helen M. Meng, Mehran Mesbahi:
Kronecker product approximation with multiple factor matrices via the tensor product algorithm. SMC 2016: 4277-4282 - [i2]Xi Ma, Zhiyong Wu, Jia Jia, Mingxing Xu, Helen M. Meng, Lianhong Cai:
Study on Feature Subspace of Archetypal Emotions for Speech Emotion Recognition. CoRR abs/1611.05675 (2016) - 2015
- [j33]Péter Baranyi, Hassan Charaf, Anna Esposito, Péter Földesi, Helen Meng:
Preface. J. Multimodal User Interfaces 9(4): 261-262 (2015) - [j32]Lei Xie, Jia Jia, Helen M. Meng, Zhigang Deng, Lijuan Wang:
Expressive talking avatar synthesis and animation. Multimedia Tools Appl. 74(22): 9845-9848 (2015) - [j31]Zhiyong Wu, Kai Zhao, Xixin Wu, Xinyu Lan, Helen Meng:
Acoustic to articulatory mapping with deep neural network. Multimedia Tools Appl. 74(22): 9889-9907 (2015) - [j30]Zhiyong Wu, Yishuang Ning, Xiao Zang, Jia Jia, Fanbo Meng, Helen Meng, Lianhong Cai:
Generating emphatic speech with hidden Markov model for expressive speech synthesis. Multimedia Tools Appl. 74(22): 9909-9925 (2015) - [j29]Wei Ying Yi, Kin Ming Lo, Terrence S. T. Mak, Kwong-Sak Leung, Yee Leung, Helen Mei-Ling Meng:
A Survey of Wireless Sensor Network Based Air Pollution Monitoring Systems. Sensors 15(12): 31392-31427 (2015) - [j28]Zhen-Hua Ling, Shiyin Kang, Heiga Zen, Andrew W. Senior, Mike Schuster, Xiaojun Qian, Helen M. Meng, Li Deng:
Deep Learning for Acoustic Modeling in Parametric Speech Generation: A systematic review of existing techniques and future trends. IEEE Signal Process. Mag. 32(3): 35-52 (2015) - [j27]Haizhou Li, Marcello Federico, Xiaodong He, Helen M. Meng, Isabel Trancoso:
Introduction to the Special Section on Continuous Space and Related Methods in Natural Language Processing. IEEE/ACM Trans. Audio, Speech & Language Processing 23(3): 427-430 (2015) - [c156]Xixin Wu, Zhiyong Wu, Yishuang Ning, Jia Jia, Lianhong Cai, Helen M. Meng:
Understanding speaking styles of internet speech data with LSTM and low-resource training. ACII 2015: 815-820 - [c155]Xiaojun Qian, Helen M. Meng, Frank K. Soong:
A two-pass framework of mispronunciation detection & diagnosis for computer-aided pronunciation training. APSIPA 2015: 384-387 - [c154]Benjamin Yip, Hoyee W. Hirai, Yong-Hong Kuo, Helen M. Meng, Samuel Y. S. Wong, Kelvin K. F. Tsoi:
Blood Pressure Management with Data Capturing in the Cloud among Hypertensive Patients: A Monitoring Platform for Hypertensive Patients. BigData Congress 2015: 305-308 - [c153]Kin Fai Ho, Hoyee W. Hirai, Yong-Hong Kuo, Helen M. Meng, Kelvin K. F. Tsoi:
Indoor Air Monitoring Platform and Personal Health Reporting System: Big Data Analytics for Public Health Research. BigData Congress 2015: 309-312 - [c152]Yong-Hong Kuo, Janny M. Y. Leung, Kelvin K. F. Tsoi, Helen M. Meng, Colin A. Graham:
Embracing Big Data for Simulation Modelling of Emergency Department Processes and Activities. BigData Congress 2015: 313-316 - [c151]Yong-Hong Kuo, Janny M. Y. Leung, Helen M. Meng, Kelvin K. F. Tsoi:
A Real-Time Decision Support Tool for Disaster Response: A Mathematical Programming Approach. BigData Congress 2015: 639-642 - [c150]Kelvin K. F. Tsoi, Yong-Hong Kuo, Helen M. Meng:
A Data Capturing Platform in the Cloud for Behavioral Analysis among Smokers: An Application Platform for Public Health Research. BigData Congress 2015: 737-740 - [c149]Pengfei Liu, Shafiq R. Joty, Helen M. Meng:
Fine-grained Opinion Mining with Recurrent Neural Networks and Word Embeddings. EMNLP 2015: 1433-1443 - [c148]Peng Liu, Quanjie Yu, Zhiyong Wu, Shiyin Kang, Helen M. Meng, Lianhong Cai:
A deep recurrent approach for acoustic-to-articulatory inversion. ICASSP 2015: 4450-4454 - [c147]Lifa Sun, Shiyin Kang, Kun Li, Helen M. Meng:
Voice conversion using deep Bidirectional Long Short-Term Memory based Recurrent Neural Networks. ICASSP 2015: 4869-4873 - [c146]Hao Wang, Frank K. Soong, Helen Meng:
A spectral space warping approach to cross-lingual voice transformation in HMM-based TTS. ICASSP 2015: 4874-4878 - [c145]Yishuang Ning, Zhiyong Wu, Jia Jia, Fanbo Meng, Helen M. Meng, Lianhong Cai:
HMM-based emphatic speech synthesis for corrective feedback in computer-aided pronunciation training. ICASSP 2015: 4934-4938 - [c144]Qi Lyu, Zhiyong Wu, Jun Zhu, Helen Meng:
Modelling High-Dimensional Sequences with LSTM-RTRBM: Application to Polyphonic Music Generation. IJCAI 2015: 4138-4139 - [c143]Ka-Ho Wong, Yu Ting Yeung, Edwin H. Y. Chan, Patrick C. M. Wong, Gina-Anne Levow, Helen M. Meng:
Development of a Cantonese dysarthric speech corpus. INTERSPEECH 2015: 329-333 - [c142]Yishuang Ning, Zhiyong Wu, Xiaoyan Lou, Helen M. Meng, Jia Jia, Lianhong Cai:
Using tilt for automatic emphasis detection with Bayesian networks. INTERSPEECH 2015: 578-582 - [c141]Pengfei Liu, Shoaib Jameel, Wai Lam, Bin Ma, Helen M. Meng:
Topic modeling for conference analytics. INTERSPEECH 2015: 707-711 - [c140]Ka-Ho Wong, Wai-Kim Leung, Helen M. Meng:
E-commu-book: an assistive technology for users with speech impairments. INTERSPEECH 2015: 1876-1877 - [c139]Yu Ting Yeung, Ka-Ho Wong, Helen M. Meng:
Improving automatic forced alignment for dysarthric speech transcription. INTERSPEECH 2015: 2991-2995 - [c138]Ka-Ho Wong, Yu Ting Yeung, Patrick C. M. Wong, Gina-Anne Levow, Helen Meng:
Analysis of Dysarthric Speech using Distinctive Feature Recognition. SLPAT@Interspeech 2015: 86-90 - [c137]Kun Li, Xiaojun Qian, Shiyin Kang, Pengfei Liu, Helen Meng:
Integrating acoustic and state-transition models for free phone recognition in L2 English speech using multi-distribution deep neural networks. SLaTE 2015: 119-124 - [e5]Zhengyou Zhang, Phil Cohen, Dan Bohus, Radu Horaud, Helen Meng:
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09 - 13, 2015. ACM 2015, ISBN 978-1-4503-3912-4 [contents] - 2014
- [j26]Jia Jia, Wai-Kim Leung, Yu-Hao Wu, Xiu-Long Zhang, Hao Wang, Lianhong Cai, Helen M. Meng:
Grading the Severity of Mispronunciations in CAPT Based on Statistical Analysis and Computational Speech Perception. J. Comput. Sci. Technol. 29(5): 751-761 (2014) - [j25]Jia Jia, Zhiyong Wu, Shen Zhang, Helen M. Meng, Lianhong Cai:
Head and facial gestures synthesis using PAD model for an expressive talking avatar. Multimedia Tools Appl. 73(1): 439-461 (2014) - [j24]Fanbo Meng, Zhiyong Wu, Jia Jia, Helen M. Meng, Lianhong Cai:
Synthesizing English emphatic speech for multimodal corrective feedback in computer-aided pronunciation training. Multimedia Tools Appl. 73(1): 463-489 (2014) - [j23]Pui-Yu Hui, Helen Meng:
Latent Semantic Analysis for Multimodal User Input With Speech and Gestures. IEEE/ACM Trans. Audio, Speech & Language Processing 22(2): 417-429 (2014) - [c136]Xin Zheng, Zhiyong Wu, Helen Meng, Lianhong Cai:
Learning dynamic features with neural networks for phoneme recognition. ICASSP 2014: 2524-2528 - [c135]Xin Zheng, Zhiyong Wu, Helen Meng, Lianhong Cai:
Contrastive auto-encoder for phoneme recognition. ICASSP 2014: 2529-2533 - [c134]Hao Wang, Xiaojun Qian, Helen Meng:
Phonological modeling of mispronunciation gradations in L2 English speech of L1 Chinese learners. ICASSP 2014: 7714-7718 - [c133]Xiao Zang, Zhiyong Wu, Helen M. Meng, Jia Jia, Lianhong Cai:
Using conditional random fields to predict focus word pair in spontaneous spoken English. INTERSPEECH 2014: 756-760 - [c132]Jinghua Zhong, Weiwu Jiang, Wei Rao, Man-Wai Mak, Helen M. Meng:
PLDA modeling in the fishervoice subspace for speaker verification. INTERSPEECH 2014: 1130-1134 - [c131]