Van Tung Pham
Person information
- affiliation: ByteDance
2020 – today
- 2024
- [i13] Van Tung Pham, Yist Y. Lin, Tao Han, Wei Li, Jun Zhang, Lu Lu, Yuxuan Wang:
A Comprehensive Solution to Connect Speech Encoder and Large Language Model for ASR. CoRR abs/2406.17272 (2024)
- 2023
- [c32] Yist Y. Lin, Tao Han, Haihua Xu, Van Tung Pham, Yerbolat Khassanov, Tze Yuang Chong, Yi He, Lu Lu, Zejun Ma:
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition. INTERSPEECH 2023: 904-908
- [i12] Yi Guo, Yiqian He, Xiaoyang Li, Haotong Qin, Van Tung Pham, Yang Zhang, Shouda Liu:
RdimKD: Generic Distillation Paradigm by Dimensionality Reduction. CoRR abs/2312.08700 (2023)
- 2022
- [i11] Haihua Xu, Van Tung Pham, Yerbolat Khassanov, Yist Y. Lin, Tao Han, Tze Yuan Chong, Yi He, Zejun Ma:
Improving short-video speech recognition using random utterance concatenation. CoRR abs/2210.15876 (2022)
- 2021
- [c31] Manav Kaushik, Van Tung Pham, Tran The Anh, Eng Siong Chng:
End-to-End Speaker Age and Height Estimation using Attention Mechanism and Triplet Loss. APSIPA ASC 2021: 1-8
- [c30] Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Multitask-based joint learning approach to robust ASR for radio communication speech. APSIPA ASC 2021: 497-502
- [c29] Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named Entities for Improved Speech Recognition. APSIPA ASC 2021: 1021-1025
- [c28] Jicheng Zhang, Yizhou Peng, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
E2E-Based Multi-Task Learning Approach to Joint Speech and Accent Recognition. Interspeech 2021: 1519-1523
- [c27] Weiguang Chen, Van Tung Pham, Eng Siong Chng, Xionghu Zhong:
Overlapped Speech Detection Based on Spectral and Spatial Feature Fusion. Interspeech 2021: 4189-4193
- [c26] Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. ISCSLP 2021: 1-5
- [c25] Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. ISCSLP 2021: 1-5
- [i10] Manav Kaushik, Van Tung Pham, Eng Siong Chng:
End-to-End Speaker Height and age estimation using Attention Mechanism with LSTM-RNN. CoRR abs/2101.05056 (2021)
- [i9] Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech. CoRR abs/2107.10701 (2021)
- [i8] Shangeth Rajaa, Van Tung Pham, Chng Eng Siong:
Learning Speaker Representation with Semi-supervised Learning approach for Speaker Profiling. CoRR abs/2110.13653 (2021)
- 2020
- [c24] Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent Language Modeling Architecture for End-To-End ASR. ICASSP 2020: 7059-7063
- [c23] Haobo Zhang, Haihua Xu, Van Tung Pham, Hao Huang, Eng Siong Chng:
Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-Switching Speech Recognition. INTERSPEECH 2020: 2392-2396
- [c22] Nana Hou, Chenglin Xu, Van Tung Pham, Joey Tianyi Zhou, Eng Siong Chng, Haizhou Li:
Speaker and Phoneme-Aware Speech Bandwidth Extension with Residual Dual-Path Network. INTERSPEECH 2020: 4064-4068
- [i7] Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. CoRR abs/2005.08742 (2020)
- [i6] Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. CoRR abs/2005.10407 (2020)
- [i5] Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance. CoRR abs/2010.12143 (2020)
2010 – 2019
- 2019
- [b1] Van Tung Pham:
Robust spoken term detection using partial search and re-scoring hypothesized detections techniques. Nanyang Technological University, Singapore, 2019
- [c21] Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. INTERSPEECH 2019: 2160-2164
- [c20] Zhiping Zeng, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Eng Siong Chng, Haizhou Li:
On the End-to-End Solution to Mandarin-English Code-Switching Speech Recognition. INTERSPEECH 2019: 2165-2169
- [c19] Yerbolat Khassanov, Zhiping Zeng, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation. INTERSPEECH 2019: 3505-3509
- [i4] Yerbolat Khassanov, Zhiping Zeng, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Enriching Rare Word Representations in Neural Language Models by Embedding Matrix Augmentation. CoRR abs/1904.03799 (2019)
- [i3] Yerbolat Khassanov, Haihua Xu, Van Tung Pham, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma:
Constrained Output Embeddings for End-to-End Code-Switching Speech Recognition with Only Monolingual Data. CoRR abs/1904.03802 (2019)
- [i2] Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent language modeling architecture for end-to-end ASR. CoRR abs/1912.00863 (2019)
- 2018
- [j1] Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li:
Re-ranking spoken term detection with acoustic exemplars of keywords. Speech Commun. 104: 12-23 (2018)
- [c18] Haihua Xu, Van Tung Pham, Zin Tun Kyaw, Zhi Hao Lim, Eng Siong Chng, Haizhou Li:
Mandarin-English Code-switching Speech Recognition. INTERSPEECH 2018: 554-555
- [i1] Zhiping Zeng, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Eng Siong Chng, Haizhou Li:
On the End-to-End Solution to Mandarin-English Code-switching Speech Recognition. CoRR abs/1811.00241 (2018)
- 2017
- [c17] Nancy F. Chen, Boon Pang Lim, Van Hai Do, Van Tung Pham, Chongjia Ni, Haihua Xu, Mark Hasegawa-Johnson, Wenda Chen, Xiong Xiao, Sunil Sivadas, Eng Siong Chng, Bin Ma, Haizhou Li:
Low-resource spoken keyword search strategies in georgian inspired by distinctive feature theory. APSIPA 2017: 1322-1327
- [c16] Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng:
Pruning Strategies for Partial Search in Spoken Term Detection. SoICT 2017: 114-119
- 2016
- [c15] Thi-Nga Ho, Tze Yuang Chong, Van Hai Do, Van Tung Pham, Eng Siong Chng:
Improving Efficiency of Sentence Boundary Detection by Feature Selection. ACIIDS (2) 2016: 594-603
- [c14] Zhengchen Zhang, Mei Li, Yuchao Zhang, Weini Zhang, Yang Liu, Shan Yang, Yanfeng Lu, Van Tung Pham, Lei Xie, Minghui Dong:
The I2R-NWPU-NTU Text-to-Speech System at Blizzard Challenge 2016. Blizzard Challenge 2016
- [c13] Haihua Xu, Jingyong Hou, Xiong Xiao, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Van Hai Do, Hang Lv, Lei Xie, Bin Ma, Eng Siong Chng, Haizhou Li:
Approximate search of audio queries by using DTW with phone time boundary and data augmentation. ICASSP 2016: 6030-6034
- [c12] Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li:
Keyword search using query expansion for graph-based rescoring of hypothesized detections. ICASSP 2016: 6035-6039
- [c11] Nancy F. Chen, Van Tung Pham, Haihua Xu, Xiong Xiao, Van Hai Do, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Chin-Hui Lee, Eng Siong Chng, Bin Ma, Haizhou Li:
Exemplar-inspired strategies for low-resource spoken keyword search in Swahili. ICASSP 2016: 6040-6044
- [c10] Van Tung Pham, Haihua Xu, Xiong Xiao, Nancy F. Chen, Eng Siong Chng, Haizhou Li:
Rescoring Hypothesized Detections of Out-of-Vocabulary Keywords Using Subword Samples. INTERSPEECH 2016: 933-937
- [c9] Cheung-Chi Leung, Lei Wang, Haihua Xu, Jingyong Hou, Van Tung Pham, Hang Lv, Lei Xie, Xiong Xiao, Chongjia Ni, Bin Ma, Eng Siong Chng, Haizhou Li:
Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis. INTERSPEECH 2016: 3703-3707
- 2015
- [c8] Van Tung Pham, Haihua Xu, Van Hai Do, Tze Yuang Chong, Xiong Xiao, Eng Siong Chng, Haizhou Li:
On the study of very low-resource language keyword search. APSIPA 2015: 358-364
- [c7] Hang Su, Van Tung Pham, Yanzhang He, James Hieronymus:
Improvements on transducing syllable lattice to word lattice for keyword search. ICASSP 2015: 4729-4733
- [c6] Nancy F. Chen, Chongjia Ni, I-Fan Chen, Sunil Sivadas, Van Tung Pham, Haihua Xu, Xiong Xiao, Tze Siong Lau, Su Jun Leow, Boon Pang Lim, Cheung-Chi Leung, Lei Wang, Chin-Hui Lee, Alvina Goh, Engsiong Chng, Bin Ma, Haizhou Li:
Low-resource keyword search strategies for tamil. ICASSP 2015: 5366-5370
- [c5] Jingyong Hou, Van Tung Pham, Cheung-Chi Leung, Lei Wang, Haihua Xu, Hang Lv, Lei Xie, Zhonghua Fu, Chongjia Ni, Xiong Xiao, Hongjie Chen, Shaofei Zhang, Sining Sun, Yougen Yuan, Pengcheng Li, Tin Lay Nwe, Sunil Sivadas, Bin Ma, Engsiong Chng, Haizhou Li:
The NNI Query-by-Example System for MediaEval 2015. MediaEval 2015
- 2014
- [c4] Haihua Xu, Van Tung Pham, Engsiong Chng, Haizhou Li:
Towards better keyword search performance on Malay broadcast news data. APSIPA 2014: 1-5
- [c3] Nancy F. Chen, Sunil Sivadas, Boon Pang Lim, Hoang Gia Ngo, Haihua Xu, Van Tung Pham, Bin Ma, Haizhou Li:
Strategies for Vietnamese keyword search. ICASSP 2014: 4121-4125
- [c2] Van Tung Pham, Haihua Xu, Nancy F. Chen, Sunil Sivadas, Boon Pang Lim, Engsiong Chng, Haizhou Li:
Discriminative score normalization for keyword search decision. ICASSP 2014: 7078-7082
- [c1] Van Tung Pham, Nancy F. Chen, Sunil Sivadas, Haihua Xu, I-Fan Chen, Chongjia Ni, Engsiong Chng, Haizhou Li:
System and keyword dependent fusion for spoken term detection. SLT 2014: 430-435
last updated on 2024-10-07 22:14 CEST by the dblp team
all metadata released as open data under CC0 1.0 license