


Остановите войну!
for scientists:


default search action
Hung-yi Lee
Hung-Yi Lee
Person information

Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [i193]Guan-Ting Liu, En-Pei Hu, Pu-Jen Cheng, Hung-Yi Lee, Shao-Hua Sun:
Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs. CoRR abs/2301.12950 (2023) - [i192]Hsuan Su, Shachi H. Kumar, Sahisnu Mazumder, Wenda Chen, Ramesh Manuvinakurike, Eda Okur, Saurav Sahay, Lama Nachman, Shang-Tse Chen, Hung-yi Lee:
Position Matters! Empirical Study of Order Effect in Knowledge-grounded Dialogue. CoRR abs/2302.05888 (2023) - [i191]Kuan-Po Huang, Tzu-hsun Feng, Yu-Kuan Fu, Tsu-Yuan Hsu, Po-Chieh Yen, Wei-Cheng Tseng, Kai-Wei Chang, Hung-yi Lee:
Ensemble knowledge distillation of self-supervised speech models. CoRR abs/2302.12757 (2023) - [i190]Kai-Wei Chang, Yu-Kai Wang, Hua Shen, Iu-thing Kang, Wei-Cheng Tseng, Shang-Wen Li, Hung-yi Lee:
SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks. CoRR abs/2303.00733 (2023) - [i189]Yuan Tseng, Cheng-I Lai, Hung-yi Lee:
Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences. CoRR abs/2303.08809 (2023) - [i188]Sung-Feng Huang, Chia-Ping Chen, Zhi-Sheng Chen, Yu-Pao Tsai, Hung-yi Lee:
Personalized Lightweight Text-to-Speech: Voice Cloning with Adaptive Structured Pruning. CoRR abs/2303.11816 (2023) - [i187]David Cheng-Han Chiang, Hung-yi Lee:
Can Large Language Models Be an Alternative to Human Evaluations? CoRR abs/2305.01937 (2023) - [i186]Yu-Kuan Fu, Liang-Hsuan Tseng, Jiatong Shi, Chen-An Li, Tsu-Yuan Hsu, Shinji Watanabe, Hung-Yi Lee:
Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation. CoRR abs/2305.07455 (2023) - [i185]Jiatong Shi, Dan Berrebbi, William Chen, Ho-Lam Chung, En-Pei Hu, Wei-Ping Huang, Xuankai Chang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe:
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark. CoRR abs/2305.10615 (2023) - [i184]Haibin Wu, Jiawen Kang, Lingwei Meng, Helen Meng, Hung-yi Lee:
The defender's perspective on automatic speaker verification: An overview. CoRR abs/2305.12804 (2023) - 2022
- [j22]Hung-Yi Lee, Shinji Watanabe
, Karen Livescu, Abdelrahman Mohamed, Tara N. Sainath
:
Editorial Editorial of Special Issue on Self-Supervised Learning for Speech and Audio Processing. IEEE J. Sel. Top. Signal Process. 16(6): 1174-1178 (2022) - [j21]Abdelrahman Mohamed, Hung-yi Lee
, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin
, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath
, Shinji Watanabe
:
Self-Supervised Speech Representation Learning: A Review. IEEE J. Sel. Top. Signal Process. 16(6): 1179-1210 (2022) - [j20]Haibin Wu
, Xu Li
, Andy T. Liu
, Zhiyong Wu
, Helen Meng, Hung-Yi Lee
:
Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning. IEEE ACM Trans. Audio Speech Lang. Process. 30: 202-217 (2022) - [j19]Da-Rong Liu, Po-chun Hsu, Yi-Chen Chen, Sung-Feng Huang
, Shun-Po Chuang, Da-Yi Wu, Hung-yi Lee
:
Learning Phone Recognition From Unpaired Audio and Phone Sequences Based on Generative Adversarial Network. IEEE ACM Trans. Audio Speech Lang. Process. 30: 230-243 (2022) - [j18]Sung-Feng Huang
, Chyi-Jiunn Lin
, Da-Rong Liu, Yi-Chen Chen
, Hung-yi Lee
:
Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1558-1571 (2022) - [j17]Yi-Long Liou, Jui-Yang Hsu
, Chen-Sheng Chen, Alexander H. Liu, Hung-Yi Lee
, Tsung-Te Liu
:
A Fully Integrated 1.7mW Attention-Based Automatic Speech Recognition Processor. IEEE Trans. Circuits Syst. II Express Briefs 69(10): 4178-4182 (2022) - [c166]David Cheng-Han Chiang, Hung-Yi Lee:
On the Transferability of Pre-trained Language Models: A Study from Artificial Datasets. AAAI 2022: 10518-10525 - [c165]Chan-Jan Hsu, Hung-yi Lee, Yu Tsao:
XDBERT: Distilling Visual Information to BERT from Cross-Modal Systems to Improve Language Understanding. ACL (2) 2022: 479-489 - [c164]Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-Wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities. ACL (1) 2022: 8479-8492 - [c163]Haibin Wu, Po-chun Hsu, Ji Gao, Shanshan Zhang, Shen Huang, Jian Kang, Zhiyong Wu, Helen Meng, Hung-Yi Lee:
Adversarial Sample Detection for Speaker Verification by Neural Vocoders. ICASSP 2022: 236-240 - [c162]Haibin Wu, Bo Zheng, Xu Li, Xixin Wu, Hung-Yi Lee, Helen Meng:
Characterizing the Adversarial Vulnerability of Speech self-Supervised Learning. ICASSP 2022: 3164-3168 - [c161]Yen Meng, Yi-Hui Chou, Andy T. Liu, Hung-yi Lee:
Don't Speak Too Fast: The Impact of Data Bias on Self-Supervised Speech Models. ICASSP 2022: 3258-3262 - [c160]Wen-Chin Huang, Shu-Wen Yang, Tomoki Hayashi, Hung-Yi Lee, Shinji Watanabe
, Tomoki Toda:
S3PRL-VC: Open-Source Voice Conversion Framework with Self-Supervised Speech Representations. ICASSP 2022: 6552-6556 - [c159]Chien-yu Huang, Kai-Wei Chang, Hung-Yi Lee:
Toward Degradation-Robust Voice Conversion. ICASSP 2022: 6777-6781 - [c158]Heng-Jui Chang, Shu-Wen Yang, Hung-yi Lee:
Distilhubert: Speech Representation Learning by Layer-Wise Distillation of Hidden-Unit Bert. ICASSP 2022: 7087-7091 - [c157]Guan-Ting Lin, Chan-Jan Hsu, Da-Rong Liu, Hung-Yi Lee, Yu Tsao:
Analyzing The Robustness of Unsupervised Speech Recognition. ICASSP 2022: 8202-8206 - [c156]Haibin Wu, Heng-Cheng Kuo, Naijun Zheng, Kuo-Hsuan Hung, Hung-Yi Lee, Yu Tsao, Hsin-Min Wang
, Helen Meng:
Partially Fake Audio Detection by Self-Attention-Based Fake Span Discovery. ICASSP 2022: 9236-9240 - [c155]Yang Zhang, Zhiqiang Lv, Haibin Wu, Shanshan Zhang, Pengfei Hu, Zhiyong Wu, Hung-yi Lee, Helen Meng:
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification. INTERSPEECH 2022: 306-310 - [c154]Kuan-Po Huang, Yu-Kuan Fu, Yu Zhang, Hung-yi Lee:
Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation. INTERSPEECH 2022: 2193-2197 - [c153]Guan-Ting Lin, Shang-Wen Li, Hung-yi Lee:
Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition. INTERSPEECH 2022: 2198-2202 - [c152]Haibin Wu, Lingwei Meng, Jiawen Kang, Jinchao Li, Xu Li, Xixin Wu, Hung-yi Lee, Helen Meng:
Spoofing-Aware Speaker Verification by Multi-Level Fusion. INTERSPEECH 2022: 4357-4361 - [c151]Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee:
DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores. INTERSPEECH 2022: 4541-4545 - [c150]Wei-Ping Huang, Po-Chun Chen, Sung-Feng Huang, Hung-yi Lee:
Few Shot Cross-Lingual TTS Using Transferable Phoneme Embedding. INTERSPEECH 2022: 4566-4570 - [c149]Kai-Wei Chang, Wei-Cheng Tseng, Shang-Wen Li, Hung-yi Lee:
An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks. INTERSPEECH 2022: 5005-5009 - [c148]Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee:
Membership Inference Attacks Against Self-supervised Speech Models. INTERSPEECH 2022: 5040-5044 - [c147]Guan-Ting Lin, Yung-Sung Chuang, Ho-Lam Chung, Shu-Wen Yang, Hsuan-Jui Chen, Shuyan Annie Dong, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-Shan Lee:
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering. INTERSPEECH 2022: 5165-5169 - [c146]Chih-Chiang Chang, Hung-yi Lee:
Exploring Continuous Integrate-and-Fire for Adaptive Simultaneous Speech Translation. INTERSPEECH 2022: 5175-5179 - [c145]Chih-Chiang Chang, Shun-Po Chuang, Hung-yi Lee:
Anticipation-Free Training for Simultaneous Machine Translation. IWSLT@ACL 2022: 43-61 - [c144]Hung-yi Lee, Shang-Wen Li, Thang Vu:
Meta Learning for Natural Language Processing: A Survey. NAACL-HLT 2022: 666-684 - [c143]Chin-Lun Fu, Zih-Ching Chen, Yun-Ru Lee, Hung-yi Lee:
AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks. NAACL-HLT (Findings) 2022: 2608-2621 - [c142]Haibin Wu, Jiawen Kang, Lingwei Meng, Yang Zhang, Xixin Wu, Zhiyong Wu, Hung-yi Lee, Helen Meng:
Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion. Odyssey 2022: 92-99 - [c141]Wei-Tsung Kao, Yuan-Kuei Wu, Chia-Ping Chen, Zhi-Sheng Chen, Yu-Pao Tsai, Hung-Yi Lee:
On the Efficiency of Integrating Self-Supervised Learning and Meta-Learning for User-Defined Few-Shot Keyword Spotting. SLT 2022: 414-421 - [c140]Xuanjun Chen, Haibin Wu, Helen Meng, Hung-yi Lee, Jyh-Shing Roger Jang:
Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection. SLT 2022: 692-699 - [c139]Yi-Jen Shih, Hsuan-Fu Wang, Heng-Jui Chang, Layne Berry, Hung-yi Lee, David Harwath:
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model. SLT 2022: 715-722 - [c138]Tzu-hsun Feng, Shuyan Annie Dong, Ching-Feng Yeh, Shu-Wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe
, Abdelrahman Mohamed, Shang-Wen Li, Hung-yi Lee:
Superb @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning. SLT 2022: 1096-1103 - [c137]Guan-Ting Lin, Chi-Luen Feng, Wei-Ping Huang, Yuan Tseng, Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Nigel G. Ward:
On the Utility of Self-Supervised Models for Prosody-Related Tasks. SLT 2022: 1104-1111 - [c136]Kuan-Po Huang, Yu-Kuan Fu, Tsu-Yuan Hsu, Fabian Ritter Gutierrez, Fan-Lin Wang, Liang-Hsuan Tseng, Yu Zhang, Hung-yi Lee:
Improving Generalizability of Distilled Self-Supervised Speech Processing Models Under Distorted Settings. SLT 2022: 1112-1119 - [c135]Zih-Ching Chen, Chin-Lun Fu, Chih-Ying Liu, Shang-Wen (Daniel) Li, Hung-yi Lee:
Exploring Efficient-Tuning Methods in Self-Supervised Speech Models. SLT 2022: 1120-1127 - [c134]Yen Meng, Hsuan-Jui Chen, Jiatong Shi, Shinji Watanabe
, Paola García, Hung-yi Lee, Hao Tang:
On Compressing Sequences for Self-Supervised Speech Models. SLT 2022: 1128-1135 - [e1]Kong Aik Lee, Hung-yi Lee, Yanfeng Lu, Minghui Dong:
13th International Symposium on Chinese Spoken Language Processing, ISCSLP 2022, Singapore, December 11-14, 2022. IEEE 2022, ISBN 979-8-3503-9796-3 [contents] - [i183]Chih-Chiang Chang, Shun-Po Chuang, Hung-yi Lee:
Anticipation-free Training for Simultaneous Translation. CoRR abs/2201.12868 (2022) - [i182]Haibin Wu, Heng-Cheng Kuo, Naijun Zheng, Kuo-Hsuan Hung, Hung-Yi Lee, Yu Tsao, Hsin-Min Wang, Helen Meng:
Partially Fake Audio Detection by Self-attention-based Fake Span Discovery. CoRR abs/2202.06684 (2022) - [i181]Guan-Ting Lin, Yung-Sung Chuang, Ho-Lam Chung, Shu-Wen Yang, Hsuan-Jui Chen, Shuyan Dong, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-Shan Lee:
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering. CoRR abs/2203.04911 (2022) - [i180]Kuan-Po Huang, Yuan-Kuei Wu, Hung-yi Lee:
Improving the transferability of speech separation by meta-learning. CoRR abs/2203.05882 (2022) - [i179]Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-Wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Jeff Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities. CoRR abs/2203.06849 (2022) - [i178]Guan-Ting Lin, Shang-Wen Li, Hung-yi Lee:
Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition. CoRR abs/2203.14222 (2022) - [i177]Yang Zhang, Zhiqiang Lv, Haibin Wu, Shanshan Zhang, Pengfei Hu, Zhiyong Wu, Hung-Yi Lee, Helen Meng:
MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification. CoRR abs/2203.15249 (2022) - [i176]Haibin Wu, Lingwei Meng, Jiawen Kang, Jinchao Li, Xu Li, Xixin Wu, Hung-yi Lee, Helen Meng:
Spoofing-Aware Speaker Verification by Multi-Level Fusion. CoRR abs/2203.15377 (2022) - [i175]Kuan-Po Huang, Yu-Kuan Fu, Yu Zhang, Hung-yi Lee:
Improving Distortion Robustness of Self-supervised Speech Processing Tasks with Domain Adaptation. CoRR abs/2203.16104 (2022) - [i174]Kai-Wei Chang, Wei-Cheng Tseng, Shang-Wen Li, Hung-yi Lee:
An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks. CoRR abs/2203.16773 (2022) - [i173]Fan-Lin Wang, Po-chun Hsu, Da-Rong Liu, Hung-yi Lee:
Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis. CoRR abs/2204.00170 (2022) - [i172]Wei-Tsung Kao, Yuen-Kwei Wu, Chia-Ping Chen, Zhi-Sheng Chen, Yu-Pao Tsai, Hung-Yi Lee:
On the Efficiency of Integrating Self-supervised Learning and Meta-learning for User-defined Few-shot Keyword Spotting. CoRR abs/2204.00352 (2022) - [i171]Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee:
DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores. CoRR abs/2204.03219 (2022) - [i170]David Cheng-Han Chiang, Hung-Yi Lee:
Understanding, Detecting, and Separating Out-of-Distribution Samples and Adversarial Samples in Text Classification. CoRR abs/2204.04458 (2022) - [i169]David Cheng-Han Chiang, Hung-Yi Lee:
Re-Examining Human Annotations for Interpretable NLP. CoRR abs/2204.04580 (2022) - [i168]Chan-Jan Hsu, Hung-yi Lee, Yu Tsao:
XDBERT: Distilling Visual Information to BERT from Cross-Modal Systems to Improve Language Understanding. CoRR abs/2204.07316 (2022) - [i167]Chih-Chiang Chang, Hung-yi Lee:
Exploring Continuous Integrate-and-Fire for Adaptive Simultaneous Speech Translation. CoRR abs/2204.09595 (2022) - [i166]Po-chun Hsu, Da-Rong Liu, Andy T. Liu, Hung-yi Lee:
Parallel Synthesis for Autoregressive Speech Generation. CoRR abs/2204.11806 (2022) - [i165]Chin-Lun Fu, Zih-Ching Chen, Yun-Ru Lee, Hung-yi Lee:
AdapterBias: Parameter-efficient Token-dependent Representation Shift for Adapters in NLP Tasks. CoRR abs/2205.00305 (2022) - [i164]Hung-yi Lee, Shang-Wen Li, Ngoc Thang Vu:
Meta Learning for Natural Language Processing: A Survey. CoRR abs/2205.01500 (2022) - [i163]Chi-Luen Feng, Po-chun Hsu, Hung-yi Lee:
Silence is Sweeter Than Speech: Self-Supervised Model Using Silence to Store Speaker Information. CoRR abs/2205.03759 (2022) - [i162]Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath, Shinji Watanabe
:
Self-Supervised Speech Representation Learning: A Review. CoRR abs/2205.10643 (2022) - [i161]Chi-Liang Liu, Hung-yi Lee, Wen-tau Yih:
Structured Prompt Tuning. CoRR abs/2205.12309 (2022) - [i160]Dennis Y. Menn, Hung-yi Lee:
Searching for the Essence of Adversarial Perturbations. CoRR abs/2205.15357 (2022) - [i159]Hsuan Su, Po-Han Chi, Shih-Cheng Huang, Ho-Lam Chung, Saurav Sahay, Shang-Tse Chen, Hung-Yi Lee:
Few-shot Prompting Towards Controllable Response Generation. CoRR abs/2206.03931 (2022) - [i158]Haibin Wu, Jiawen Kang, Lingwei Meng, Yang Zhang, Xixin Wu, Zhiyong Wu, Hung-yi Lee, Helen Meng:
Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion. CoRR abs/2206.09131 (2022) - [i157]Wei-Ping Huang, Po-Chun Chen, Sung-Feng Huang, Hung-yi Lee:
Few-Shot Cross-Lingual TTS Using Transferable Phoneme Embedding. CoRR abs/2206.15427 (2022) - [i156]Da-Rong Liu, Po-chun Hsu, Yi-Chen Chen, Sung-Feng Huang, Shun-Po Chuang, Da-Yi Wu, Hung-yi Lee:
Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network. CoRR abs/2207.14568 (2022) - [i155]Tung-Yu Wu, Chen-An Li, Tzu-Han Lin, Tsu-Yuan Hsu, Hung-Yi Lee:
The Ability of Self-Supervised Speech Models for Audio Representations. CoRR abs/2209.12900 (2022) - [i154]Yi-Jen Shih, Hsuan-Fu Wang, Heng-Jui Chang, Layne Berry, Hung-yi Lee, David Harwath:
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model. CoRR abs/2210.00705 (2022) - [i153]Xuanjun Chen, Haibin Wu, Helen Meng, Hung-yi Lee, Jyh-Shing Roger Jang:
Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection. CoRR abs/2210.00753 (2022) - [i152]David Cheng-Han Chiang, Hung-yi Lee:
How Far Are We from Real Synonym Substitution Attacks? CoRR abs/2210.02844 (2022) - [i151]Zih-Ching Chen, Chin-Lun Fu, Chih-Ying Liu, Shang-Wen Li, Hung-yi Lee:
Exploring Efficient-tuning Methods in Self-supervised Speech Models. CoRR abs/2210.06175 (2022) - [i150]Guan-Ting Lin, Chi-Luen Feng, Wei-Ping Huang, Yuan Tseng, Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Nigel G. Ward:
On the Utility of Self-supervised Models for Prosody-related Tasks. CoRR abs/2210.07185 (2022) - [i149]Yen Meng, Hsuan-Jui Chen, Jiatong Shi, Shinji Watanabe
, Paola García, Hung-yi Lee, Hao Tang:
On Compressing Sequences for Self-Supervised Speech Models. CoRR abs/2210.07189 (2022) - [i148]Kuan-Po Huang, Yu-Kuan Fu, Tsu-Yuan Hsu, Fabian Ritter Gutierrez, Fan-Lin Wang, Liang-Hsuan Tseng, Yu Zhang, Hung-yi Lee:
Improving generalizability of distilled self-supervised speech processing models under distorted settings. CoRR abs/2210.07978 (2022) - [i147]Tzu-hsun Feng, Shuyan Annie Dong, Ching-Feng Yeh, Shu-Wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe
, Abdelrahman Mohamed, Shang-Wen Li, Hung-yi Lee:
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning. CoRR abs/2210.08634 (2022) - [i146]Xuanjun Chen, Haibin Wu, Chung-Che Wang, Hung-yi Lee, Jyh-Shing Roger Jang:
Multimodal Transformer Distillation for Audio-Visual Synchronization. CoRR abs/2210.15563 (2022) - [i145]Chan-Jan Hsu, Ho-Lam Chung, Hung-yi Lee, Yu Tsao:
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5. CoRR abs/2211.00586 (2022) - [i144]Layne Berry, Yi-Jen Shih, Hsuan-Fu Wang, Heng-Jui Chang, Hung-yi Lee, David Harwath:
M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval. CoRR abs/2211.01180 (2022) - [i143]Hsuan-Jui Chen, Yen Meng, Hung-yi Lee:
Once-for-All Sequence Compression for Self-Supervised Speech Models. CoRR abs/2211.02332 (2022) - [i142]Jiatong Shi, Chan-Jan Hsu, Ho-Lam Chung, Dongji Gao, Paola García, Shinji Watanabe
, Ann Lee, Hung-yi Lee:
Bridging Speech and Textual Pre-trained Models with Unsupervised ASR. CoRR abs/2211.03025 (2022) - [i141]Derek Xu, Shuyan Dong, Changhan Wang, Suyoun Kim, Zhaojiang Lin, Akshat Shrivastava, Shang-Wen Li, Liang-Hsuan Tseng, Alexei Baevski, Guan-Ting Lin, Hung-yi Lee, Yizhou Sun, Wei Wang:
Introducing Semantics into Speech Encoders. CoRR abs/2211.08402 (2022) - [i140]Tzu-Quan Lin, Hung-yi Lee, Hao Tang:
MelHuBERT: A simplified HuBERT on Mel spectrogram. CoRR abs/2211.09944 (2022) - [i139]Tzu-Quan Lin, Tsung-Huan Yang, Chun-Yao Chang, Kuang-Ming Chen, Tzu-hsun Feng, Hung-yi Lee, Hao Tang:
Compressing Transformer-based self-supervised models for speech processing. CoRR abs/2211.09949 (2022) - [i138]Tsu-Yuan Hsu, Chen-An Li, Tung-Yu Wu, Hung-yi Lee:
Model Extraction Attack against Self-supervised Speech Models. CoRR abs/2211.16044 (2022) - [i137]Dongji Gao, Jiatong Shi, Shun-Po Chuang, Leibny Paola Garcia, Hung-yi Lee, Shinji Watanabe
, Sanjeev Khudanpur:
EURO: ESPnet Unsupervised ASR Open-source Toolkit. CoRR abs/2211.17196 (2022) - [i136]Shih-Cheng Huang, Shih-Heng Wang, Min-Han Shih, Saurav Sahay, Hung-yi Lee:
General Framework for Self-Supervised Model Priming for Parameter-Efficient Fine-tuning. CoRR abs/2212.01032 (2022) - [i135]Zih-Ching Chen, Yu-Shun Sung, Hung-yi Lee:
CHAPTER: Exploiting Convolutional Neural Network Adapters for Self-supervised Speech Models. CoRR abs/2212.01282 (2022) - [i134]Suwon Shon, Siddhant Arora, Chyi-Jiunn Lin, Ankita Pasad, Felix Wu, Roshan Sharma, Wei-Lun Wu, Hung-Yi Lee, Karen Livescu, Shinji Watanabe
:
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks. CoRR abs/2212.10525 (2022) - 2021
- [j16]Shun-Po Chuang
, Alexander H. Liu, Tzu-Wei Sung, Hung-yi Lee
:
Improving Automatic Speech Recognition and Speech Translation via Word Embedding Prediction. IEEE ACM Trans. Audio Speech Lang. Process. 29: 93-105 (2021) - [j15]Andy T. Liu
, Shang-Wen Li
, Hung-yi Lee
:
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2351-2366 (2021) - [c133]Yu-Ching Chiu, Bo-Hao Chang, Tzu-Yu Chen, Cheng-Fu Yang, Nanyi Bi, Richard Tzong-Han Tsai, Hung-yi Lee, Jane Yung-jen Hsu:
Multi-modal User Intent Classification Under the Scenario of Smart Factory (Student Abstract). AAAI 2021: 15771-15772 - [c132]Shun-Po Chuang, Yung-Sung Chuang
, Chih-Chiang Chang, Hung-yi Lee:
Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation. ACL/IJCNLP (Findings) 2021: 1068-1077 - [c131]Xuankai Chang, Takashi Maekaku, Pengcheng Guo, Jing Shi, Yen-Ju Lu, Aswin Shanmugam Subramanian, Tianzi Wang, Shu-Wen Yang, Yu Tsao, Hung-yi Lee, Shinji Watanabe
:
An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition. ASRU 2021: 228-235 - [c130]Shun-Po Chuang, Heng-Jui Chang, Sung-Feng Huang, Hung-yi Lee:
Non-Autoregressive Mandarin-English Code-Switching Speech Recognition. ASRU 2021: 465-472 - [c129]Wei-Tsung Kao, Hung-yi Lee:
Is BERT a Cross-Disciplinary Knowledge Learner? A Surprising Finding of Pre-trained Models' Transferability. EMNLP (Findings) 2021: 2195-2208 - [c128]Yuan-Kuei Wu, Kuan-Po Huang, Yu Tsao, Hung-yi Lee:
One Shot Learning for Speech Separation. ICASSP 2021: 5769-5773 - [c127]Yist Y. Lin, Chung-Ming Chien, Jheng-Hao Lin, Hung-yi Lee, Lin-Shan Lee:
Fragmentvc: Any-To-Any Voice Conversion by End-To-End Extracting and Fusing Fine-Grained Voice Fragments with Attention. ICASSP 2021: 5939-5943 - [c126]Yen-Hao Chen, Da-Yi Wu, Tsung-Han Wu, Hung-yi Lee:
Again-VC: A One-Shot Voice Conversion Using Activation Guidance and Adaptive Instance Normalization. ICASSP 2021: 5954-5958 - [c125]