default search action
Shang-Wen Li 0001
Shang-wen Li 0001 – Shang-Wen (Daniel) Li
Person information
- affiliation: Apple Inc., Cupertino, CA, USA
- affiliation (former): Amazon, Seattle, WA, USA
- affiliation (PhD 2017): Massachusetts Institute of Technology, Cambridge, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j6]Shu-Wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee:
A Large-Scale Evaluation of Speech Foundation Models. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2884-2899 (2024) - [j5]Kai-Wei Chang, Haibin Wu, Yu-Kai Wang, Yuan-Kuei Wu, Hua Shen, Wei-Cheng Tseng, Iu-thing Kang, Shang-wen Li, Hung-Yi Lee:
SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3730-3744 (2024) - [j4]Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy V. Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mido Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jégou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski:
DINOv2: Learning Robust Visual Features without Supervision. Trans. Mach. Learn. Res. 2024 (2024) - [c48]Puyuan Peng, Po-Yao Huang, Shang-Wen Li, Abdelrahman Mohamed, David Harwath:
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild. ACL (1) 2024: 12442-12462 - [c47]Jiawei Ma, Po-Yao Huang, Saining Xie, Shang-Wen Li, Luke Zettlemoyer, Shih-Fu Chang, Wen-Tau Yih, Hu Xu:
MoDE: CLIP Data Experts via Clustering. CVPR 2024: 26344-26353 - [c46]Hu Xu, Po-Yao Huang, Xiaoqing Ellen Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer:
Altogether: Image Captioning via Re-aligning Alt-text. EMNLP 2024: 19302-19318 - [c45]Yuan Tseng, Layne Berry, Yiting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Poyao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Abdelrahman Mohamed, Chi-Luen Feng, Hung-Yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. ICASSP 2024: 6890-6894 - [c44]Cheol Jun Cho, Abdelrahman Mohamed, Shang-Wen Li, Alan W. Black, Gopala Krishna Anumanchipalli:
SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in Hubert. ICASSP 2024: 12076-12080 - [c43]Chyi-Jiunn Lin, Guan-Ting Lin, Yung-Sung Chuang, Wei-Lun Wu, Shang-Wen Li, Abdelrahman Mohamed, Hung-Yi Lee, Lin-Shan Lee:
SpeechDPR: End-To-End Spoken Passage Retrieval For Open-Domain Spoken Question Answering. ICASSP 2024: 12476-12480 - [c42]Hu Xu, Saining Xie, Xiaoqing Ellen Tan, Po-Yao Huang, Russell Howes, Vasu Sharma, Shang-Wen Li, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
Demystifying CLIP Data. ICLR 2024 - [i52]Chyi-Jiunn Lin, Guan-Ting Lin, Yung-Sung Chuang, Wei-Lun Wu, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-Shan Lee:
SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering. CoRR abs/2401.13463 (2024) - [i51]Shu-Wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee:
A Large-Scale Evaluation of Speech Foundation Models. CoRR abs/2404.09385 (2024) - [i50]Jiawei Ma, Po-Yao Huang, Saining Xie, Shang-Wen Li, Luke Zettlemoyer, Shih-Fu Chang, Wen-Tau Yih, Hu Xu:
MoDE: CLIP Data Experts via Clustering. CoRR abs/2404.16030 (2024) - [i49]Vasu Sharma, Karthik Padthe, Newsha Ardalani, Kushal Tirumala, Russell Howes, Hu Xu, Po-Yao Huang, Shang-Wen Li, Armen Aghajanyan, Gargi Ghosh, Luke Zettlemoyer:
Text Quality-Based Pruning for Efficient Training of Language Models. CoRR abs/2405.01582 (2024) - [i48]Kai-Wei Chang, Haibin Wu, Yu-Kai Wang, Yuan-Kuei Wu, Hua Shen, Wei-Cheng Tseng, Iu-thing Kang, Shang-Wen Li, Hung-yi Lee:
SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks. CoRR abs/2408.13040 (2024) - [i47]Zhenyu Wang, Li Wan, Biqiao Zhang, Yiteng Huang, Shang-Wen Li, Ming Sun, Xin Lei, Zhaojun Yang:
Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting. CoRR abs/2408.13355 (2024) - [i46]Hu Xu, Po-Yao Huang, Xiaoqing Ellen Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer:
Altogether: Image Captioning via Re-aligning Alt-text. CoRR abs/2410.17251 (2024) - 2023
- [c41]Derek Xu, Shuyan Dong, Changhan Wang, Suyoun Kim, Zhaojiang Lin, Bing Liu, Akshat Shrivastava, Shang-Wen Li, Liang-Hsuan Tseng, Guan-Ting Lin, Alexei Baevski, Hung-yi Lee, Yizhou Sun, Wei Wang:
Introducing Semantics into Speech Encoders. ACL (1) 2023: 11413-11429 - [c40]Yung-Sung Chuang, Wei Fang, Shang-Wen Li, Wen-tau Yih, James R. Glass:
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering. ACL (Findings) 2023: 12131-12147 - [c39]Kai-Wei Chang, Ming-Hsin Chen, Yun-Ping Lin, Jing Neng Hsu, Paul Kuo-Ming Huang, Chien-Yu Huang, Shang-Wen Li, Hung-Yi Lee:
Prompting and Adapter Tuning For Self-Supervised Encoder-Decoder Speech Model. ASRU 2023: 1-8 - [c38]Jiatong Shi, William Chen, Dan Berrebbi, Hsiu-Hsuan Wang, Wei-Ping Huang, En-Pei Hu, Ho-Lam Chuang, Xuankai Chang, Yuxun Tang, Shang-Wen Li, Abdelrahman Mohamed, Hung-Yi Lee, Shinji Watanabe:
Findings of the 2023 ML-Superb Challenge: Pre-Training And Evaluation Over More Languages And Beyond. ASRU 2023: 1-8 - [c37]Ching-Feng Yeh, Po-Yao Huang, Vasu Sharma, Shang-Wen Li, Gargi Ghosh:
Flap: Fast Language-Audio Pre-Training. ASRU 2023: 1-8 - [c36]Zhenyu Wang, Li Wan, Biqiao Zhang, Yiteng Huang, Shang-Wen Li, Ming Sun, Xin Lei, Zhaojun Yang:
Disentangled Training with Adversarial Examples for Robust Small-Footprint Keyword Spotting. ICASSP 2023: 1-5 - [c35]Puyuan Peng, Shang-Wen Li, Okko Räsänen, Abdelrahman Mohamed, David Harwath:
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model. INTERSPEECH 2023: 391-395 - [c34]Jiatong Shi, Dan Berrebbi, William Chen, En-Pei Hu, Wei-Ping Huang, Ho-Lam Chung, Xuankai Chang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe:
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark. INTERSPEECH 2023: 884-888 - [c33]Guan-Wei Wu, Guan-Ting Lin, Shang-Wen Li, Hung-yi Lee:
Improving Textless Spoken Language Understanding with Discrete Units as Intermediate Target. INTERSPEECH 2023: 1503-1507 - [c32]Po-Yao Huang, Vasu Sharma, Hu Xu, Chaitanya Ryali, Haoqi Fan, Yanghao Li, Shang-Wen Li, Gargi Ghosh, Jitendra Malik, Christoph Feichtenhofer:
MAViL: Masked Audio-Video Learners. NeurIPS 2023 - [i45]Kai-Wei Chang, Yu-Kai Wang, Hua Shen, Iu-thing Kang, Wei-Cheng Tseng, Shang-Wen Li, Hung-yi Lee:
SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks. CoRR abs/2303.00733 (2023) - [i44]Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy V. Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mahmoud Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael G. Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jégou, Julien Mairal, Patrick Labatut, Armand Joulin, Piotr Bojanowski:
DINOv2: Learning Robust Visual Features without Supervision. CoRR abs/2304.07193 (2023) - [i43]Jiatong Shi, Dan Berrebbi, William Chen, Ho-Lam Chung, En-Pei Hu, Wei-Ping Huang, Xuankai Chang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe:
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark. CoRR abs/2305.10615 (2023) - [i42]Puyuan Peng, Shang-Wen Li, Okko Räsänen, Abdelrahman Mohamed, David Harwath:
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Mode. CoRR abs/2305.11435 (2023) - [i41]Yung-Sung Chuang, Wei Fang, Shang-Wen Li, Wen-tau Yih, James R. Glass:
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering. CoRR abs/2305.17080 (2023) - [i40]Guan-Wei Wu, Guan-Ting Lin, Shang-Wen Li, Hung-yi Lee:
Improving Textless Spoken Language Understanding with Discrete Units as Intermediate Target. CoRR abs/2305.18096 (2023) - [i39]Lili Yu, Bowen Shi, Ramakanth Pasunuru, Benjamin Muller, Olga Golovneva, Tianlu Wang, Arun Babu, Binh Tang, Brian Karrer, Shelly Sheynin, Candace Ross, Adam Polyak, Russell Howes, Vasu Sharma, Puxin Xu, Hovhannes Tamoyan, Oron Ashual, Uriel Singer, Shang-Wen Li, Susan Zhang, Richard James, Gargi Ghosh, Yaniv Taigman, Maryam Fazel-Zarandi, Asli Celikyilmaz, Luke Zettlemoyer, Armen Aghajanyan:
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning. CoRR abs/2309.02591 (2023) - [i38]Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. CoRR abs/2309.10787 (2023) - [i37]Hu Xu, Saining Xie, Xiaoqing Ellen Tan, Po-Yao Huang, Russell Howes, Vasu Sharma, Shang-Wen Li, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer:
Demystifying CLIP Data. CoRR abs/2309.16671 (2023) - [i36]Kai-Wei Chang, Ming-Hsin Chen, Yun-Ping Lin, Jing Neng Hsu, Paul Kuo-Ming Huang, Chien-Yu Huang, Shang-Wen Li, Hung-yi Lee:
Prompting and Adapter Tuning for Self-supervised Encoder-Decoder Speech Model. CoRR abs/2310.02971 (2023) - [i35]Jiatong Shi, William Chen, Dan Berrebbi, Hsiu-Hsuan Wang, Wei-Ping Huang, En-Pei Hu, Ho-Lam Chung, Xuankai Chang, Yuxun Tang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe:
Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond. CoRR abs/2310.05513 (2023) - [i34]Cheol Jun Cho, Abdelrahman Mohamed, Shang-Wen Li, Alan W. Black, Gopala Krishna Anumanchipalli:
SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT. CoRR abs/2310.10803 (2023) - [i33]Ming-Hao Hsu, Kai-Wei Chang, Shang-Wen Li, Hung-yi Lee:
An Exploration of In-Context Learning for Speech Language Model. CoRR abs/2310.12477 (2023) - [i32]Ching-Feng Yeh, Po-Yao Huang, Vasu Sharma, Shang-Wen Li, Gargi Ghosh:
FLAP: Fast Language-Audio Pre-training. CoRR abs/2311.01615 (2023) - [i31]Min-Han Shih, Ho-Lam Chung, Yu-Chi Pai, Ming-Hao Hsu, Guan-Ting Lin, Shang-Wen Li, Hung-Yi Lee:
GSQA: An End-to-End Model for Generative Spoken Question Answering. CoRR abs/2312.09781 (2023) - 2022
- [j3]Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath, Shinji Watanabe:
Self-Supervised Speech Representation Learning: A Review. IEEE J. Sel. Top. Signal Process. 16(6): 1179-1210 (2022) - [c31]Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-Wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities. ACL (1) 2022: 8479-8492 - [c30]Guan-Ting Lin, Shang-Wen Li, Hung-yi Lee:
Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition. INTERSPEECH 2022: 2198-2202 - [c29]Kai-Wei Chang, Wei-Cheng Tseng, Shang-Wen Li, Hung-yi Lee:
An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks. INTERSPEECH 2022: 5005-5009 - [c28]Guan-Ting Lin, Yung-Sung Chuang, Ho-Lam Chung, Shu-Wen Yang, Hsuan-Jui Chen, Shuyan Annie Dong, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-Shan Lee:
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering. INTERSPEECH 2022: 5165-5169 - [c27]Hongyin Luo, Shang-Wen Li, Mingye Gao, Seunghak Yu, James R. Glass:
Cooperative Self-training of Machine Reading Comprehension. NAACL-HLT 2022: 244-257 - [c26]Hung-yi Lee, Shang-Wen Li, Thang Vu:
Meta Learning for Natural Language Processing: A Survey. NAACL-HLT 2022: 666-684 - [c25]Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljacic, Shang-Wen Li, Scott Yih, Yoon Kim, James R. Glass:
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings. NAACL-HLT 2022: 4207-4218 - [c24]Xisen Jin, Dejiao Zhang, Henghui Zhu, Wei Xiao, Shang-Wen Li, Xiaokai Wei, Andrew O. Arnold, Xiang Ren:
Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora. NAACL-HLT 2022: 4764-4780 - [c23]Tzu-hsun Feng, Shuyan Annie Dong, Ching-Feng Yeh, Shu-Wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe, Abdelrahman Mohamed, Shang-Wen Li, Hung-yi Lee:
Superb @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning. SLT 2022: 1096-1103 - [c22]Zih-Ching Chen, Chin-Lun Fu, Chih-Ying Liu, Shang-Wen (Daniel) Li, Hung-yi Lee:
Exploring Efficient-Tuning Methods in Self-Supervised Speech Models. SLT 2022: 1120-1127 - [i30]Andy T. Liu, Wei Xiao, Henghui Zhu, Dejiao Zhang, Shang-Wen Li, Andrew O. Arnold:
QaNER: Prompting Question Answering Models for Few-shot Named Entity Recognition. CoRR abs/2203.01543 (2022) - [i29]Guan-Ting Lin, Yung-Sung Chuang, Ho-Lam Chung, Shu-Wen Yang, Hsuan-Jui Chen, Shuyan Dong, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-Shan Lee:
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering. CoRR abs/2203.04911 (2022) - [i28]Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-Wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Jeff Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities. CoRR abs/2203.06849 (2022) - [i27]Guan-Ting Lin, Shang-Wen Li, Hung-yi Lee:
Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition. CoRR abs/2203.14222 (2022) - [i26]Kai-Wei Chang, Wei-Cheng Tseng, Shang-Wen Li, Hung-yi Lee:
An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks. CoRR abs/2203.16773 (2022) - [i25]Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljacic, Shang-Wen Li, Wen-tau Yih, Yoon Kim, James R. Glass:
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings. CoRR abs/2204.10298 (2022) - [i24]Hung-yi Lee, Shang-Wen Li, Ngoc Thang Vu:
Meta Learning for Natural Language Processing: A Survey. CoRR abs/2205.01500 (2022) - [i23]Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath, Shinji Watanabe:
Self-Supervised Speech Representation Learning: A Review. CoRR abs/2205.10643 (2022) - [i22]Zih-Ching Chen, Chin-Lun Fu, Chih-Ying Liu, Shang-Wen Li, Hung-yi Lee:
Exploring Efficient-tuning Methods in Self-supervised Speech Models. CoRR abs/2210.06175 (2022) - [i21]Tzu-hsun Feng, Shuyan Annie Dong, Ching-Feng Yeh, Shu-Wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe, Abdelrahman Mohamed, Shang-Wen Li, Hung-yi Lee:
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning. CoRR abs/2210.08634 (2022) - [i20]Derek Xu, Shuyan Dong, Changhan Wang, Suyoun Kim, Zhaojiang Lin, Akshat Shrivastava, Shang-Wen Li, Liang-Hsuan Tseng, Alexei Baevski, Guan-Ting Lin, Hung-yi Lee, Yizhou Sun, Wei Wang:
Introducing Semantics into Speech Encoders. CoRR abs/2211.08402 (2022) - [i19]Po-Yao Huang, Vasu Sharma, Hu Xu, Chaitanya Ryali, Haoqi Fan, Yanghao Li, Shang-Wen Li, Gargi Ghosh, Jitendra Malik, Christoph Feichtenhofer:
MAViL: Masked Audio-Video Learners. CoRR abs/2212.08071 (2022) - 2021
- [j2]Andy T. Liu, Shang-Wen Li, Hung-yi Lee:
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2351-2366 (2021) - [c21]Shuyang Li, Jin Cao, Mukund Sridhar, Henghui Zhu, Shang-Wen Li, Wael Hamza, Julian J. McAuley:
Zero-shot Generalization in Dialog State Tracking through Generative Question Answering. EACL 2021: 1063-1074 - [c20]Dejiao Zhang, Shang-Wen Li, Wei Xiao, Henghui Zhu, Ramesh Nallapati, Andrew O. Arnold, Bing Xiang:
Pairwise Supervised Contrastive Learning of Sentence Representations. EMNLP (1) 2021: 5786-5798 - [c19]Cheng-I Lai, Yung-Sung Chuang, Hung-Yi Lee, Shang-Wen Li, James R. Glass:
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining. ICASSP 2021: 7468-7472 - [c18]Shu-Wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB: Speech Processing Universal PERformance Benchmark. Interspeech 2021: 1194-1198 - [c17]Hongyin Luo, James R. Glass, Garima Lalwani, Yi Zhang, Shang-Wen Li:
Joint Retrieval-Extraction Training for Evidence-Aware Dialog Response Selection. Interspeech 2021: 3241-3245 - [c16]Dejiao Zhang, Feng Nan, Xiaokai Wei, Shang-Wen Li, Henghui Zhu, Kathleen R. McKeown, Ramesh Nallapati, Andrew O. Arnold, Bing Xiang:
Supporting Clustering with Contrastive Learning. NAACL-HLT 2021: 5419-5430 - [c15]Po-Han Chi, Pei-Hung Chung, Tsung-Han Wu, Chun-Cheng Hsieh, Yen-Hao Chen, Shang-Wen Li, Hung-yi Lee:
Audio Albert: A Lite Bert for Self-Supervised Learning of Audio Representation. SLT 2021: 344-350 - [c14]Shang-Wen Li, Jason Krone, Shuyan Dong, Yi Zhang, Yaser Al-Onaizan:
Meta Learning to Classify Intent and Slot Labels with Noisy Few Shot Examples. SLT 2021: 1004-1011 - [i18]Shuyang Li, Jin Cao, Mukund Sridhar, Henghui Zhu, Shang-Wen Li, Wael Hamza, Julian J. McAuley:
Zero-shot Generalization in Dialog State Tracking through Generative Question Answering. CoRR abs/2101.08333 (2021) - [i17]Hongyin Luo, Shang-Wen Li, James R. Glass:
Knowledge Grounded Conversational Symptom Detection with Graph Memory Networks. CoRR abs/2101.09773 (2021) - [i16]Hongyin Luo, Shang-Wen Li, Seunghak Yu, James R. Glass:
Cooperative Learning of Zero-Shot Machine Reading Comprehension. CoRR abs/2103.07449 (2021) - [i15]Dejiao Zhang, Feng Nan, Xiaokai Wei, Shang-Wen Li, Henghui Zhu, Kathleen R. McKeown, Ramesh Nallapati, Andrew O. Arnold, Bing Xiang:
Supporting Clustering with Contrastive Learning. CoRR abs/2103.12953 (2021) - [i14]Shu-Wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB: Speech processing Universal PERformance Benchmark. CoRR abs/2105.01051 (2021) - [i13]Hongyin Luo, Shuyan Dong, Yung-Sung Chuang, Shang-Wen Li:
Meta-learning for downstream aware and agnostic pretraining. CoRR abs/2106.03270 (2021) - [i12]Yung-Sung Chuang, Mingye Gao, Hongyin Luo, James R. Glass, Hung-Yi Lee, Yun-Nung Chen, Shang-Wen Li:
Mitigating Biases in Toxic Language Detection through Invariant Rationalization. CoRR abs/2106.07240 (2021) - [i11]Dejiao Zhang, Shang-Wen Li, Wei Xiao, Henghui Zhu, Ramesh Nallapati, Andrew O. Arnold, Bing Xiang:
Pairwise Supervised Contrastive Learning of Sentence Representations. CoRR abs/2109.05424 (2021) - [i10]Xisen Jin, Dejiao Zhang, Henghui Zhu, Wei Xiao, Shang-Wen Li, Xiaokai Wei, Andrew O. Arnold, Xiang Ren:
Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora. CoRR abs/2110.08534 (2021) - 2020
- [c13]Hongyin Luo, Shang-Wen Li, James R. Glass:
Knowledge Grounded Conversational Symptom Detection with Graph Memory Networks. ClinicalNLP@EMNLP 2020: 136-145 - [c12]Jin Cao, Jun Wang, Wael Hamza, Kelly Vanee, Shang-Wen Li:
Style Attuned Pre-Training and Parameter Efficient Fine-Tuning for Spoken Language Understanding. INTERSPEECH 2020: 1570-1574 - [c11]Hongyin Luo, Shang-Wen Li, James R. Glass:
Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption. INTERSPEECH 2020: 3895-3899 - [i9]Po-Han Chi, Pei-Hung Chung, Tsung-Han Wu, Chun-Cheng Hsieh, Shang-wen Li, Hung-yi Lee:
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation. CoRR abs/2005.08575 (2020) - [i8]Hongyin Luo, Shang-Wen Li, James R. Glass:
Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption. CoRR abs/2005.11153 (2020) - [i7]Andy T. Liu, Shang-wen Li, Hung-yi Lee:
TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech. CoRR abs/2007.06028 (2020) - [i6]Jin Cao, Jun Wang, Wael Hamza, Kelly Vanee, Shang-Wen Li:
Style Attuned Pre-training and Parameter Efficient Fine-tuning for Spoken Language Understanding. CoRR abs/2010.04355 (2020) - [i5]Cheng-I Lai, Yung-Sung Chuang, Hung-yi Lee, Shang-wen Li, James R. Glass:
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining. CoRR abs/2010.13826 (2020) - [i4]Cheng-I Lai, Jin Cao, Sravan Bodapati, Shang-Wen Li:
Towards Semi-Supervised Semantics Understanding from Speech. CoRR abs/2011.06195 (2020) - [i3]Shang-Wen Li, Jason Krone, Shuyan Dong, Yi Zhang, Yaser Al-Onaizan:
Meta learning to classify intent and slot labels with noisy few shot examples. CoRR abs/2012.07516 (2020) - [i2]Shang-Wen Li:
Improving Learning Experience in MOOCs with Educational Content Linking. CoRR abs/2012.15826 (2020)
2010 – 2019
- 2017
- [b1]Shang-wen Li:
Improving learning experience in MOOCs with educational content linking. Massachusetts Institute of Technology, Cambridge, USA, 2017 - [i1]Maryam Fazel-Zarandi, Shang-Wen Li, Jin Cao, Jared Casale, Peter Henderson, David Whitney, Alborz Geramifard:
Learning Robust Dialog Policies in Noisy Environments. CoRR abs/1712.04034 (2017) - 2016
- [c10]Xiangrong Zhang, Chen Li, Shang-wen Li, Victor Zue:
Automated Segmentation of MOOC Lectures towards Customized Learning. ICALT 2016: 20-22 - [c9]Chengjie Sun, Shang-wen Li, Lei Lin:
Thread Structure Prediction for MOOC Discussion Forum. ICYCSEE (2) 2016: 92-101 - 2015
- [c8]Shang-wen Li, Victor Zue:
Would Linked MOOC Courseware Enhance Information Search? ICALT 2015: 397-399 - [c7]Shang-Wen (Daniel) Li, Piotr Mitros:
Learnersourced Recommendations for Remediation. ICALT 2015: 411-412 - [c6]Sheng-syun Shen, Hung-yi Lee, Shang-wen Li, Victor Zue, Lin-Shan Lee:
Structuring lectures in massive open online courses (MOOCs) for efficient learning by linking similar sections and predicting prerequisites. INTERSPEECH 2015: 1363-1367 - [c5]Shang-wen Li, Victor Zue:
Linking MOOC courseware to accommodate diverse learner backgrounds. SLaTE 2015: 155-160 - 2014
- [c4]Juho Kim, Philip J. Guo, Carrie J. Cai, Shang-Wen (Daniel) Li, Krzysztof Z. Gajos, Robert C. Miller:
Data-driven interaction techniques for improving navigation of educational videos. UIST 2014: 563-572 - 2013
- [j1]Yow-Bang Wang, Shang-wen Li, Lin-Shan Lee:
An Experimental Analysis on Integrating Multi-Stream Spectro-Temporal, Cepstral and Pitch Information for Mandarin Speech Recognition. IEEE Trans. Speech Audio Process. 21(10): 2006-2014 (2013) - 2011
- [c3]Shang-wen Li, Liang-Che Sun, Lin-Shan Lee:
Multi-stream spectro-temporal and cepstral features based on data-driven hierarchical phoneme clusters. ICASSP 2011: 5196-5199 - [c2]