Maosong Sun 0001
Person information
- affiliation: Tsinghua University, Department of Computer Science and Technology, Beijing, China
- affiliation (PhD 2004): City University of Hong Kong
Other persons with the same name
- Maosong Sun (aka: Mao-song Sun) — disambiguation page
2020 – today
- 2024
- [j66]Yuan Yao, Ao Zhang, Zhengyan Zhang, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun:
CPT: Colorful Prompt Tuning for pre-trained vision-language models. AI Open 5: 30-38 (2024) - [j65]Shangda Wu, Yue Yang, Zhaowen Wang, Xiaobing Li, Maosong Sun:
Generating chord progression from melody with flexible harmonic rhythm and controllable harmonic density. EURASIP J. Audio Speech Music. Process. 2024(1): 4 (2024) - [j64]Weize Chen, Xu Han, Yankai Lin, Kaichen He, Ruobing Xie, Jie Zhou, Zhiyuan Liu, Maosong Sun:
Hyperbolic Pre-Trained Language Model. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3101-3112 (2024) - [j63]Yujia Qin, Xiaozhi Wang, Yusheng Su, Yankai Lin, Ning Ding, Jing Yi, Weize Chen, Zhiyuan Liu, Juanzi Li, Lei Hou, Peng Li, Maosong Sun, Jie Zhou:
Exploring Universal Intrinsic Task Subspace for Few-Shot Learning via Prompt Tuning. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3631-3643 (2024) - [j62]Shihao Liang, Runchu Tian, Kunlun Zhu, Yujia Qin, Huadong Wang, Xin Cong, Zhiyuan Liu, Xiaojiang Liu, Maosong Sun:
Exploring Format Consistency for Instruction Tuning. Trans. Mach. Learn. Res. 2024 (2024) - [j61]Zhiyuan Wen, Jiannong Cao, Jiaxing Shen, Ruosong Yang, Shuaiqi Liu, Maosong Sun:
Personality-affected Emotion Generation in Dialog Systems. ACM Trans. Inf. Syst. 42(5): 134:1-134:27 (2024) - [c303]Cheng Qian, Bingxiang He, Zhong Zhuang, Jia Deng, Yujia Qin, Xin Cong, Zhong Zhang, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun:
Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents. ACL (1) 2024: 1088-1113 - [c302]Chaoqun He, Renjie Luo, Yuzhuo Bai, Shengding Hu, Zhen Leng Thai, Junhao Shen, Jinyi Hu, Xu Han, Yujie Huang, Yuxiang Zhang, Jie Liu, Lei Qi, Zhiyuan Liu, Maosong Sun:
OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems. ACL (1) 2024: 3828-3850 - [c301]Runchu Tian, Yining Ye, Yujia Qin, Xin Cong, Yankai Lin, Yinxu Pan, Yesai Wu, Haotian Hui, Weichuan Liu, Zhiyuan Liu, Maosong Sun:
DebugBench: Evaluating Debugging Capability of Large Language Models. ACL (Findings) 2024: 4173-4198 - [c300]Chen Qian, Yufan Dang, Jiahao Li, Wei Liu, Zihao Xie, Yifei Wang, Weize Chen, Cheng Yang, Xin Cong, Xiaoyin Che, Zhiyuan Liu, Maosong Sun:
Experiential Co-Learning of Software-Developing Agents. ACL (1) 2024: 5628-5640 - [c299]Yufei Huang, Xu Han, Maosong Sun:
FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection. ACL (1) 2024: 6262-6276 - [c298]Fuwen Luo, Chi Chen, Zihao Wan, Zhaolu Kang, Qidong Yan, Yingjie Li, Xiaolong Wang, Siyu Wang, Ziyue Wang, Xiaoyue Mi, Peng Li, Ning Ma, Maosong Sun, Yang Liu:
CODIS: Benchmarking Context-dependent Visual Comprehension for Multimodal Large Language Models. ACL (1) 2024: 10639-10659 - [c297]Zhicheng Guo, Sijie Cheng, Hao Wang, Shihao Liang, Yujia Qin, Peng Li, Zhiyuan Liu, Maosong Sun, Yang Liu:
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models. ACL (Findings) 2024: 11143-11156 - [c296]Yuanchi Zhang, Yile Wang, Zijun Liu, Shuo Wang, Xiaolong Wang, Peng Li, Maosong Sun, Yang Liu:
Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages. ACL (1) 2024: 11189-11204 - [c295]Ziyue Wang, Chi Chen, Yiqi Zhu, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu:
Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion. ACL (1) 2024: 11229-11245 - [c294]Chi Chen, Yiyang Du, Zheng Fang, Ziyue Wang, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu:
Model Composition for Multimodal Large Language Models. ACL (1) 2024: 11246-11262 - [c293]Zhiyu Yang, Zihan Zhou, Shuo Wang, Xin Cong, Xu Han, Yukun Yan, Zhenghao Liu, Zhixing Tan, Pengyuan Liu, Dong Yu, Zhiyuan Liu, Xiaodong Shi, Maosong Sun:
MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization. ACL (Findings) 2024: 11789-11804 - [c292]Haoyu Wang, Shuo Wang, Yukun Yan, Xujia Wang, Zhiyu Yang, Yuzhuang Xu, Zhenghao Liu, Liner Yang, Ning Ding, Xu Han, Zhiyuan Liu, Maosong Sun:
UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset. ACL (1) 2024: 11929-11942 - [c291]Hanqing Wang, Bowen Ping, Shuo Wang, Xu Han, Yun Chen, Zhiyuan Liu, Maosong Sun:
LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks. ACL (1) 2024: 12871-12882 - [c290]Zhu Liu, Cunliang Kong, Ying Liu, Maosong Sun:
Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical Semantics. ACL (Findings) 2024: 14551-14558 - [c289]Chen Qian, Wei Liu, Hongzhang Liu, Nuo Chen, Yufan Dang, Jiahao Li, Cheng Yang, Weize Chen, Yusheng Su, Xin Cong, Juyuan Xu, Dahai Li, Zhiyuan Liu, Maosong Sun:
ChatDev: Communicative Agents for Software Development. ACL (1) 2024: 15174-15186 - [c288]Xinrong Zhang, Yingfa Chen, Shengding Hu, Zihang Xu, Junhao Chen, Moo Khai Hao, Xu Han, Zhen Leng Thai, Shuo Wang, Zhiyuan Liu, Maosong Sun:
∞Bench: Extending Long Context Evaluation Beyond 100K Tokens. ACL (1) 2024: 15262-15277 - [c287]Chaojun Xiao, Yutao Sun, Yuan Yao, Xu Han, Wenbin Zhang, Zhiyuan Liu, Maosong Sun:
Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training. LREC/COLING 2024: 7324-7335 - [c286]Yingfa Chen, Zhengyan Zhang, Xu Han, Chaojun Xiao, Zhiyuan Liu, Chen Chen, Kuai Li, Tao Yang, Maosong Sun:
Robust and Scalable Model Editing for Large Language Models. LREC/COLING 2024: 14157-14172 - [c285]Weize Chen, Yusheng Su, Jingwei Zuo, Cheng Yang, Chenfei Yuan, Chi-Min Chan, Heyang Yu, Yaxi Lu, Yi-Hsin Hung, Chen Qian, Yujia Qin, Xin Cong, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie Zhou:
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors. ICLR 2024 - [c284]Jinyi Hu, Yuan Yao, Chongyi Wang, Shan Wang, Yinxu Pan, Qianyu Chen, Tianyu Yu, Hanghao Wu, Yue Zhao, Haoye Zhang, Xu Han, Yankai Lin, Jiao Xue, Dahai Li, Zhiyuan Liu, Maosong Sun:
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages. ICLR 2024 - [c283]Shengding Hu, Xin Liu, Xu Han, Xinrong Zhang, Chaoqun He, Weilin Zhao, Yankai Lin, Ning Ding, Zebin Ou, Guoyang Zeng, Zhiyuan Liu, Maosong Sun:
Predicting Emergent Abilities with Infinite Resolution Evaluation. ICLR 2024 - [c282]Yujia Qin, Shihao Liang, Yining Ye, Kunlun Zhu, Lan Yan, Yaxi Lu, Yankai Lin, Xin Cong, Xiangru Tang, Bill Qian, Sihan Zhao, Lauren Hong, Runchu Tian, Ruobing Xie, Jie Zhou, Mark Gerstein, Dahai Li, Zhiyuan Liu, Maosong Sun:
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs. ICLR 2024 - [c281]Ganqu Cui, Lifan Yuan, Ning Ding, Guanming Yao, Bingxiang He, Wei Zhu, Yuan Ni, Guotong Xie, Ruobing Xie, Yankai Lin, Zhiyuan Liu, Maosong Sun:
ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback. ICML 2024 - [c280]Zhengyan Zhang, Chaojun Xiao, Qiujieli Qin, Yankai Lin, Zhiyuan Zeng, Xu Han, Zhiyuan Liu, Ruobing Xie, Maosong Sun, Jie Zhou:
Exploring the Benefit of Activation Sparsity in Pre-training. ICML 2024 - [c279]Haiyang Bian, Yixin Chen, Xiaomin Dong, Chen Li, Minsheng Hao, Sijie Chen, Jinyi Hu, Maosong Sun, Lei Wei, Xuegong Zhang:
scMulan: A Multitask Generative Pre-Trained Language Model for Single-Cell Analysis. RECOMB 2024: 479-482 - [i250]Runchu Tian, Yining Ye, Yujia Qin, Xin Cong, Yankai Lin, Yinxu Pan, Yesai Wu, Zhiyuan Liu, Maosong Sun:
DebugBench: Evaluating Debugging Capability of Large Language Models. CoRR abs/2401.04621 (2024) - [i249]Cheng Qian, Shihao Liang, Yujia Qin, Yining Ye, Xin Cong, Yankai Lin, Yesai Wu, Zhiyuan Liu, Maosong Sun:
Investigate-Consolidate-Exploit: A General Strategy for Inter-Task Agent Self-Evolution. CoRR abs/2401.13996 (2024) - [i248]Junjie Fang, Likai Tang, Hongzhe Bi, Yujia Qin, Si Sun, Zhenyu Li, Haolun Li, Yongjian Li, Xin Cong, Yukun Yan, Xiaodong Shi, Sen Song, Yankai Lin, Zhiyuan Liu, Maosong Sun:
UniMem: Towards a Unified View of Long-Context Large Language Models. CoRR abs/2402.03009 (2024) - [i247]Zhengyan Zhang, Yixin Song, Guanghui Yu, Xu Han, Yankai Lin, Chaojun Xiao, Chenyang Song, Zhiyuan Liu, Zeyu Mi, Maosong Sun:
ReLU² Wins: Discovering Efficient Activation Functions for Sparse LLMs. CoRR abs/2402.03804 (2024) - [i246]Haoyu Wang, Shuo Wang, Yukun Yan, Xujia Wang, Zhiyu Yang, Yuzhuang Xu, Zhenghao Liu, Liner Yang, Ning Ding, Xu Han, Zhiyuan Liu, Maosong Sun:
UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset. CoRR abs/2402.04588 (2024) - [i245]Chaojun Xiao, Pengle Zhang, Xu Han, Guangxuan Xiao, Yankai Lin, Zhengyan Zhang, Zhiyuan Liu, Song Han, Maosong Sun:
InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory. CoRR abs/2402.04617 (2024) - [i244]Jiarui Zhang, Jinyi Hu, Mahyar Khayatkhoei, Filip Ilievski, Maosong Sun:
Exploring Perceptual Limitation of Multimodal Large Language Models. CoRR abs/2402.07384 (2024) - [i243]Cheng Qian, Bingxiang He, Zhong Zhuang, Jia Deng, Yujia Qin, Xin Cong, Zhong Zhang, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun:
Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents. CoRR abs/2402.09205 (2024) - [i242]Zhiyu Yang, Zihan Zhou, Shuo Wang, Xin Cong, Xu Han, Yukun Yan, Zhenghao Liu, Zhixing Tan, Pengyuan Liu, Dong Yu, Zhiyuan Liu, Xiaodong Shi, Maosong Sun:
MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization. CoRR abs/2402.11453 (2024) - [i241]Hanqing Wang, Bowen Ping, Shuo Wang, Xu Han, Yun Chen, Zhiyuan Liu, Maosong Sun:
LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks. CoRR abs/2402.11455 (2024) - [i240]Ziyue Wang, Chi Chen, Yiqi Zhu, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu:
Browse and Concentrate: Comprehending Multimodal Content via prior-LLM Context Fusion. CoRR abs/2402.12195 (2024) - [i239]Yuanchi Zhang, Yile Wang, Zijun Liu, Shuo Wang, Xiaolong Wang, Peng Li, Maosong Sun, Yang Liu:
Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages. CoRR abs/2402.12204 (2024) - [i238]Chi Chen, Yiyang Du, Zheng Fang, Ziyue Wang, Fuwen Luo, Peng Li, Ming Yan, Ji Zhang, Fei Huang, Maosong Sun, Yang Liu:
Model Composition for Multimodal Large Language Models. CoRR abs/2402.12750 (2024) - [i237]Chenyang Song, Xu Han, Zhengyan Zhang, Shengding Hu, Xiyu Shi, Kuai Li, Chen Chen, Zhiyuan Liu, Guangli Li, Tao Yang, Maosong Sun:
ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models. CoRR abs/2402.13516 (2024) - [i236]Yang Liu, Meng Xu, Shuo Wang, Liner Yang, Haoyu Wang, Zhenghao Liu, Cunliang Kong, Yun Chen, Yang Liu, Maosong Sun, Erhong Yang:
OMGEval: An Open Multilingual Generative Evaluation Benchmark for Large Language Models. CoRR abs/2402.13524 (2024) - [i235]Fuwen Luo, Chi Chen, Zihao Wan, Zhaolu Kang, Qidong Yan, Yingjie Li, Xiaolong Wang, Siyu Wang, Ziyue Wang, Xiaoyue Mi, Peng Li, Ning Ma, Maosong Sun, Yang Liu:
CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models. CoRR abs/2402.13607 (2024) - [i234]Xinrong Zhang, Yingfa Chen, Shengding Hu, Zihang Xu, Junhao Chen, Moo Khai Hao, Xu Han, Zhen Leng Thai, Shuo Wang, Zhiyuan Liu, Maosong Sun:
∞Bench: Extending Long Context Evaluation Beyond 100K Tokens. CoRR abs/2402.13718 (2024) - [i233]Weilin Zhao, Yuxiang Huang, Xu Han, Chaojun Xiao, Zhiyuan Liu, Maosong Sun:
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting. CoRR abs/2402.13720 (2024) - [i232]Chaoqun He, Renjie Luo, Yuzhuo Bai, Shengding Hu, Zhen Leng Thai, Junhao Shen, Jinyi Hu, Xu Han, Yujie Huang, Yuxiang Zhang, Jie Liu, Lei Qi, Zhiyuan Liu, Maosong Sun:
OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems. CoRR abs/2402.14008 (2024) - [i231]Yufei Huang, Shengding Hu, Xu Han, Zhiyuan Liu, Maosong Sun:
Unified View of Grokking, Double Descent and Emergent Abilities: A Perspective from Circuits Competition. CoRR abs/2402.15175 (2024) - [i230]Jingsi Yu, Cunliang Kong, Liner Yang, Meishan Zhang, Lin Zhu, Yujie Wang, Haozhe Lin, Maosong Sun, Erhong Yang:
Cross-domain Chinese Sentence Pattern Parsing. CoRR abs/2402.16311 (2024) - [i229]Qinyu Luo, Yining Ye, Shihao Liang, Zhong Zhang, Yujia Qin, Yaxi Lu, Yesai Wu, Xin Cong, Yankai Lin, Yingli Zhang, Xiaoyin Che, Zhiyuan Liu, Maosong Sun:
RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation. CoRR abs/2402.16667 (2024) - [i228]Weize Chen, Chenfei Yuan, Jiarui Yuan, Yusheng Su, Chen Qian, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun:
Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication. CoRR abs/2402.18439 (2024) - [i227]Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding, Jiexin Wang, Huimin Chen, Bowen Sun, Ruobing Xie, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun:
Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment. CoRR abs/2402.19085 (2024) - [i226]Zhu Liu, Cunliang Kong, Ying Liu, Maosong Sun:
Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical Semantics. CoRR abs/2403.01509 (2024) - [i225]Xinpeng Wang, Shitong Duan, Xiaoyuan Yi, Jing Yao, Shanlin Zhou, Zhihua Wei, Peng Zhang, Dongkuan Xu, Maosong Sun, Xing Xie:
On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models. CoRR abs/2403.04204 (2024) - [i224]Zhicheng Guo, Sijie Cheng, Hao Wang, Shihao Liang, Yujia Qin, Peng Li, Zhiyuan Liu, Maosong Sun, Yang Liu:
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models. CoRR abs/2403.07714 (2024) - [i223]Ning Ding, Yulin Chen, Ganqu Cui, Xingtai Lv, Weilin Zhao, Ruobing Xie, Bowen Zhou, Zhiyuan Liu, Maosong Sun:
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models. CoRR abs/2403.08281 (2024) - [i222]Sun Ao, Weilin Zhao, Xu Han, Cheng Yang, Zhiyuan Liu, Chuan Shi, Maosong Sun, Shengnan Wang, Teng Su:
BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences. CoRR abs/2403.09347 (2024) - [i221]Ruyi Xu, Yuan Yao, Zonghao Guo, Junbo Cui, Zanlin Ni, Chunjiang Ge, Tat-Seng Chua, Zhiyuan Liu, Maosong Sun, Gao Huang:
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images. CoRR abs/2403.11703 (2024) - [i220]Yingfa Chen, Zhengyan Zhang, Xu Han, Chaojun Xiao, Zhiyuan Liu, Chen Chen, Kuai Li, Tao Yang, Maosong Sun:
Robust and Scalable Model Editing for Large Language Models. CoRR abs/2403.17431 (2024) - [i219]Lifan Yuan, Ganqu Cui, Hanbin Wang, Ning Ding, Xingyao Wang, Jia Deng, Boji Shan, Huimin Chen, Ruobing Xie, Yankai Lin, Zhenghao Liu, Bowen Zhou, Hao Peng, Zhiyuan Liu, Maosong Sun:
Advancing LLM Reasoning Generalists with Preference Trees. CoRR abs/2404.02078 (2024) - [i218]Shengding Hu, Yuge Tu, Xu Han, Chaoqun He, Ganqu Cui, Xiang Long, Zhi Zheng, Yewei Fang, Yuxiang Huang, Weilin Zhao, Xinrong Zhang, Zhen Leng Thai, Kai Zhang, Chongyi Wang, Yuan Yao, Chenyang Zhao, Jie Zhou, Jie Cai, Zhongwu Zhai, Ning Ding, Chao Jia, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun:
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies. CoRR abs/2404.06395 (2024) - [i217]Zhiyuan Wen, Jiannong Cao, Jiaxing Shen, Ruosong Yang, Shuaiqi Liu, Maosong Sun:
Personality-affected Emotion Generation in Dialog Systems. CoRR abs/2404.07229 (2024) - [i216]Chaoqun He, Renjie Luo, Shengding Hu, Yuanqian Zhao, Jie Zhou, Hanghao Wu, Jiajie Zhang, Xu Han, Zhiyuan Liu, Maosong Sun:
UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs. CoRR abs/2404.07584 (2024) - [i215]Pablo Biedma, Xiaoyuan Yi, Linus Huang, Maosong Sun, Xing Xie:
Beyond Human Norms: Unveiling Unique Values of Large Language Models through Interdisciplinary Approaches. CoRR abs/2404.12744 (2024) - [i214]Zhili Cheng, Zhitong Wang, Jinyi Hu, Shengding Hu, An Liu, Yuge Tu, Pengkai Li, Lei Shi, Zhiyuan Liu, Maosong Sun:
LEGENT: Open Platform for Embodied Agents. CoRR abs/2404.18243 (2024) - [i213]Chen Qian, Jiahao Li, Yufan Dang, Wei Liu, Yifei Wang, Zihao Xie, Weize Chen, Cheng Yang, Yingli Zhang, Zhiyuan Liu, Maosong Sun:
Iterative Experience Refinement of Software-Developing Agents. CoRR abs/2405.04219 (2024) - [i212]Tianyu Yu, Haoye Zhang, Yuan Yao, Yunkai Dang, Da Chen, Xiaoman Lu, Ganqu Cui, Taiwen He, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun:
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness. CoRR abs/2405.17220 (2024) - [i211]Ao Sun, Weilin Zhao, Xu Han, Cheng Yang, Zhiyuan Liu, Chuan Shi, Maosong Sun:
Seq1F1B: Efficient Sequence-Level Pipeline Parallelism for Large Language Model Training. CoRR abs/2406.03488 (2024) - [i210]Chen Qian, Zihao Xie, Yifei Wang, Wei Liu, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, Zhiyuan Liu, Maosong Sun:
Scaling Large-Language-Model-based Multi-Agent Collaboration. CoRR abs/2406.07155 (2024) - [i209]Bowen Ping, Shuo Wang, Hanqing Wang, Xu Han, Yuzhuang Xu, Yukun Yan, Yun Chen, Baobao Chang, Zhiyuan Liu, Maosong Sun:
Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models. CoRR abs/2406.08903 (2024) - [i208]Wentong Chen, Junbo Cui, Jinyi Hu, Yujia Qin, Junjie Fang, Yue Zhao, Chongyi Wang, Jun Liu, Guirong Chen, Yupeng Huo, Yuan Yao, Yankai Lin, Zhiyuan Liu, Maosong Sun:
GUICourse: From General Vision Language Models to Versatile GUI Agents. CoRR abs/2406.11317 (2024) - [i207]Bingxiang He, Ning Ding, Cheng Qian, Jia Deng, Ganqu Cui, Lifan Yuan, Huan-ang Gao, Huimin Chen, Zhiyuan Liu, Maosong Sun:
Zero-Shot Generalization during Instruction Tuning: Insights from Similarity and Granularity. CoRR abs/2406.11721 (2024) - [i206]Fengxiang Wang, Hongzhen Wang, Di Wang, Zonghao Guo, Zhenyu Zhong, Long Lan, Jing Zhang, Zhiyuan Liu, Maosong Sun:
Scaling Efficient Masked Autoencoder Learning on Large Remote Sensing Dataset. CoRR abs/2406.11933 (2024) - [i205]Xinrong Zhang, Yingfa Chen, Shengding Hu, Xu Han, Zihang Xu, Yuanwei Xu, Weilin Zhao, Maosong Sun, Zhiyuan Liu:
Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models. CoRR abs/2406.15718 (2024) - [i204]Shangda Wu, Yashan Wang, Xiaobing Li, Feng Yu, Maosong Sun:
MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing. CoRR abs/2407.02277 (2024) - [i203]Weize Chen, Ziming You, Ran Li, Yitong Guan, Chen Qian, Chenyang Zhao, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun:
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence. CoRR abs/2407.07061 (2024) - [i202]Junhao Chen, Shengding Hu, Zhiyuan Liu, Maosong Sun:
States Hidden in Hidden States: LLMs Emerge Discrete State Representations Implicitly. CoRR abs/2407.11421 (2024) - [i201]Zheni Zeng, Jiayi Chen, Huimin Chen, Yukun Yan, Yuxuan Chen, Zhenghao Liu, Zhiyuan Liu, Maosong Sun:
PersLLM: A Personified Training Approach for Large Language Models. CoRR abs/2407.12393 (2024) - [i200]Kunlun Zhu, Yifan Luo, Dingling Xu, Ruobing Wang, Shi Yu, Shuo Wang, Yukun Yan, Zhenghao Liu, Xu Han, Zhiyuan Liu, Maosong Sun:
RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework. CoRR abs/2408.01262 (2024) - [i199]Yuan Yao, Tianyu Yu, Ao Zhang, Chongyi Wang, Junbo Cui, Hongji Zhu, Tianchi Cai, Haoyu Li, Weilin Zhao, Zhihui He, Qianyu Chen, Huarong Zhou, Zhensheng Zou, Haoye Zhang, Shengding Hu, Zhi Zheng, Jie Zhou, Jie Cai, Xu Han, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun:
MiniCPM-V: A GPT-4V Level MLLM on Your Phone. CoRR abs/2408.01800 (2024)
- 2023
- [j60]Qinhong Zhou, Peng Li, Yang Liu, Yuyang Guan, Qizhou Xing, Ming Chen, Maosong Sun, Yang Liu:
AdaDS: Adaptive data selection for accelerating pre-trained language model knowledge distillation. AI Open 4: 56-63 (2023) - [j59]Chaojun Xiao, Ruobing Xie, Yuan Yao, Zhiyuan Liu, Maosong Sun, Xu Zhang, Leyu Lin:
UPRec: User-aware Pre-training for sequential Recommendation. AI Open 4: 137-144 (2023) - [j58]Zhengyan Zhang, Guangxuan Xiao, Yongwei Li, Tian Lv, Fanchao Qi, Zhiyuan Liu, Yasheng Wang, Xin Jiang, Maosong Sun:
Red Alarm for Pre-trained Models: Universal Vulnerability to Neuron-level Backdoor Attacks. Mach. Intell. Res. 20(2): 180-193 (2023) - [j57]Ning Ding, Yujia Qin, Guang Yang, Fuchao Wei, Zonghan Yang, Yusheng Su, Shengding Hu, Yulin Chen, Chi-Min Chan, Weize Chen, Jing Yi, Weilin Zhao, Xiaozhi Wang, Zhiyuan Liu, Hai-Tao Zheng, Jianfei Chen, Yang Liu, Jie Tang, Juanzi Li, Maosong Sun:
Parameter-efficient fine-tuning of large-scale pre-trained language models. Nat. Mac. Intell. 5(3): 220-235 (2023) - [j56]Chenglei Si, Zhengyan Zhang, Yingfa Chen, Fanchao Qi, Xiaozhi Wang, Zhiyuan Liu, Yasheng Wang, Qun Liu, Maosong Sun:
Sub-Character Tokenization for Chinese Pretrained Language Models. Trans. Assoc. Comput. Linguistics 11: 469-487 (2023) - [j55]Biru Zhu, Ganqu Cui, Yangyi Chen, Yujia Qin, Lifan Yuan, Chong Fu, Yangdong Deng, Zhiyuan Liu, Maosong Sun, Ming Gu:
Removing Backdoors in Pre-trained Models by Regularized Continual Pre-training. Trans. Assoc. Comput. Linguistics 11: 1608-1623 (2023) - [j54]Yu Zhang, Ziya Zhou, Xiaobing Li, Feng Yu, Maosong Sun:
CCOM-HuQin: An Annotated Multimodal Chinese Fiddle Performance Dataset. Trans. Int. Soc. Music. Inf. Retr. 6(1): 60-74 (2023) - [j53]Cheng Yang, Hao Wang, Jian Tang, Chuan Shi, Maosong Sun, Ganqu Cui, Zhiyuan Liu:
Full-Scale Information Diffusion Prediction With Reinforced Recurrent Networks. IEEE Trans. Neural Networks Learn. Syst. 34(5): 2271-2283 (2023) - [c278]Yuan Yao, Tianyu Yu, Ao Zhang, Mengdi Li, Ruobing Xie, Cornelius Weber, Zhiyuan Liu, Hai-Tao Zheng, Stefan Wermter, Tat-Seng Chua, Maosong Sun:
Visually Grounded Commonsense Knowledge Acquisition. AAAI 2023: 6583-6592 - [c277]Xingtai Lv, Ning Ding, Yujia Qin, Zhiyuan Liu, Maosong Sun:
Parameter-efficient Weight Ensembling Facilitates Task-level Knowledge Transfer. ACL (2) 2023: 270-282 - [c276]Shengding Hu, Ning Ding, Weilin Zhao, Xingtai Lv, Zhen Zhang, Zhiyuan Liu, Maosong Sun:
OpenDelta: A Plug-and-play Library for Parameter-efficient Adaptation of Pre-trained Models. ACL (demo) 2023: 274-281 - [c275]Zhengyan Zhang, Zhiyuan Zeng, Yankai Lin, Chaojun Xiao, Xiaozhi Wang, Xu Han, Zhiyuan Liu, Ruobing Xie, Maosong Sun, Jie Zhou:
Emergent Modularity in Pre-trained Transformers. ACL (Findings) 2023: 4066-4083 - [c274]Shengding Hu, Yifan Luo, Huadong Wang, Xingyi Cheng, Zhiyuan Liu, Maosong Sun:
Won't Get Fooled Again: Answering Questions with False Premises. ACL (1) 2023: 5626-5643 - [c273]Yuanchi Zhang, Peng Li, Maosong Sun, Yang Liu:
Continual Knowledge Distillation for Neural Machine Translation. ACL (1) 2023: 7978-7996 - [c272]Chenglei Si, Zhengyan Zhang, Yingfa Chen, Xiaozhi Wang, Zhiyuan Liu, Maosong Sun:
READIN: A Chinese Multi-Task Benchmark with Realistic and Diverse Input Noises. ACL (1) 2023: 8272-8285 - [c271]Chi Chen, Peng Li, Maosong Sun, Yang Liu:
Weakly Supervised Vision-and-Language Pre-training with Relative Representations. ACL (1) 2023: 8341-8355 - [c270]Yujia Qin, Zihan Cai, Dian Jin, Lan Yan, Shihao Liang, Kunlun Zhu, Yankai Lin, Xu Han, Ning Ding, Huadong Wang, Ruobing Xie, Fanchao Qi, Zhiyuan Liu, Maosong Sun, Jie Zhou:
WebCPM: Interactive Web Search for Chinese Long-form Question Answering. ACL (1) 2023: 8968-8988 - [c269]Yangyi Chen, Hongcheng Gao, Ganqu Cui, Lifan Yuan, Dehan Kong, Hanlu Wu, Ning Shi, Bo Yuan, Longtao Huang, Hui Xue, Zhiyuan Liu, Maosong Sun, Heng Ji:
From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework. ACL (Findings) 2023: 9607-9632 - [c268]