


Ge Zhang 0009
Person information
- affiliation: ByteDance Inc.
- affiliation (former): 01.AI
- affiliation (PhD): University of Waterloo, Canada
- affiliation (former): Beijing Academy of Artificial Intelligence, China
- affiliation (former): University of Michigan, Ann Arbor, MI, USA
Other persons with the same name
- Ge Zhang — disambiguation page
- Ge Zhang 0001 — Karlstad University, Karlstad, Sweden
- Ge Zhang 0002 — Macquarie University, Sydney, NSW, Australia
- Ge Zhang 0003 — Massachusetts Institute of Technology, MIT, Department of Chemical Engineering, Cambridge, MA, USA
- Ge Zhang 0004 — Xinjiang University, Urumqi, China
- Ge Zhang 0005 — University of Kaiserslautern, Department of Computer Science, Integrated Communication Systems Lab, Germany
- Ge Zhang 0006 — Northwestern Polytechnical University, School of Electronics and Information, Xi'an, China
- Ge Zhang 0007 — Chinese Academy of Sciences, Institute of Computing Technology, Key Laboratory of Computer System and Architecture, Beijing, China
- Ge Zhang 0008 — Henan University of Chinese Medicine, School of Information Engineering, Zhengzhou, China
(and 1 more)
2020 – today
- 2025
[j5]Zhenhua Li, Lei Zhang, Songlin Yin, Ge Zhang:
MSCFF-Net: multi-scale context feature fusion network for polyp segmentation. Multim. Syst. 31(3): 189 (2025)
[j4]Tianle Li, Ge Zhang, Quy Duc Do, Xiang Yue, Wenhu Chen:
Long-context LLMs Struggle with Long In-context Learning. Trans. Mach. Learn. Res. 2025 (2025)
[j3]Zhouliang Yu, Yuhuan Yuan, Tim Z. Xiao, Fuxiang Frank Xia, Jie Fu, Ge Zhang, Ge Lin, Weiyang Liu:
Generating Symbolic World Models via Test-time Scaling of Large Language Models. Trans. Mach. Learn. Res. 2025 (2025)
[c50]Xianjie Wu, Jian Yang, Linzheng Chai, Ge Zhang, Jiaheng Liu, Xeron Du, Di Liang, Daixin Shu, Xianfu Cheng, Tianzhen Sun, Tongliang Li, Zhoujun Li, Guanglin Niu:
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering. AAAI 2025: 25497-25506
[c49]King Zhu, Qianbo Zang, Shian Jia, Siwei Wu, Feiteng Fang, Yizhi Li, Shuyue Guo, Tianyu Zheng, Jiawei Guo, Bo Li, Haoning Wu, Xingwei Qu, Jian Yang, Ruibo Liu, Xiang Yue, Jiaheng Liu, Chenghua Lin, Hamid Alinejad-Rokny, Min Yang, Shiwen Ni, Wenhao Huang, Ge Zhang:
LIME: Less Is More for MLLM Evaluation. ACL (Findings) 2025: 9086-9121
[c48]Chenhao Zhang, Xi Feng, Yuelin Bai, Xeron Du, Jinchang Hou, Kaixin Deng, Guangzeng Han, Qinrui Li, Bingli Wang, Jiaheng Liu, Xingwei Qu, Yifei Zhang, Qixuan Zhao, Yiming Liang, Ziqiang Liu, Feiteng Fang, Min Yang, Wenhao Huang, Chenghua Lin, Ge Zhang, Shiwen Ni:
Can MLLMs Understand the Deep Implication Behind Chinese Images? ACL (1) 2025: 14369-14402
[c47]Xiang Yue, Tianyu Zheng, Yuansheng Ni, Yubo Wang, Kai Zhang, Shengbang Tong, Yuxuan Sun, Botao Yu, Ge Zhang, Huan Sun, Yu Su, Wenhu Chen, Graham Neubig:
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark. ACL (1) 2025: 15134-15186
[c46]Jiaheng Liu, Ken Deng, Congnan Liu, Jian Yang, Shukai Liu, He Zhu, Peng Zhao, Linzheng Chai, Yanan Wu, Ke Jin, Ge Zhang, Zekun Moore Wang, Guoan Zhang, Yingshui Tan, Bangyu Xiang, Zhaoxiang Zhang, Wenbo Su, Bo Zheng:
M2RC-EVAL: Massively Multilingual Repository-level Code Completion Evaluation. ACL (1) 2025: 15661-15684
[c45]Yancheng He, Shilong Li, Jiaheng Liu, Weixun Wang, Xingyuan Bu, Ge Zhang, Z. Y. Peng, Zhaoxiang Zhang, Zhicheng Zheng, Wenbo Su, Bo Zheng:
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning? ACL (1) 2025: 18468-18489
[c44]Siyuan Fang, Kaijing Ma, Tianyu Zheng, Xeron Du, Ningxuan Lu, Ge Zhang, Qingkun Tang:
KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model's Reasoning Path Aggregation. ACL (Findings) 2025: 24724-24746
[c43]Siming Huang, Tianhao Cheng, Jason Klein Liu, Weidi Xu, Jiaran Hao, Liuyihan Song, Yang Xu, Jian Yang, Jiaheng Liu, Chenchen Zhang, Linzheng Chai, Ruifeng Yuan, Xianzhen Luo, Qiufeng Wang, YuanTao Fan, Qingfu Zhu, Zhaoxiang Zhang, Yang Gao, Jie Fu, Qian Liu, Houyi Li, Ge Zhang, Yuan Qi, Yinghui Xu, Wei Chu, Zili Wang:
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models. ACL (1) 2025: 33167-33193
[c42]Zekun Moore Wang, King Zhu, Chunpu Xu, Wangchunshu Zhou, Jiaheng Liu, Yibo Zhang, Jessie Wang, Ning Shi, Siyu Li, Yizhi Li, Haoran Que, Zhaoxiang Zhang, Yuanxing Zhang, Ge Zhang, Ke Xu, Jie Fu, Wenhao Huang:
MIO: A Foundation Model on Multimodal Tokens. EMNLP 2025: 5077-5099
[c41]Linzheng Chai, Shukai Liu, Jian Yang, Yuwei Yin, Ke Jin, Jiaheng Liu, Tao Sun, Ge Zhang, Changyu Ren, Hongcheng Guo, Noah Wang, Boyang Wang, Xianjie Wu, Bing Wang, Tongliang Li, Liqun Yang, Sufeng Duan, Zhaoxiang Zhang, Zhoujun Li:
McEval: Massively Multilingual Code Evaluation. ICLR 2025
[c40]Bofei Gao, Feifan Song, Zhe Yang, Zefan Cai, Yibo Miao, Qingxiu Dong, Lei Li, Chenghao Ma, Liang Chen, Runxin Xu, Zhengyang Tang, Benyou Wang, Daoguang Zan, Shanghaoran Quan, Ge Zhang, Lei Sha, Yichang Zhang, Xuancheng Ren, Tianyu Liu, Baobao Chang:
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models. ICLR 2025
[c39]Kaijing Ma, Xeron Du, Yunran Wang, Haoran Zhang, Zhoufutu Wen, Xingwei Qu, Jian Yang, Jiaheng Liu, Minghao Liu, Xiang Yue, Wenhao Huang, Ge Zhang:
KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks. ICLR 2025
[c38]Xingwei Qu, Yuelin Bai, Yinghao Ma, Ziya Zhou, Ka Man Lo, Jiaheng Liu, Ruibin Yuan, Lejun Min, Xueling Liu, Tianyu Zhang, Xeron Du, Shuyue Guo, Yiming Liang, Yizhi Li, Shangda Wu, Junting Zhou, Tianyu Zheng, Ziyang Ma, Fengze Han, Wei Xue, Gus Xia, Emmanouil Benetos, Xiang Yue, Chenghua Lin, Xu Tan, Wenhao Huang, Jie Fu, Ge Zhang:
MuPT: A Generative Symbolic Music Pretrained Transformer. ICLR 2025
[c37]Pei Wang, Yanan Wu, Noah Wang, Jiaheng Liu, Xiaoshuai Song, Z. Y. Peng, Ken Deng, Chenchen Zhang, Jiakai Wang, Junran Peng, Ge Zhang, Hangyu Guo, Zhaoxiang Zhang, Wenbo Su, Bo Zheng:
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models. ICLR 2025
[c36]Cong Wei, Zheyang Xiong, Weiming Ren, Xeron Du, Ge Zhang, Wenhu Chen:
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision. ICLR 2025
[c35]Tianyu Zhang, Suyuchen Wang, Lu Li, Ge Zhang, Perouz Taslakian, Sai Rajeswar, Jie Fu, Bang Liu, Yoshua Bengio:
VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text. ICLR 2025
[c34]Shangda Wu, Yashan Wang, Ruibin Yuan, Zhancheng Guo, Xu Tan, Ge Zhang, Monan Zhou, Jing Chen, Xuefeng Mu, Yuejie Gao, Yuanliang Dong, Jiafeng Liu, Xiaobing Li, Feng Yu, Maosong Sun:
CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models. NAACL (Findings) 2025: 435-451
[c33]Yuelin Bai, Xeron Du, Yiming Liang, Leo Jin, Junting Zhou, Ziqiang Liu, Feiteng Fang, Mingshan Chang, Tianyu Zheng, Xincheng Zhang, Nuo Ma, Zekun Moore Wang, Ruibin Yuan, Haihong Wu, Hongquan Lin, Wenhao Huang, Jiajun Zhang, Chenghua Lin, Jie Fu, Min Yang, Shiwen Ni, Ge Zhang:
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning. NAACL (Findings) 2025: 8190-8205
[c32]Tao Sun, Jian Xu, Yuanpeng Li, Zhao Yan, Ge Zhang, Lintao Xie, Lu Geng, Zheng Wang, Yueyan Chen, Qin Lin, Wenbo Duan, Kaixin Sui, Yuanshuo Zhu:
BitsAI-CR: Automated Code Review via LLM in Practice. SIGSOFT FSE Companion 2025: 274-285
[i146]Yiming Liang, Tianyu Zheng, Xinrun Du, Ge Zhang, Jiaheng Liu, Xingwei Qu, Wenqiang Zu, Xingrun Xing, Chujie Zheng, Lei Ma, Wenhu Chen, Guoyin Wang, Zhaoxiang Zhang, Wenhao Huang, Xiang Yue, Jiajun Zhang:
Aligning Instruction Tuning with Pre-training. CoRR abs/2501.09368 (2025)
[i145]Tao Sun, Jian Xu, Yuanpeng Li, Zhao Yan, Ge Zhang, Lintao Xie, Lu Geng, Zheng Wang, Yueyan Chen, Qin Lin, Wenbo Duan, Kaixin Sui:
BitsAI-CR: Automated Code Review via LLM in Practice. CoRR abs/2501.15134 (2025)
[i144]Zhouliang Yu, Yuhuan Yuan, Tim Z. Xiao, Fuxiang Frank Xia, Jie Fu, Ge Zhang, Ge Lin, Weiyang Liu:
Generating Symbolic World Models via Test-time Scaling of Large Language Models. CoRR abs/2502.04728 (2025)
[i143]Jiajun Shi, Chaoren Wei, Liqun Yang, Zekun Moore Wang, Chenghao Yang, Ge Zhang, Stephen Huang, Tao Peng, Jian Yang, Zhoufutu Wen:
CryptoX : Compositional Reasoning Evaluation of Large Language Models. CoRR abs/2502.07813 (2025)
[i142]Xianfu Cheng, Wei Zhang, Shiwei Zhang, Jian Yang, Xiangyuan Guan, Xianjie Wu, Xiang Li, Ge Zhang, Jiaheng Liu, Yuying Mai, Yutao Zeng, Zhoufutu Wen, Ke Jin, Baorui Wang, Weixiao Zhou, Yunhong Lu, Tongliang Li, Wenhao Huang, Zhoujun Li:
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models. CoRR abs/2502.13059 (2025)
[i141]Liumeng Xue, Ziya Zhou, Jiahao Pan, Zixuan Li, Shuai Fan, Yinghao Ma, Sitong Cheng, Dongchao Yang, Haohan Guo, Yujia Xiao, Xinsheng Wang, Zixuan Shen, Chuanbo Zhu, Xinshen Zhang, Tianchi Liu, Ruibin Yuan, Zeyue Tian, Haohe Liu, Emmanouil Benetos, Ge Zhang, Yike Guo, Wei Xue:
Audio-FLAN: A Preliminary Release. CoRR abs/2502.16584 (2025)
[i140]Alexander Zhang, Marcus Dong, Jiaheng Liu, Wei Zhang, Yejie Wang, Jian Yang, Ge Zhang, Tianyu Liu, Zhongyuan Peng, Yingshui Tan, Yuanxing Zhang, Zhexu Wang, Weixun Wang, Yancheng He, Ken Deng, Wangchunshu Zhou, Wenhao Huang, Zhaoxiang Zhang:
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models. CoRR abs/2502.16614 (2025)
[i139]Yancheng He, Shilong Li, Jiaheng Liu, Weixun Wang, Xingyuan Bu, Ge Zhang, Zhongyuan Peng, Zhaoxiang Zhang, Zhicheng Zheng, Wenbo Su, Bo Zheng:
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning? CoRR abs/2502.19361 (2025)
[i138]Ruibin Yuan, Hanfeng Lin, Shuyue Guo, Ge Zhang, Jiahao Pan, Yongyi Zang, Haohe Liu, Yiming Liang, Wenye Ma, Xingjian Du, Xinrun Du, Zhen Ye, Tianyu Zheng, Yinghao Ma, Minghao Liu, Zeyue Tian, Ziya Zhou, Liumeng Xue, Xingwei Qu, Yizhi Li, Shangda Wu, Tianhao Shen, Ziyang Ma, Jun Zhan, Chunhui Wang, Yatian Wang, Xiaowei Chi, Xinyue Zhang, Zhenzhu Yang, Xiangzhou Wang, Shansong Liu, Lingrui Mei, Peng Li, Junjie Wang, Jianwei Yu, Guojian Pang, Xu Li, Zihao Wang, Xiaohuan Zhou, Lijun Yu, Emmanouil Benetos, Yong Chen, Chenghua Lin, Xie Chen, Gus Xia, Zhaoxiang Zhang, Chao Zhang, Wenhu Chen, Xinyu Zhou, Xipeng Qiu, Roger B. Dannenberg, Zheng-Jia Liu, Jian Yang, Wenhao Huang, Wei Xue, Xu Tan, Yike Guo:
YuE: Scaling Open Foundation Models for Long-Form Music Generation. CoRR abs/2503.08638 (2025)
[i137]Weiming Ren, Wentao Ma, Huan Yang, Cong Wei, Ge Zhang, Wenhu Chen:
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers. CoRR abs/2503.11579 (2025)
[i136]Luxi Chen, Zihan Zhou, Min Zhao, Yikai Wang, Ge Zhang, Wenhao Huang, Hao Sun, Ji-Rong Wen, Chongxuan Li:
FlexWorld: Progressively Expanding 3D Scenes for Flexiable-View Synthesis. CoRR abs/2503.13265 (2025)
[i135]Jiaheng Liu, Dawei Zhu, Zhiqi Bai, Yancheng He, Huanxuan Liao, Haoran Que, Zekun Wang, Chenchen Zhang, Ge Zhang, Jiebin Zhang, Yuanxing Zhang, Zhuo Chen, Hangyu Guo, Shilong Li, Ziqiang Liu, Yong Shan, Yifan Song, Jiayi Tian, Wenhao Wu, Zhejian Zhou, Ruijie Zhu, Junlan Feng, Yang Gao, Shizhu He, Zhoujun Li, Tianyu Liu, Fanyu Meng, Wenbo Su, Yingshui Tan, Zili Wang, Jian Yang, Wei Ye, Bo Zheng, Wangchunshu Zhou, Wenhao Huang, Sujian Li, Zhaoxiang Zhang:
A Comprehensive Survey on Long Context Language Modeling. CoRR abs/2503.17407 (2025)
[i134]Meng Cao, Pengfei Hu, Yingyao Wang, Jihao Gu, Haoran Tang, Haoze Zhao, Jiahua Dong, Wangbo Yu, Ge Zhang, Ian Reid, Xiaodan Liang:
Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models. CoRR abs/2503.18923 (2025)
[i133]Jiazhan Feng, Shijue Huang, Xingwei Qu, Ge Zhang, Yujia Qin, Baoquan Zhong, Chengquan Jiang, Jinxin Chi, Wanjun Zhong:
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs. CoRR abs/2504.11536 (2025)
[i132]David Ma, Yuanxing Zhang, Jincheng Ren, Jarvis Guo, Yifan Yao, Zhenlin Wei, Zhenzhu Yang, Zhongyuan Peng, Boyu Feng, Jun Ma, Xiao Gu, Zhoufutu Wen, King Zhu, Yancheng He, Meng Cao, Shiwen Ni, Jiaheng Liu, Wenhao Huang, Ge Zhang, Xiaojie Jin:
IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs. CoRR abs/2504.15415 (2025)
[i131]Zhouliang Yu, Ruotian Peng, Keyi Ding, Yizhe Li, Zhongyuan Peng, Minghao Liu, Yifan Zhang, Zheng Yuan, Huajian Xin, Wenhao Huang, Yandong Wen, Ge Zhang, Weiyang Liu:
FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models. CoRR abs/2505.02735 (2025)
[i130]Jiajun Shi, Jian Yang, Jiaheng Liu, Xingyuan Bu, Jiangjie Chen, Junting Zhou, Kaijing Ma, Zhoufutu Wen, Bingli Wang, Yancheng He, Liang Song, Hualei Zhu, Shilong Li, Xingjian Wang, Wei Zhang, Ruibin Yuan, Yifan Yao, Wenjun Yang, Yunli Wang, Siyuan Fang, Siyu Yuan, Qianyu He, Xiangru Tang, Yingshui Tan, Wangchunshu Zhou, Zhaoxiang Zhang, Zhoujun Li, Wenhao Huang, Ge Zhang:
KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation. CoRR abs/2505.14552 (2025)
[i129]Wentao Ma, Weiming Ren, Yiming Jia, Zhuofeng Li, Ping Nie, Ge Zhang, Wenhu Chen:
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation. CoRR abs/2505.14640 (2025)
[i128]Xueguang Ma, Qian Liu, Dongfu Jiang, Ge Zhang, Zejun Ma, Wenhu Chen:
General-Reasoner: Advancing LLM Reasoning Across All Domains. CoRR abs/2505.14652 (2025)
[i127]Tao Sun, Enhao Pan, Zhengkai Yang, Kaixin Sui, Jiajun Shi, Xianfu Cheng, Tongliang Li, Wenhao Huang, Ge Zhang, Jian Yang, Zhoujun Li:
P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark. CoRR abs/2505.17104 (2025)
[i126]Chenghao Yang, Yinbo Luo, Zhoufutu Wen, Qi Chu, Tao Gong, Longxiang Liu, Kaiyuan Zhang, Jianpeng Jiao, Ge Zhang, Wenhao Huang, Nenghai Yu:
MARS-Bench: A Multi-turn Athletic Real-world Scenario Benchmark for Dialogue Evaluation. CoRR abs/2505.23810 (2025)
[i125]David Ma, Huaqing Yuan, Xingjian Wang, Qianbo Zang, Tianci Liu, Xinyang He, Yanbin Wei, Jiawei Guo, Ni Jiahui, Zhenzhu Yang, Meng Cao, Shanghaoran Quan, Yizhi Li, Wangchunshu Zhou, Jiaheng Liu, Wenhao Huang, Ge Zhang, Shiwen Ni, Xiaojie Jin:
ScaleLong: A Multi-Timescale Benchmark for Long Video Understanding. CoRR abs/2505.23922 (2025)
[i124]Dingfeng Shi, Jingyi Cao, Qianben Chen, Weichen Sun, Weizhen Li, Hongxuan Lu, Fangchen Dong, Tianrui Qin, King Zhu, Minghao Liu, Jian Yang, Ge Zhang, Jiaheng Liu, Changwang Zhang, Jun Wang, Yuchen Eleanor Jiang, Wangchunshu Zhou:
TaskCraft: Automated Generation of Agentic Tasks. CoRR abs/2506.10055 (2025)
[i123]Junting Zhou, Tingjia Miao, Yiyan Liao, Qichao Wang, Zhoufutu Wen, Yanqin Wang, Yunjie Huang, Ge Yan, Leqi Wang, Yucheng Xia, Hongwan Gao, Yuansong Zeng, Renjie Zheng, Chen Dun, Yitao Liang, Tong Yang, Wenhao Huang, Ge Zhang:
SciDA: Scientific Dynamic Assessor of LLMs. CoRR abs/2506.12909 (2025)
[i122]King Zhu, Hanhao Li, Siwei Wu, Tianshun Xing, Dehua Ma, Xiangru Tang, Minghao Liu, Jian Yang, Jiaheng Liu, Yuchen Eleanor Jiang, Changwang Zhang, Chenghua Lin, Jun Wang, Ge Zhang, Wangchunshu Zhou:
Scaling Test-time Compute for LLM Agents. CoRR abs/2506.12928 (2025)
[i121]He Zhu, Tianrui Qin, King Zhu, Heyuan Huang, Yeyi Guan, Jinxiang Xia, Yi Yao, Hanhao Li, Ningning Wang, Pai Liu, Tianhao Peng, Xin Gui, Xiaowan Li, Yuhui Liu, Yuchen Eleanor Jiang, Jun Wang, Changwang Zhang, Xiangru Tang, Ge Zhang, Jian Yang, Minghao Liu, Xitong Gao, Jiaheng Liu, Wangchunshu Zhou:
OAgents: An Empirical Study of Building Effective Agents. CoRR abs/2506.15741 (2025)
[i120]Zhongyuan Peng, Yifan Yao, Kaijing Ma, Shuyue Guo, Yizhe Li, Yichi Zhang, Chenchen Zhang, Yifan Zhang, Zhouliang Yu, Luming Li, Minghao Liu, Yihang Xia, Jiawei Shen, Yuchen Wu, Yixin Cao, Zhaoxiang Zhang, Wenhao Huang, Jiaheng Liu, Ge Zhang:
CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization. CoRR abs/2507.06181 (2025)
[i119]Ruijie Zhu, Tianhao Peng, Tianhao Cheng, Xingwei Qu, Jinfa Huang, Dawei Zhu, Hao Wang, Kaiwen Xue, Xuanliang Zhang, Yong Shan, Tianle Cai, Taylor Kergan, Assel Kembay, Andrew Smith, Chenghua Lin, Binh Nguyen, Yuqi Pan, Yuhong Chou, Zefan Cai, Zhenhe Wu, Yongchi Zhao, Tianyu Liu, Jian Yang, Wangchunshu Zhou, Chujie Zheng, Chongxuan Li, Yuyin Zhou, Zhoujun Li, Zhaoxiang Zhang, Jiaheng Liu, Ge Zhang, Wenhao Huang, Jason Eshraghian:
A Survey on Latent Reasoning. CoRR abs/2507.06203 (2025)
[i118]Xiangru Tang, Tianrui Qin, Tianhao Peng, Ziyang Zhou, Daniel Shao, Tingting Du, Xinming Wei, Peng Xia, Fang Wu, He Zhu, Ge Zhang, Jiaheng Liu, Xingyao Wang, Sirui Hong, Chenglin Wu, Hao Cheng, Chi Wang, Wangchunshu Zhou:
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving. CoRR abs/2507.06229 (2025)
[i117]Dustin Wang, Rui-Jie Zhu, Steven Abreu, Yong Shan, Taylor Kergan, Yuqi Pan, Yuhong Chou, Zheng Li, Ge Zhang, Wenhao Huang, Jason Eshraghian:
A Systematic Analysis of Hybrid Linear Attention. CoRR abs/2507.06457 (2025)
[i116]Tianyu Zheng, Tianshun Xing, Qingshui Gu, Taoran Liang, Xingwei Qu, Xin Zhou, Yizhi Li, Zhoufutu Wen, Chenghua Lin, Wenhao Huang, Qian Liu, Ge Zhang, Zejun Ma:
First Return, Entropy-Eliciting Explore. CoRR abs/2507.07017 (2025)
[i115]Linzheng Chai, Jian Yang, Shukai Liu, Wei Zhang, Liran Wang, Ke Jin, Tao Sun, Congnan Liu, Chenchen Zhang, Hualei Zhu, Jiaheng Liu, Xianjie Wu, Ge Zhang, Tianyu Liu, Zhoujun Li:
Multilingual Multimodal Software Developer for Code Generation. CoRR abs/2507.08719 (2025)
[i114]Jian Yang, Wei Zhang, Shukai Liu, Linzheng Chai, Yingshui Tan, Jiaheng Liu, Ge Zhang, Wangchunshu Zhou, Guanglin Niu, Zhoujun Li, Binyuan Hui, Junyang Lin:
IFEvalCode: Controlled Code Generation. CoRR abs/2507.22462 (2025)
[i113]Luoxin Chen, Jinming Gu, Liankai Huang, Wenhao Huang, Zhicheng Jiang, Allan Jie, Xiaoran Jin, Xing Jin, Chenggang Li, Kaijing Ma, Cheng Ren, Jiawei Shen, Wenlei Shi, Tong Sun, He Sun, Jiahui Wang, Siran Wang, Zhihong Wang, Chenrui Wei, Shufa Wei, Yonghui Wu, Yuchen Wu, Yihang Xia, Huajian Xin, Fan Yang, Huaiyuan Ying, Hongyi Yuan, Zheng Yuan, Tianyang Zhan, Chi Zhang, Yue Zhang, Ge Zhang, Tianyun Zhao, Jianqiu Zhao, Yichi Zhou, Thomas Hanwen Zhu:
Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving. CoRR abs/2507.23726 (2025)
[i112]Yuxuan Song, Zheng Zhang, Cheng Luo, Pengyang Gao, Fan Xia, Hao Luo, Zheng Li, Yuehang Yang, Hongli Yu, Xingwei Qu, Yuwei Fu, Jing Su, Ge Zhang, Wenhao Huang, Mingxuan Wang, Lin Yan, Xiaoying Jia, Jingjing Liu, Wei-Ying Ma, Ya-Qin Zhang, Yonghui Wu, Hao Zhou:
Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference. CoRR abs/2508.02193 (2025)
[i111]Ningning Wang, Xavier Hu, Pai Liu, He Zhu, Yue Hou, Heyuan Huang, Shengyu Zhang, Jian Yang, Jiaheng Liu, Ge Zhang, Changwang Zhang, Jun Wang, Yuchen Eleanor Jiang, Wangchunshu Zhou:
Efficient Agents: Building Effective Agents While Reducing Cost. CoRR abs/2508.02694 (2025)
[i110]Shunyu Liu, Minghao Liu, Huichi Zhou, Zhenyu Cui, Yang Zhou, Yuhao Zhou, Wendong Fan, Ge Zhang, Jiajun Shi, Weihao Xuan, Jiaxing Huang, Shuang Luo, Fang Wu, Heli Qi, Qingcheng Zeng, Ziqi Ren, Jialiang Gao, Jindi Lv, Junjie Wang, Aosong Feng, Heng Zhou, Wangchunshu Zhou, Zhenfei Yin, Wenlong Zhang, Guohao Li, Wenhao Yu, Irene Li, Lei Ma, Lei Bai, Qunshu Lin, Mingli Song, Dacheng Tao:
VeriGUI: Verifiable Long-Chain GUI Dataset. CoRR abs/2508.04026 (2025)
[i109]Zhiyuan Zeng, Jiashuo Liu, Siyuan Chen, Tianci He, Yali Liao, Jinpeng Wang, Zaiyuan Wang, Yang Yang, Lingyue Yin, Mingren Yin, Zhenwei Zhu, Tianle Cai, Zehui Chen, Jiecao Chen, Yantao Du, Xiang Gao, Jiacheng Guo, Liang Hu, Jianpeng Jiao, Xiangsheng Li, Jingkai Liu, Shuang Ni, Zhoufutu Wen, Ge Zhang, Kaiyuan Zhang, Xin Zhou, Jose H. Blanchet, Xipeng Qiu, Mengdi Wang, Wenhao Huang:
FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction. CoRR abs/2508.11987 (2025)
[i108]Weizhen Li, Jianbo Lin, Zhuosong Jiang, Jingyi Cao, Xinpeng Liu, Jiayu Zhang, Zhenqiang Huang, Qianben Chen, Weichen Sun, Qiexiang Wang, Hongxuan Lu, Tianrui Qin, Chenghao Zhu, Yi Yao, Shuying Fan, Xiaowan Li, Tiannan Wang, Pai Liu, King Zhu, He Zhu, Dingfeng Shi, Piaohong Wang, Yeyi Guan, Xiangru Tang, Minghao Liu, Yuchen Eleanor Jiang, Jian Yang, Jiaheng Liu, Ge Zhang, Wangchunshu Zhou:
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL. CoRR abs/2508.13167 (2025)
[i107]Shilong Li, Xingyuan Bu, Wenjie Wang, Jiaheng Liu, Jun Dong, Haoyang He, Hao Lu, Haozhe Zhang, Chenchen Jing, Zhen Li, Chuanhao Li, Jiayi Tian, Chenchen Zhang, Tianhao Peng, Yancheng He, Jihao Gu, Yuanxing Zhang, Jian Yang, Ge Zhang, Wenhao Huang, Wangchunshu Zhou, Zhaoxiang Zhang, Ruizhe Ding, Shilei Wen:
MM-BrowseComp: A Comprehensive Benchmark for Multimodal Browsing Agents. CoRR abs/2508.13186 (2025)
[i106]Daixin Shu, Jian Yang, Zhenhe Wu, Xianjie Wu, Xianfu Cheng, Xiangyuan Guan, Yanghai Wang, Pengfei Wu, Tingyang Yang, Hualei Zhu, Wei Zhang, Ge Zhang, Jiaheng Liu, Zhoujun Li:
M3TQA: Massively Multilingual Multitask Table Question Answering. CoRR abs/2508.16265 (2025)
[i105]Yizhi Li, Qingshui Gu, Zhoufutu Wen, Ziniu Li, Tianshun Xing, Shuyue Guo, Tianyu Zheng, Xin Zhou, Xingwei Qu, Wangchunshu Zhou, Zheng Zhang, Wei Shen, Qian Liu, Chenghua Lin, Jian Yang, Ge Zhang, Wenhao Huang:
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling. CoRR abs/2508.17445 (2025)
[i104]Haozhe Wang, Haoran Que, Qixin Xu, Minghao Liu, Wangchunshu Zhou, Jiazhan Feng, Wanjun Zhong, Wei Ye, Tong Yang, Wenhao Huang, Ge Zhang, Fangzhen Lin:
Reverse-Engineered Reasoning for Open-Ended Generation. CoRR abs/2509.06160 (2025)
[i103]Liang Hu, Jianpeng Jiao, Jiashuo Liu, Yanle Ren, Zhoufutu Wen, Kaiyuan Zhang, Xuanliang Zhang, Xiang Gao, Tianci He, Fei Hu, Yali Liao, Zaiyuan Wang, Chenghao Yang, Qianyu Yang, Mingren Yin, Zhiyuan Zeng, Ge Zhang, Xinyi Zhang, Xiying Zhao, Zhenwei Zhu, Hongseok Namkoong, Wenhao Huang, Yuwen Tang:
FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning. CoRR abs/2509.13160 (2025)
[i102]Xuan He, Dongfu Jiang, Ping Nie, Minghao Liu, Zhengxuan Jiang, Mingyi Su, Wentao Ma, Junru Lin, Chun Ye, Yi Lu, Keming Wu, Benjamin Schneider, Quy Duc Do, Zhuofeng Li, Yiming Jia, Yuxuan Zhang, Guo Cheng, Haozhe Wang, Wangchunshu Zhou, Qunshu Lin, Yuanxing Zhang, Ge Zhang, Wenhao Huang, Wenhu Chen:
VideoScore2: Think before You Score in Generative Video Evaluation. CoRR abs/2509.22799 (2025)
[i101]Yuan Liang, Jiaxian Li, Yuqing Wang, Piaohong Wang, Motong Tian, Pai Liu, Shuofei Qiao, Runnan Fang, He Zhu, Ge Zhang, Minghao Liu, Yuchen Eleanor Jiang, Ningyu Zhang, Wangchunshu Zhou:
Towards Personalized Deep Research: Benchmarks and Evaluations. CoRR abs/2509.25106 (2025)
[i100]Ziniu Li, Congliang Chen, Tianyun Yang, Tian Ding, Ruoyu Sun, Ge Zhang, Wenhao Huang, Zhi-Quan Luo:
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation. CoRR abs/2509.25849 (2025)
[i99]Caorui Li, Yu Chen, Yiyan Ji, Jin Xu, Zhenyu Cui, Shihao Li, Yuanxing Zhang, Jiafu Tang, Zhenghao Song, Dingling Zhang, Ying He, Haoxiang Liu, Yuxuan Wang, Qiufeng Wang, Zhenhe Wu, Jiehui Luo, Zhiyu Pan, Weihao Xie, Chenchen Zhang, Zhaohui Wang, Jiayi Tian, Yanghai Wang, Zhe Cao, Minxin Dai, Ke Wang, Runzhe Wen, Yinghao Ma, Yaning Pan, Sungkyun Chang, Termeh Taheri, Haiwen Xia, Christos Plachouras, Emmanouil Benetos, Yizhi Li, Ge Zhang, Jian Yang, Tianhao Peng, Zili Wang, Minghao Liu, Junran Peng, Zhaoxiang Zhang, Jiaheng Liu:
OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs. CoRR abs/2510.10689 (2025)
[i98]Xin Gui, King Zhu, JinCheng Ren, Qianben Chen, Zekun Moore Wang, Yizhi Li, Xinpeng Liu, Xiaowan Li, Wenli Ren, Linyu Miao, Tianrui Qin, Ziqi Shu, He Zhu, Xiangru Tang, Dingfeng Shi, Jiaheng Liu, Yuchen Eleanor Jiang, Minghao Liu, Ge Zhang, Wangchunshu Zhou:
ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems. CoRR abs/2510.11652 (2025)
[i97]Qianben Chen, Jingyi Cao, Jiayu Zhang, Tianrui Qin, Xiaowan Li, King Zhu, Dingfeng Shi, He Zhu, Minghao Liu, Xiaobo Liang, Xin Gui, Ge Zhang, Jian Yang, Yuchen Eleanor Jiang, Wangchunshu Zhou:
A2FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning. CoRR abs/2510.12838 (2025)
[i96]Shuangshuang Ying, Yunwen Li, Xingwei Qu, Xin Li, Sheng Jin, Minghao Liu, Zhoufutu Wen, Xeron Du, Tianyu Zheng, Yichi Zhang, Letian Ni, Yuyang Cheng, Qiguang Chen, Jingzhe Ding, Shengda Long, Wangchunshu Zhou, Jiazhan Feng, Wanjun Zhong, Libo Qin, Ge Zhang, Wenhao Huang, Wanxiang Che, Chenghua Lin:
Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures. CoRR abs/2510.14616 (2025)
[i95]Rui-Jie Zhu, Zixuan Wang, Kai Hua, Tianyu Zhang, Ziniu Li, Haoran Que, Boyi Wei, Zixin Wen, Fan Yin, He Xing, Lu Li, Jiajun Shi, Kaijing Ma, Shanda Li, Taylor Kergan, Andrew Smith, Xingwei Qu, Mude Hui, Bohong Wu, Qiyang Min, Hongzhi Huang, Xun Zhou, Wei Ye, Jiaheng Liu, Jian Yang, Yunfeng Shi, Chenghua Lin, Enduo Zhao, Tianle Cai, Ge Zhang, Wenhao Huang, Yoshua Bengio, Jason Eshraghian:
Scaling Latent Reasoning via Looped Language Models. CoRR abs/2510.25741 (2025)
[i94]Kaiyuan Zhang, Chenghao Yang, Zhoufutu Wen, Sihang Yuan, Qiuyue Wang, Chaoyi Huang, Guosheng Zhu, He Wang, Huawenyu Lu, Jianing Wen, Jianpeng Jiao, Lishu Luo, Longxiang Liu, Sijin Wu, Xiaolei Zhu, Xuanliang Zhang, Ge Zhang, Yi Lin, Guang Shi, Chaoyou Fu, Wenhao Huang:
MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity. CoRR abs/2511.03146 (2025)
[i93]Zhiyuan Zeng, Jiashuo Liu, Zhangyue Yin, Ge Zhang, Wenhao Huang, Xipeng Qiu:
RLoop: An Self-Improving Framework for Reinforcement Learning with Iterative Policy Initialization. CoRR abs/2511.04285 (2025)
[i92]Liya Zhu, Peizhuang Cong, Aowei Ji, Wenya Wu, Jiani Hou, Chunjie Wu, Xiang Gao, Jingkai Liu, Zhou Huan, Xuelei Sun, Yang Yang, Jianpeng Jiao, Liang Hu, Xinjie Chen, Jiashuo Liu, Jingzhe Ding, Tong Yang, Zaiyuan Wang, Ge Zhang, Wenhao Huang:
LPFQA: A Long-Tail Professional Forum-based Benchmark for LLM Evaluation. CoRR abs/2511.06346 (2025)
[i91]Tianhao Peng, Haochen Wang, Yuanxing Zhang, Zekun Wang, Zili Wang, Ge Zhang, Jian Yang, Shihao Li, Yanghai Wang, Xintao Wang, Houyi Li, Wei Ji, Pengfei Wan, Wenhao Huang, Zhaoxiang Zhang, Jiaheng Liu:
MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs. CoRR abs/2511.07250 (2025)
[i90]Xiying Zhao, Zhoufutu Wen, Zhixuan Chen, Jingzhe Ding, Jianpeng Jiao, Shuai Li, Xi Li, Danni Liang, Shengda Long, Qianqian Liu, Xianbo Wu, Hongwan Gao, Xiang Gao, Liang Hu, Jiashuo Liu, Mengyun Liu, Weiran Shi, Chenghao Yang, Qianyu Yang, Xuanliang Zhang, Ge Zhang, Wenhao Huang, Yuwen Tang:
DiscoX: Benchmarking Discourse-Level Translation task in Expert Domains. CoRR abs/2511.10984 (2025)
[i89]Xiaoxuan Tang, Xinping Lei, Chaoran Zhu, Shiyun Chen, Ruibin Yuan, Yizhi Li, Changjae Oh, Ge Zhang, Wenhao Huang, Emmanouil Benetos, Yang Liu, Jiaheng Liu, Yinghao Ma:
AutoMV: An Automatic Multi-Agent System for Music Video Generation. CoRR abs/2512.12196 (2025)
[i88]Jingzhe Ding, Shengda Long, Changxin Pu, Huan Zhou, Hongwan Gao, Xiang Gao, Chao He, Yue Hou, Fei Hu, Zhaojian Li, Weiran Shi, Zaiyuan Wang, Daoguang Zan, Chenchen Zhang, Xiaoxu Zhang, Qizhi Chen, Xianfu Cheng, Bo Deng, Qingshui Gu, Kai Hua, Juntao Lin, Pai Liu, Mingchen Li, Xuanguang Pan, Zifan Peng, Yujia Qin, Yong Shan, Zhewen Tan, Weihao Xie, Zihan Wang, Yishuo Yuan, Jiayu Zhang, Enduo Zhao, Yunfei Zhao, He Zhu, Chenyang Zou, Ming Ding, Jianpeng Jiao, Jiaheng Liu, Minghao Liu, Qian Liu, Chongyao Tao, Jian Yang, Tong Yang, Zhaoxiang Zhang, Xinjie Chen, Wenhao Huang, Ge Zhang:
NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents. CoRR abs/2512.12730 (2025)
[i87]Jian Yang, Wei Zhang, Yizhi Li, Shawn Guo, Haowen Wang, Aishan Liu, Ge Zhang, Zili Wang, Zhoujun Li, Xianglong Liu, Weifeng Lv:
CodeSimpleQA: Scaling Factuality in Code Large Language Models. CoRR abs/2512.19424 (2025)
[i86]Yingru Li, Jiawei Xu, Jiacai Liu, Yuxuan Tong, Ziniu Li, Tianle Cai, Ge Zhang, Qian Liu, Baoxiang Wang:
Taming the Tail: Stable LLM Reinforcement Learning via Dynamic Vocabulary Pruning. CoRR abs/2512.23087 (2025)
[i85]Xingwei Qu, Shaowen Wang, Zihao Huang, Kai Hua, Fan Yin, Rui-Jie Zhu, Jundong Zhou, Qiyang Min, Zihao Wang, Yizhi Li, Tianyu Zhang, He Xing, Zheng Zhang, Yuxuan Song, Tianyu Zheng, Zhiyuan Zeng, Chenghua Lin, Ge Zhang, Wenhao Huang:
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space. CoRR abs/2512.24617 (2025)
[i84]Yiming Liang, Yizhi Li, Yantao Du, Ge Zhang, Jiayi Zhou, Yuchen Wu, Yinzhu Piao, Denghui Cao, Tong Sun, Ziniu Li, Li Du, Bo Lei, Jiaheng Liu, Chenghua Lin, Zhaoxiang Zhang, Wenhao Huang, Jiajun Zhang:
Encyclo-K: Evaluating LLMs with Dynamically Composed Knowledge Statements. CoRR abs/2512.24867 (2025)
- 2024
[j2]Dongfu Jiang, Yishan Li, Ge Zhang, Wenhao Huang, Bill Yuchen Lin, Wenhu Chen:
TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks. Trans. Mach. Learn. Res. 2024 (2024)
[j1]Weiming Ren, Huan Yang, Ge Zhang, Cong Wei, Xinrun Du, Wenhao Huang, Wenhu Chen:
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation. Trans. Mach. Learn. Res. 2024 (2024)
[c31]Jiaheng Liu, Zhiqi Bai, Yuanxing Zhang, Chenchen Zhang, Yu Zhang, Ge Zhang, Jiakai Wang, Haoran Que, Yukang Chen, Wenbo Su, Tiezheng Ge, Jie Fu, Wenhu Chen, Bo Zheng:
E2-LLM: Efficient and Extreme Length Extension of Large Language Models. ACL (Findings) 2024: 4243-4253
[c30]Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Liumeng Xue, Ziyang Ma, Qin Liu, Tianyu Zheng, Yizhi Li, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Chenghua Lin, Qifeng Liu, Tao Jiang, Wenhao Huang, Wenhu Chen, Jie Fu, Emmanouil Benetos, Gus Xia, Roger B. Dannenberg, Wei Xue, Shiyin Kang, Yike Guo:
ChatMusician: Understanding and Generating Music Intrinsically with LLM. ACL (Findings) 2024: 6252-6271
[c29]Jun Zhan, Junqi Dai, Jiasheng Ye, Yunhua Zhou, Dong Zhang, Zhigeng Liu, Xin Zhang, Ruibin Yuan, Ge Zhang, Linyang Li, Hang Yan, Jie Fu, Tao Gui, Tianxiang Sun, Yu-Gang Jiang, Xipeng Qiu:
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling. ACL (1) 2024: 9637-9662
[c28]Yizhi Li, Ge Zhang, Xingwei Qu, Jiali Li, Zhaoqun Li, Noah Wang, Hao Li, Ruibin Yuan, Yinghao Ma, Kai Zhang, Wangchunshu Zhou, Yiming Liang, Lei Zhang, Lei Ma, Jiajun Zhang, Zuowen Li, Wenhao Huang, Chenghua Lin, Jie Fu:
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models. ACL (Findings) 2024: 12431-12446
[c27]Siwei Wu, Yizhi Li, Kang Zhu, Ge Zhang, Yiming Liang, Kaijing Ma, Chenghao Xiao, Haoran Zhang, Bohao Yang, Wenhu Chen, Wenhao Huang, Noura Al Moubayed, Jie Fu, Chenghua Lin:
SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval. ACL (Findings) 2024: 12560-12574
[c26]Tianyu Zheng, Ge Zhang, Tianhao Shen, Xueling Liu, Bill Yuchen Lin, Jie Fu, Wenhu Chen, Xiang Yue:
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement. ACL (Findings) 2024: 12834-12859
[c25]Yujie Shao, Xinrong Yao, Xingwei Qu, Chenghua Lin, Shi Wang, Wenhao Huang, Ge Zhang, Jie Fu:
CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation. LREC/COLING 2024: 3357-3366
[c24]Tianyu Zheng, Ge Zhang, Xingwei Qu, Ming Kuang, Wenhao Huang, Zhaofeng He:
MORE-3S: Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces. LREC/COLING 2024: 11593-11604
[c23]Xiang Yue, Yuansheng Ni, Tianyu Zheng, Kai Zhang, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen:
MMMU: A Massive Multi-Discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI. CVPR 2024: 9556-9567
[c22]Cong Wei, Yang Chen, Haonan Chen, Hexiang Hu, Ge Zhang, Jie Fu, Alan Ritter, Wenhu Chen:
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers. ECCV (87) 2024: 387-404
[c21]Xuan He, Dongfu Jiang, Ge Zhang, Max Ku, Achint Soni, Sherman Siu, Haonan Chen, Abhranil Chandra, Ziyan Jiang, Aaran Arulraj, Kai Wang, Quy Duc Do, Yuansheng Ni, Bohan Lyu, Yaswanth Narsupalli, Rongqi Fan, Zhiheng Lyu, Bill Yuchen Lin, Wenhu Chen:
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation. EMNLP 2024: 2105-2123
[c20]Shun Wang, Ge Zhang, Han Wu, Tyler Loakman, Wenhao Huang, Chenghua Lin:
MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language. EMNLP 2024: 11343-11358
[c19]Yizhi Li, Ruibin Yuan, Ge Zhang, Yinghao Ma, Xingran Chen, Hanzhi Yin, Chenghao Xiao, Chenghua Lin, Anton Ragni, Emmanouil Benetos, Norbert Gyenge, Roger B. Dannenberg, Ruibo Liu, Wenhu Chen, Gus Xia, Yemin Shi, Wenhao Huang, Zili Wang, Yike Guo, Jie Fu:
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training. ICLR 2024
[c18]Chenmien Tan, Ge Zhang, Jie Fu:
Massive Editing for Large Language Models via Meta Learning. ICLR 2024
[c17]Xiang Yue, Xingwei Qu, Ge Zhang, Yao Fu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen:
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning. ICLR 2024
[c16]Guangyao Chen, Siwei Dong, Yu Shu, Ge Zhang, Jaward Sesay, Börje Karlsson, Jie Fu, Yemin Shi:
AutoAgents: A Framework for Automatic Agent Generation. IJCAI 2024: 22-30
[c15]Qixin Deng, Qikai Yang, Ruibin Yuan, Yipeng Huang, Yi Wang, Xubo Liu, Zeyue Tian, Jiahao Pan, Ge Zhang, Hanfeng Lin, Yizhi Li, Yinghao Ma, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenwu Wang, Guangyu Xia, Wei Xue, Yike Guo:
ComposerX: Multi-Agent Symbolic Music Composition With LLMs. ISMIR 2024: 669-679
[c14]Zihao Deng, Yinghao Ma, Yudong Liu, Rongchen Guo, Ge Zhang, Wenhu Chen, Wenhao Huang, Emmanouil Benetos:
MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response. NAACL-HLT (Findings) 2024: 3643-3655
[c13]Ziqiang Liu, Feiteng Fang, Xi Feng, Xeron Du, Chenhao Zhang, Noah Wang, Yuelin Bai, Qixuan Zhao, Liyang Fan, Chengguang Gan, Hongquan Lin, Jiaming Li, Yuansheng Ni, Haihong Wu, Yaswanth Narsupalli, Zhigang Zheng, Chengming Li, Xiping Hu, Ruifeng Xu, Xiaojun Chen, Min Yang, Jiaheng Liu, Ruibo Liu, Wenhao Huang, Ge Zhang, Shiwen Ni:
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models. NeurIPS 2024
[c12]Jiaheng Liu, Zehao Ni, Haoran Que, Tao Sun, Noah Wang, Jian Yang, Jiakai Wang, Hongcheng Guo, Zhongyuan Peng, Ge Zhang, Jiayi Tian, Xingyuan Bu, Ke Xu, Wenge Rong, Junran Peng, Zhaoxiang Zhang:
RoleAgent: Building, Interacting, and Benchmarking High-quality Role-Playing Agents from Scripts. NeurIPS 2024
[c11]Jiaheng Liu, Chenchen Zhang, Jinyang Guo, Yuanxing Zhang, Haoran Que, Ken Deng, Zhiqi Bai, Jie Liu, Ge Zhang, Jiakai Wang, Yanan Wu, Congnan Liu, Jiamang Wang, Lin Qu, Wenbo Su, Bo Zheng:
DDK: Distilling Domain Knowledge for Efficient Large Language Models. NeurIPS 2024
[c10]Haoran Que, Jiaheng Liu, Ge Zhang, Chenchen Zhang, Xingwei Qu, Yinghao Ma, Feiyu Duan, Zhiqi Bai, Jiakai Wang, Yuanxing Zhang, Xu Tan, Jie Fu, Jiamang Wang, Lin Qu, Wenbo Su, Bo Zheng:
D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models. NeurIPS 2024
[c9]Yubo Wang, Xueguang Ma, Ge Zhang, Yuansheng Ni, Abhranil Chandra, Shiguang Guo, Weiming Ren, Aaran Arulraj, Xuan He, Ziyan Jiang, Tianle Li, Max Ku, Kai Wang, Alex Zhuang, Rongqi Fan, Xiang Yue, Wenhu Chen:
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark. NeurIPS 2024
[c8]Xiang Yue, Tianyu Zheng, Ge Zhang, Wenhu Chen:
MAmmoTH2: Scaling Instructions from the Web. NeurIPS 2024
[c7]Xingwei Qu, Ge Zhang, Siwei Wu, Yizhi Li, Chenghua Lin:
Overview of the NLPCC 2024 Shared Task on Chinese Metaphor Generation. NLPCC (5) 2024: 181-192
[i83]Tianyu Zheng, Shuyue Guo, Xingwei Qu, Jiawei Guo, Weixu Zhang, Xinrun Du, Chenghua Lin, Wenhao Huang, Wenhu Chen, Jie Fu, Ge Zhang:
Kun: Answer Polishment for Chinese Self-Alignment with Instruction Back-Translation. CoRR abs/2401.06477 (2024)
[i82]Jiaheng Liu, Zhiqi Bai, Yuanxing Zhang, Chenchen Zhang, Yu Zhang, Ge Zhang, Jiakai Wang, Haoran Que, Yukang Chen, Wenbo Su, Tiezheng Ge, Jie Fu, Wenhu Chen, Bo Zheng:
E^2-LLM: Efficient and Extreme Length Extension of Large Language Models. CoRR abs/2401.06951 (2024)
[i81]Ge Zhang, Xinrun Du, Bei Chen, Yiming Liang, Tongxu Luo, Tianyu Zheng, Kang Zhu, Yuyang Cheng, Chunpu Xu, Shuyue Guo, Haoran Zhang, Xingwei Qu, Junjie Wang, Ruibin Yuan, Yizhi Li, Zekun Wang, Yudong Liu, Yu-Hsuan Tsai, Fengji Zhang, Chenghua Lin, Wenhao Huang, Wenhu Chen, Jie Fu:
CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark. CoRR abs/2401.11944 (2024)
[i80]Siwei Wu, Yizhi Li, Kang Zhu, Ge Zhang, Yiming Liang, Kaijing Ma, Chenghao Xiao, Haoran Zhang, Bohao Yang, Wenhu Chen, Wenhao Huang, Noura Al Moubayed, Jie Fu, Chenghua Lin:
SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval. CoRR abs/2401.13478 (2024)
[i79]Yonggang Jin, Ge Zhang, Hao Zhao, Tianyu Zheng, Jiawei Guo, Liuyu Xiang, Shawn Yue, Stephen W. Huang, Wenhu Chen, Zhaofeng He, Jie Fu:
Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction. CoRR abs/2402.04154 (2024)
[i78]Weiming Ren, Harry Yang, Ge Zhang, Cong Wei, Xinrun Du, Stephen Huang, Wenhu Chen:
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation. CoRR abs/2402.04324 (2024)
[i77]Jun Zhan, Junqi Dai, Jiasheng Ye, Yunhua Zhou, Dong Zhang, Zhigeng Liu, Xin Zhang, Ruibin Yuan, Ge Zhang, Linyang Li, Hang Yan, Jie Fu, Tao Gui, Tianxiang Sun, Yu-Gang Jiang, Xipeng Qiu:
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling. CoRR abs/2402.12226 (2024)
[i76]Tianyu Zheng, Ge Zhang, Xingwei Qu, Ming Kuang, Stephen W. Huang, Zhaofeng He:
MORE-3S: Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces. CoRR abs/2402.12845 (2024)
[i75]Yizhi Li, Ge Zhang, Xingwei Qu, Jiali Li, Zhaoqun Li, Zekun Wang, Hao Li, Ruibin Yuan, Yinghao Ma, Kai Zhang, Wangchunshu Zhou, Yiming Liang, Lei Zhang, Lei Ma, Jiajun Zhang, Zuowen Li, Stephen W. Huang, Chenghua Lin, Wenhu Chen, Jie Fu:
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models. CoRR abs/2402.13109 (2024)
[i74]Yujie Shao, Xinrong Yao, Xingwei Qu, Chenghua Lin, Shi Wang, Stephen W. Huang, Ge Zhang, Jie Fu:
CMDAG: A Chinese Metaphor Dataset with Annotated Grounds as CoT for Boosting Metaphor Generation. CoRR abs/2402.13145 (2024)
[i73]Tianyu Zheng, Ge Zhang, Tianhao Shen, Xueling Liu, Bill Yuchen Lin, Jie Fu, Wenhu Chen, Xiang Yue:
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement. CoRR abs/2402.14658 (2024)
[i72]Ruibin Yuan, Hanfeng Lin, Yi Wang, Zeyue Tian, Shangda Wu, Tianhao Shen, Ge Zhang, Yuhang Wu, Cong Liu, Ziya Zhou, Ziyang Ma, Liumeng Xue, Ziyu Wang, Qin Liu, Tianyu Zheng, Yizhi Li, Yinghao Ma, Yiming Liang, Xiaowei Chi, Ruibo Liu, Zili Wang, Pengfei Li, Jingcheng Wu, Chenghua Lin, Qifeng Liu, Tao Jiang, Wenhao Huang, Wenhu Chen, Emmanouil Benetos, Jie Fu, Gus Xia, Roger B. Dannenberg, Wei Xue, Shiyin Kang, Yike Guo:
ChatMusician: Understanding and Generating Music Intrinsically with LLM. CoRR abs/2402.16153 (2024)
[i71]Alex Zhuang, Ge Zhang, Tianyu Zheng, Xinrun Du, Junjie Wang, Weiming Ren, Stephen W. Huang, Jie Fu, Xiang Yue, Wenhu Chen:
StructLM: Towards Building Generalist Models for Structured Knowledge Grounding. CoRR abs/2402.16671 (2024)
[i70]Xingwei Qu, Yiming Liang, Yucheng Wang, Tianyu Zheng, Tommy Yue, Lei Ma, Stephen W. Huang, Jiajun Zhang, Wenhu Chen, Chenghua Lin, Jie Fu, Ge Zhang:
DEEP-ICL: Definition-Enriched Experts for Language Model In-Context Learning. CoRR abs/2403.04233 (2024)
[i69]Alex Young, Bei Chen, Chao Li, Chengen Huang, Ge Zhang, Guanwei Zhang, Heng Li, Jiangcheng Zhu, Jianqun Chen, Jing Chang, Kaidong Yu, Peng Liu, Qiang Liu, Shawn Yue, Senbin Yang, Shiming Yang, Tao Yu, Wen Xie, Wenhao Huang, Xiaohui Hu, Xiaoyi Ren, Xinyao Niu, Pengcheng Nie, Yuchi Xu, Yudong Liu, Yue Wang, Yuxuan Cai, Zhenyu Gu, Zhiyuan Liu, Zonghong Dai:
Yi: Open Foundation Models by 01.AI. CoRR abs/2403.04652 (2024)
[i68]Yuelin Bai, Xinrun Du, Yiming Liang, Yonggang Jin, Ziqiang Liu, Junting Zhou, Tianyu Zheng, Xincheng Zhang, Nuo Ma, Zekun Wang, Ruibin Yuan, Haihong Wu, Hongquan Lin, Wenhao Huang, Jiajun Zhang, Wenhu Chen, Chenghua Lin, Jie Fu, Min Yang, Shiwen Ni, Ge Zhang:
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning. CoRR abs/2403.18058 (2024)
[i67]Chen Yang, Junzhuo Li, Xinyao Niu, Xinrun Du, Songyang Gao, Haoran Zhang, Zhaoliang Chen, Xingwei Qu, Ruibin Yuan, Yizhi Li, Jiaheng Liu, Stephen W. Huang, Shawn Yue, Wenhu Chen, Jie Fu, Ge Zhang:
The Fine Line: Navigating Large Language Model Pretraining with Down-streaming Capability Analysis. CoRR abs/2404.01204 (2024)
[i66]Tianle Li, Ge Zhang, Quy Duc Do, Xiang Yue, Wenhu Chen:
Long-context LLMs Struggle with Long In-context Learning. CoRR abs/2404.02060 (2024)
[i65]Jiawei Guo, Ziming Li, Xueling Liu, Kaijing Ma, Tianyu Zheng, Zhouliang Yu, Ding Pan, Yizhi Li, Ruibo Liu, Yue Wang, Shuyue Guo, Xingwei Qu, Xiang Yue, Ge Zhang, Wenhu Chen, Jie Fu:
CodeEditorBench: Evaluating Code Editing Capability of Large Language Models. CoRR abs/2404.03543 (2024)
[i64]Xinrun Du, Zhouliang Yu, Songyang Gao, Ding Pan, Yuyang Cheng, Ziyang Ma, Ruibin Yuan, Xingwei Qu, Jiaheng Liu, Tianyu Zheng, Xinchen Luo, Guorui Zhou, Binhang Yuan, Wenhu Chen, Jie Fu, Ge Zhang:
Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model. CoRR abs/2404.04167 (2024)
[i63]Xingwei Qu, Yuelin Bai, Yinghao Ma, Ziya Zhou, Ka Man Lo, Jiaheng Liu, Ruibin Yuan, Lejun Min, Xueling Liu, Tianyu Zhang, Xinrun Du, Shuyue Guo, Yiming Liang, Yizhi Li, Shangda Wu, Junting Zhou, Tianyu Zheng, Ziyang Ma, Fengze Han, Wei Xue, Gus Xia, Emmanouil Benetos, Xiang Yue, Chenghua Lin, Xu Tan, Stephen W. Huang, Wenhu Chen, Jie Fu, Ge Zhang:
MuPT: A Generative Symbolic Music Pretrained Transformer. CoRR abs/2404.06393 (2024)
[i62]Qixin Deng, Qikai Yang, Ruibin Yuan, Yipeng Huang, Yi Wang, Xubo Liu, Zeyue Tian, Jiahao Pan, Ge Zhang, Hanfeng Lin, Yizhi Li, Yinghao Ma, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenwu Wang, Guangyu Xia, Wei Xue, Yike Guo:
ComposerX: Multi-Agent Symbolic Music Composition with LLMs. CoRR abs/2404.18081 (2024)
[i61]Xiang Yue, Tuney Zheng, Ge Zhang, Wenhu Chen:
MAmmoTH2: Scaling Instructions from the Web. CoRR abs/2405.03548 (2024)
[i60]Ge Zhang, Scott Qu, Jiaheng Liu, Chenchen Zhang, Chenghua Lin, Chou Leuang Yu, Danny Pan, Esther Cheng, Jie Liu, Qunshu Lin, Raven Yuan, Tuney Zheng, Wei Pang, Xinrun Du, Yiming Liang, Yinghao Ma, Yizhi Li, Ziyang Ma, Bill Y. Lin, Emmanouil Benetos, Huan Yang, Junting Zhou, Kaijing Ma, Minghao Liu, Morry Niu, Noah Wang, Quehry Que, Ruibo Liu, Sine Liu, Shawn Guo, Soren Gao, Wangchunshu Zhou, Xinyue Zhang, Yizhi Zhou, Yubo Wang, Yuelin Bai, Yuhan Zhang, Yuxiang Zhang, Zenith Wang, Zhenzhu Yang, Zijian Zhao, Jiajun Zhang, Wanli Ouyang, Wenhao Huang, Wenhu Chen:
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series. CoRR abs/2405.19327 (2024)
[i59]Haoran Que, Jiaheng Liu, Ge Zhang, Chenchen Zhang, Xingwei Qu, Yinghao Ma, Feiyu Duan, Zhiqi Bai, Jiakai Wang, Yuanxing Zhang, Xu Tan, Jie Fu, Wenbo Su, Jiamang Wang, Lin Qu, Bo Zheng:
D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models. CoRR abs/2406.01375 (2024)
[i58]Yubo Wang, Xueguang Ma, Ge Zhang, Yuansheng Ni, Abhranil Chandra, Shiguang Guo, Weiming Ren, Aaran Arulraj, Xuan He, Ziyan Jiang, Tianle Li, Max Ku, Kai Wang, Alex Zhuang, Rongqi Fan, Xiang Yue, Wenhu Chen:
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark. CoRR abs/2406.01574 (2024)
[i57]Ziqiang Liu, Feiteng Fang, Xi Feng, Xinrun Du, Chenhao Zhang, Zekun Wang, Yuelin Bai, Qixuan Zhao, Liyang Fan, Chengguang Gan, Hongquan Lin, Jiaming Li, Yuansheng Ni, Haihong Wu, Yaswanth Narsupalli, Zhigang Zheng, Chengming Li, Xiping Hu, Ruifeng Xu, Xiaojun Chen, Min Yang, Jiaheng Liu, Ruibo Liu, Wenhao Huang, Ge Zhang, Shiwen Ni:
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models. CoRR abs/2406.05862 (2024)
[i56]Tianyu Zhang, Suyuchen Wang, Lu Li, Ge Zhang, Perouz Taslakian, Sai Rajeswar, Jie Fu, Bang Liu, Yoshua Bengio:
VCR: Visual Caption Restoration. CoRR abs/2406.06462 (2024)
[i55]Linzheng Chai, Shukai Liu, Jian Yang, Yuwei Yin, Ke Jin, Jiaheng Liu, Tao Sun, Ge Zhang, Changyu Ren, Hongcheng Guo, Zekun Wang, Boyang Wang, Xianjie Wu, Bing Wang, Tongliang Li, Liqun Yang, Sufeng Duan, Zhoujun Li:
McEval: Massively Multilingual Code Evaluation. CoRR abs/2406.07436 (2024)
[i54]Shun Wang, Ge Zhang, Han Wu, Tyler Loakman, Wenhao Huang, Chenghua Lin:
MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language. CoRR abs/2406.13698 (2024)
[i53]Junjie Wang, Yin Zhang, Yatai Ji, Yuxiang Zhang, Chunyang Jiang, Yubo Wang, Kang Zhu, Zekun Wang, Tiezhen Wang, Wenhao Huang, Jie Fu, Bei Chen, Qunshu Lin, Minghao Liu, Ge Zhang, Wenhu Chen:
PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents. CoRR abs/2406.13923 (2024)
[i52]Leyan Wang, Yonggang Jin, Tianhao Shen, Tianyu Zheng, Xinrun Du, Chenchen Zhang, Wenhao Huang, Jiaheng Liu, Shi Wang, Ge Zhang, Liuyu Xiang, Zhaofeng He:
GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Models. CoRR abs/2406.14903 (2024)
[i51]Xuan He, Dongfu Jiang, Ge Zhang, Max Ku, Achint Soni, Sherman Siu, Haonan Chen, Abhranil Chandra, Ziyan Jiang, Aaran Arulraj, Kai Wang, Quy Duc Do, Yuansheng Ni, Bohan Lyu, Yaswanth Narsupalli, Rongqi Fan, Zhiheng Lyu, Yuchen Lin, Wenhu Chen:
VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation. CoRR abs/2406.15252 (2024)
[i50]Shawn Gavin, Tuney Zheng, Jiaheng Liu, Quehry Que, Noah Wang, Jian Yang, Chenchen Zhang, Wenhao Huang, Wenhu Chen, Ge Zhang:
LongIns: A Challenging Long-context Instruction-based Exam for LLMs. CoRR abs/2406.17588 (2024)
[i49]Jiaheng Liu, Chenchen Zhang, Jinyang Guo, Yuanxing Zhang, Haoran Que, Ken Deng, Zhiqi Bai, Jie Liu, Ge Zhang, Jiakai Wang, Yanan Wu, Congnan Liu, Wenbo Su, Jiamang Wang, Lin Qu, Bo Zheng:
DDK: Distilling Domain Knowledge for Efficient Large Language Models. CoRR abs/2407.16154 (2024)
[i48]Siwei Wu, Kang Zhu, Yu Bai, Yiming Liang, Yizhi Li, Haoning Wu, Jiaheng Liu, Ruibo Liu, Xingwei Qu, Xuxin Cheng, Ge Zhang, Wenhao Huang, Chenghua Lin:
MMRA: A Benchmark for Multi-granularity Multi-image Relational Association. CoRR abs/2407.17379 (2024)
[i47]Xingwei Qu, Ge Zhang, Siwei Wu, Yizhi Li, Chenghua Lin:
Overview of the NLPCC 2024 Shared Task on Chinese Metaphor Generation. CoRR abs/2408.04378 (2024)
[i46]Yiming Liang, Ge Zhang, Xingwei Qu, Tianyu Zheng, Jiawei Guo, Xinrun Du, Zhenzhu Yang, Jiaheng Liu, Chenghua Lin, Lei Ma, Wenhao Huang, Jiajun Zhang:
I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm. CoRR abs/2408.08072 (2024)
[i45]Xianjie Wu, Jian Yang, Linzheng Chai, Ge Zhang, Jiaheng Liu, Xinrun Du, Di Liang, Daixin Shu, Xianfu Cheng, Tianzhen Sun, Guanglin Niu, Tongliang Li, Zhoujun Li:
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering. CoRR abs/2408.09174 (2024)
[i44]Yinghao Ma, Anders Øland, Anton Ragni, Bleiz Macsen Del Sette, Charalampos Saitis, Chris Donahue, Chenghua Lin, Christos Plachouras, Emmanouil Benetos, Elio Quinton, Elona Shatri, Fabio Morreale, Ge Zhang, György Fazekas, Gus Xia, Huan Zhang, Ilaria Manco, Jiawen Huang, Julien Guinot, Liwei Lin, Luca Marinelli, Max W. Y. Lam, Megha Sharma, Qiuqiang Kong, Roger B. Dannenberg, Ruibin Yuan, Shangda Wu, Shih-Lun Wu, Shuqi Dai, Shun Lei, Shiyin Kang, Simon Dixon, Wenhu Chen, Wenhao Huang, Xingjian Du, Xingwei Qu, Xu Tan, Yizhi Li, Zeyue Tian, Zhiyong Wu, Zhizheng Wu, Ziyang Ma, Ziyu Wang:
Foundation Models for Music: A Survey. CoRR abs/2408.14340 (2024)
[i43]Liqun Yang, Jian Yang, Chaoren Wei, Guanglin Niu, Ge Zhang, Yunli Wang, Linzheng Chai, Wanxu Xia, Hongcheng Guo, Shun Zhang, Jiaheng Liu, Yuwei Yin, Junran Peng, Jiaxin Ma, Liang Sun, Zhoujun Li:
FuzzCoder: Byte-level Fuzzing Test via Large Language Model. CoRR abs/2409.01944 (2024)
[i42]Bofei Gao, Feifan Song, Yibo Miao, Zefan Cai, Zhe Yang, Liang Chen, Helan Hu, Runxin Xu, Qingxiu Dong, Ce Zheng, Wen Xiao, Ge Zhang, Daoguang Zan, Keming Lu, Bowen Yu, Dayiheng Liu, Zeyu Cui, Jian Yang, Lei Sha, Houfeng Wang, Zhifang Sui, Peiyi Wang, Tianyu Liu, Baobao Chang:
Towards a Unified View of Preference Learning for Large Language Models: A Survey. CoRR abs/2409.02795 (2024)
[i41]Xiang Yue, Tianyu Zheng, Yuansheng Ni, Yubo Wang, Kai Zhang, Shengbang Tong, Yuxuan Sun, Botao Yu, Ge Zhang, Huan Sun, Yu Su, Wenhu Chen, Graham Neubig:
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark. CoRR abs/2409.02813 (2024)
[i40]King Zhu, Qianbo Zang, Shian Jia, Siwei Wu, Feiteng Fang, Yizhi Li, Shawn Gavin, Tuney Zheng, Jiawei Guo, Bo Li, Haoning Wu, Xingwei Qu, Jian Yang, Zachary Liu, Xiang Yue, J. H. Liu, Chenghua Lin, Min Yang, Shiwen Ni, Wenhao Huang, Ge Zhang:
LIME: Less Is More for MLLM Evaluation. CoRR abs/2409.06851 (2024)
[i39]Yizhi Li, Ge Zhang, Yinghao Ma, Ruibin Yuan, Kang Zhu, Hangyu Guo, Yiming Liang, Jiaheng Liu, Jian Yang, Siwei Wu, Xingwei Qu, Jinjie Shi, Xinyue Zhang, Zhenzhu Yang, Xiangzhou Wang, Zhaoxiang Zhang, Zachary Liu, Emmanouil Benetos, Wenhao Huang, Chenghua Lin:
OmniBench: Towards The Future of Universal Omni-Language Models. CoRR abs/2409.15272 (2024)
[i38]Haoran Que, Feiyu Duan, Liqun He, Yutao Mou, Wangchunshu Zhou, Jiaheng Liu, Wenge Rong, Zekun Moore Wang, Jian Yang, Ge Zhang, Junran Peng, Zhaoxiang Zhang, Songyang Zhang, Kai Chen:
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models. CoRR abs/2409.16191 (2024)
[i37]Zekun Wang, King Zhu, Chunpu Xu, Wangchunshu Zhou, Jiaheng Liu, Yibo Zhang, Jiashuo Wang, Ning Shi, Siyu Li, Yizhi Li, Haoran Que, Zhaoxiang Zhang, Yuanxing Zhang, Ge Zhang, Ke Xu, Jie Fu, Wenhao Huang:
MIO: A Foundation Model on Multimodal Tokens. CoRR abs/2409.17692 (2024)
[i36]Kaijing Ma, Xinrun Du, Yunran Wang, Haoran Zhang, Zhoufutu Wen, Xingwei Qu, Jian Yang, Jiaheng Liu, Minghao Liu, Xiang Yue, Wenhao Huang, Ge Zhang:
KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks. CoRR abs/2410.06526 (2024)
[i35]Haoran Zhang, Hangyu Guo, Shuyue Guo, Meng Cao, Wenhao Huang, Jiaheng Liu, Ge Zhang:
ING-VP: MLLMs cannot Play Easy Vision-based Games Yet. CoRR abs/2410.06555 (2024)
[i34]Bofei Gao, Feifan Song, Zhe Yang, Zefan Cai, Yibo Miao, Qingxiu Dong, Lei Li, Chenghao Ma, Liang Chen, Runxin Xu, Zhengyang Tang, Benyou Wang, Daoguang Zan, Shanghaoran Quan, Ge Zhang, Lei Sha, Yichang Zhang, Xuancheng Ren, Tianyu Liu, Baobao Chang:
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models. CoRR abs/2410.07985 (2024)
[i33]Pei Wang, Yanan Wu, Zekun Wang, Jiaheng Liu, Xiaoshuai Song, Zhongyuan Peng, Ken Deng, Chenchen Zhang, Jiakai Wang, Junran Peng, Ge Zhang, Hangyu Guo, Zhaoxiang Zhang, Wenbo Su, Bo Zheng:
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models. CoRR abs/2410.11710 (2024)
[i32]Shangda Wu, Yashan Wang, Ruibin Yuan, Zhancheng Guo, Xu Tan, Ge Zhang, Monan Zhou, Jing Chen, Xuefeng Mu, Yuejie Gao, Yuanliang Dong, Jiafeng Liu, Xiaobing Li, Feng Yu, Maosong Sun:
CLaMP 2: Multimodal Music Information Retrieval Across 101 Languages Using Large Language Models. CoRR abs/2410.13267 (2024)
[i31]Siwei Wu, Zhongyuan Peng, Xinrun Du, Tuney Zheng, Minghao Liu, Jialong Wu, Jiachen Ma, Yizhi Li, Jian Yang, Wangchunshu Zhou, Qunshu Lin, Junbo Zhao, Zhaoxiang Zhang, Wenhao Huang, Ge Zhang, Chenghua Lin, Jiaheng Liu:
A Comparative Study on Reasoning Patterns of OpenAI's o1 Model. CoRR abs/2410.13639 (2024)
[i30]Chenhao Zhang, Xi Feng, Yuelin Bai, Xinrun Du, Jinchang Hou, Kaixin Deng, Guangzeng Han, Qinrui Li, Bingli Wang, Jiaheng Liu, Xingwei Qu, Yifei Zhang, Qixuan Zhao, Yiming Liang, Ziqiang Liu, Feiteng Fang, Min Yang, Wenhao Huang, Chenghua Lin, Ge Zhang, Shiwen Ni:
Can MLLMs Understand the Deep Implication Behind Chinese Images? CoRR abs/2410.13854 (2024)
[i29]Ziming Li, Qianbo Zang, David Ma, Jiawei Guo, Tuney Zheng, Minghao Liu, Xinyao Niu, Yue Wang, Jian Yang, Jiaheng Liu, Wanjun Zhong, Wangchunshu Zhou, Wenhao Huang, Ge Zhang:
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions. CoRR abs/2410.20424 (2024)
[i28]Jiaheng Liu, Ken Deng, Congnan Liu, Jian Yang, Shukai Liu, He Zhu, Peng Zhao, Linzheng Chai, Yanan Wu, Ke Jin, Ge Zhang, Zekun Wang, Guoan Zhang, Bangyu Xiang, Wenbo Su, Bo Zheng:
M2rc-Eval: Massively Multilingual Repository-level Code Completion Evaluation. CoRR abs/2410.21157 (2024)
[i27]Shukai Liu, Linzheng Chai, Jian Yang, Jiajun Shi, He Zhu, Liran Wang, Ke Jin, Wei Zhang, Hualei Zhu, Shuyue Guo, Tao Sun, Jiaheng Liu, Yunlong Duan, Yu Hao, Liqun Yang, Guanglin Niu, Ge Zhang, Zhoujun Li:
MdEval: Massively Multilingual Code Debugging. CoRR abs/2411.02310 (2024)
[i26]Siming Huang, Tianhao Cheng, Jason Klein Liu, Jiaran Hao, Liuyihan Song, Yang Xu, J. Yang, J. H. Liu, Chenchen Zhang, Linzheng Chai, Ruifeng Yuan, Zhaoxiang Zhang, Jie Fu, Qian Liu, Ge Zhang, Zili Wang, Yuan Qi, Yinghui Xu, Wei Chu:
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models. CoRR abs/2411.04905 (2024)
[i25]Cong Wei, Zheyang Xiong, Weiming Ren, Xinrun Du, Ge Zhang, Wenhu Chen:
OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision. CoRR abs/2411.07199 (2024)
[i24]Meng Cao, Haoran Tang, Haoze Zhao, Hangyu Guo, Jiaheng Liu, Ge Zhang, Ruyang Liu, Qiang Sun, Ian Reid, Xiaodan Liang:
PhysGame: Uncovering Physical Commonsense Violations in Gameplay Videos. CoRR abs/2412.01800 (2024)
[i23]Liang Chen, Zekun Wang, Shuhuai Ren, Lei Li, Haozhe Zhao, Yunshui Li, Zefan Cai, Hongcheng Guo, Lei Zhang, Yizhe Xiong, Yichi Zhang, Ruoyu Wu, Qingxiu Dong, Ge Zhang, Jian Yang, Lingwei Meng, Shujie Hu, Yulong Chen, Junyang Lin, Shuai Bai, Andreas Vlachos, Xu Tan, Minjia Zhang, Wen Xiao, Aaron Yee, Tianyu Liu, Baobao Chang:
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey. CoRR abs/2412.18619 (2024)
[i22]Siyuan Fang, Kaijing Ma, Tianyu Zheng, Xinrun Du, Ningxuan Lu, Ge Zhang, Qingkun Tang:
KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model's Reasoning Path Aggregation. CoRR abs/2412.20995 (2024)
- 2023
[c6]Le Zhuo, Ruibin Yuan, Jiahao Pan, Yinghao Ma, Yizhi Li, Ge Zhang, Si Liu, Roger B. Dannenberg, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenhu Chen, Wei Xue, Yike Guo:
LyricWhiz: Robust Multilingual Zero-Shot Lyrics Transcription by Whispering to ChatGPT. ISMIR 2023: 343-351
[c5]Yinghao Ma, Ruibin Yuan, Yizhi Li, Ge Zhang, Chenghua Lin, Xingran Chen, Anton Ragni, Hanzhi Yin, Emmanouil Benetos, Norbert Gyenge, Ruibo Liu, Gus Xia, Roger B. Dannenberg, Yike Guo, Jie Fu:
On the Effectiveness of Speech Self-Supervised Learning for Music. ISMIR 2023: 457-465
[c4]Ruibin Yuan, Yinghao Ma, Yizhi Li, Ge Zhang, Xingran Chen, Hanzhi Yin, Le Zhuo, Yiqi Liu, Jiawen Huang, Zeyue Tian, Binyue Deng, Ningzhi Wang, Chenghua Lin, Emmanouil Benetos, Anton Ragni, Norbert Gyenge, Roger B. Dannenberg, Wenhu Chen, Gus Xia, Wei Xue, Si Liu, Shi Wang, Ruibo Liu, Yike Guo, Jie Fu:
MARBLE: Music Audio Representation Benchmark for Universal Evaluation. NeurIPS 2023
[i21]Ge Zhang, Yizhi Li, Yaoyao Wu, Linyuan Zhang, Chenghua Lin, Jiayi Geng, Shi Wang, Jie Fu:
CORGI-PM: A Chinese Corpus For Gender Bias Probing and Mitigation. CoRR abs/2301.00395 (2023)
[i20]Ge Zhang, Yemin Shi, Ruibo Liu, Ruibin Yuan, Yizhi Li, Siwei Dong, Yu Shu, Zhaoqun Li, Zekun Wang, Chenghua Lin, Wenhao Huang, Jie Fu:
Chinese Open Instruction Generalist: A Preliminary Release. CoRR abs/2304.07987 (2023)
[i19]Zekun Wang, Ge Zhang, Kexin Yang, Ning Shi, Wangchunshu Zhou, Shaochun Hao, Guangzheng Xiong, Yizhi Li, Mong Yuan Sim, Xiuying Chen, Qingqing Zhu, Zhenzhu Yang, Adam Nik, Qi Liu, Chenghua Lin, Shi Wang, Ruibo Liu, Wenhu Chen, Ke Xu, Dayiheng Liu, Yike Guo, Jie Fu:
Interactive Natural Language Processing. CoRR abs/2305.13246 (2023)
[i18]Xingran Chen, Ge Zhang, Jie Fu:
TPDM: Selectively Removing Positional Information for Zero-shot Translation via Token-Level Position Disentangle Module. CoRR abs/2305.19857 (2023)
[i17]Yizhi Li, Ruibin Yuan, Ge Zhang, Yinghao Ma, Xingran Chen, Hanzhi Yin, Chenghua Lin, Anton Ragni, Emmanouil Benetos, Norbert Gyenge, Roger B. Dannenberg, Ruibo Liu, Wenhu Chen, Gus Xia, Yemin Shi, Wenhao Huang, Yike Guo, Jie Fu:
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training. CoRR abs/2306.00107 (2023)
[i16]Ruibin Yuan, Yinghao Ma, Yizhi Li, Ge Zhang, Xingran Chen, Hanzhi Yin, Le Zhuo, Yiqi Liu, Jiawen Huang, Zeyue Tian, Binyue Deng, Ningzhi Wang, Chenghua Lin, Emmanouil Benetos, Anton Ragni, Norbert Gyenge, Roger B. Dannenberg, Wenhu Chen, Gus Xia, Wei Xue, Si Liu, Shi Wang, Ruibo Liu, Yike Guo, Jie Fu:
MARBLE: Music Audio Representation Benchmark for Universal Evaluation. CoRR abs/2306.10548 (2023)
[i15]Le Zhuo, Ruibin Yuan, Jiahao Pan, Yinghao Ma, Yizhi Li, Ge Zhang, Si Liu, Roger B. Dannenberg, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenhu Chen, Wei Xue, Yike Guo:
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT. CoRR abs/2306.17103 (2023)
[i14]Yinghao Ma, Ruibin Yuan, Yizhi Li, Ge Zhang, Xingran Chen, Hanzhi Yin, Chenghua Lin, Emmanouil Benetos, Anton Ragni, Norbert Gyenge, Ruibo Liu, Gus Xia, Roger B. Dannenberg, Yike Guo, Jie Fu:
On the Effectiveness of Speech Self-supervised Learning for Music. CoRR abs/2307.05161 (2023)
[i13]Xiang Yue, Xingwei Qu, Ge Zhang, Yao Fu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen:
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning. CoRR abs/2309.05653 (2023)
[i12]Zihao Deng, Yinghao Ma, Yudong Liu, Rongchen Guo, Ge Zhang, Wenhu Chen, Wenhao Huang, Emmanouil Benetos:
MusiLingo: Bridging Music and Text with Pre-trained Language Models for Music Captioning and Query Response. CoRR abs/2309.08730 (2023)
[i11]Guangyao Chen, Siwei Dong, Yu Shu, Ge Zhang, Jaward Sesay, Börje F. Karlsson, Jie Fu, Yemin Shi:
AutoAgents: A Framework for Automatic Agent Generation. CoRR abs/2309.17288 (2023)
[i10]Dongfu Jiang, Yishan Li, Ge Zhang, Wenhao Huang, Bill Yuchen Lin, Wenhu Chen:
TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks. CoRR abs/2310.00752 (2023)
[i9]Chenmien Tan, Ge Zhang, Jie Fu:
Massive Editing for Large Language Models via Meta Learning. CoRR abs/2311.04661 (2023)
[i8]Xiang Yue, Yuansheng Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen:
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI. CoRR abs/2311.16502 (2023)
[i7]Cong Wei, Yang Chen, Haonan Chen, Hexiang Hu, Ge Zhang, Jie Fu, Alan Ritter, Wenhu Chen:
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers. CoRR abs/2311.17136 (2023)
[i6]Chunpu Xu, Steffi Chern, Ethan Chern, Ge Zhang, Zekun Wang, Ruibo Liu, Jing Li, Jie Fu, Pengfei Liu:
Align on the Fly: Adapting Chatbot Behavior to Established Norms. CoRR abs/2312.15907 (2023)
- 2022
[c3]Adam Nik, Ge Zhang, Xingran Chen, Mingyu Li, Jie Fu:
1Cademy @ Causal News Corpus 2022: Leveraging Self-Training in Causality Classification of Socio-Political Event Data. CASE@EMNLP 2022: 91-99
[c2]Xingran Chen, Ge Zhang, Adam Nik, Mingyu Li, Jie Fu:
1Cademy @ Causal News Corpus 2022: Enhance Causal Span Detection via Beam-Search-based Position Selector. CASE@EMNLP 2022: 100-105
[c1]Yizhi Li, Ge Zhang, Bohao Yang, Chenghua Lin, Anton Ragni, Shi Wang, Jie Fu:
HERB: Measuring Hierarchical Regional Bias in Pre-trained Language Models. AACL/IJCNLP (Findings) 2022: 334-346
[i5]Xingran Chen, Ge Zhang, Adam Nik, Mingyu Li, Jie Fu:
1Cademy @ Causal News Corpus 2022: Enhance Causal Span Detection via Beam-Search-based Position Selector. CoRR abs/2210.17157 (2022)
[i4]Adam Nik, Ge Zhang, Xingran Chen, Mingyu Li, Jie Fu:
1Cademy @ Causal News Corpus 2022: Leveraging Self-Training in Causality Classification of Socio-Political Event Data. CoRR abs/2211.02729 (2022)
[i3]Yizhi Li, Ge Zhang, Bohao Yang, Chenghua Lin, Shi Wang, Anton Ragni, Jie Fu:
HERB: Measuring Hierarchical Regional Bias in Pre-trained Language Models. CoRR abs/2211.02882 (2022)
[i2]Yizhi Li, Ruibin Yuan, Ge Zhang, Yinghao Ma, Chenghua Lin, Xingran Chen, Anton Ragni, Hanzhi Yin, Zhijie Hu, Haoyu He, Emmanouil Benetos, Norbert Gyenge, Ruibo Liu, Jie Fu:
MAP-Music2Vec: A Simple and Effective Baseline for Self-Supervised Music Audio Representation Learning. CoRR abs/2212.02508 (2022)
- 2020
[i1]Ruibin Yuan, Ge Zhang, Anqiao Yang, Xinyue Zhang:
Diverse Melody Generation from Chinese Lyrics via Mutual Information Maximization. CoRR abs/2012.03805 (2020)
last updated on 2026-02-09 23:24 CET by the dblp team
all metadata released as open data under CC0 1.0 license