


Haoyuan Li 0002
Person information
- affiliation: Alibaba Group, Hangzhou, China
- affiliation: Zhejiang University (ZJU), Hangzhou, China
Other persons with the same name
- Haoyuan Li (aka: Hao-Yuan Li) — disambiguation page
- Haoyuan Li 0001 — Alluxio Inc., San Mateo, CA, USA (and 2 more)
- Haoyuan Li 0003 — Northeastern University, College of Software, Shenyang, China (and 1 more)
- Haoyuan Li 0004 — Tsinghua University, Beijing, China
- Haoyuan Li 0005 — Wuhan University, School of Electronic Information, Wuhan, China
2020 – today
- 2025
[c15]Wanggui He, Siming Fu, Mushui Liu, Xierui Wang, Wenyi Xiao, Fangxun Shu, Yi Wang, Lei Zhang, Zhelun Yu, Haoyuan Li, Ziwei Huang, Leilei Gan, Hao Jiang:
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis. AAAI 2025: 17123-17131
[c14]Wenyi Xiao, Ziwei Huang, Leilei Gan, Wanggui He, Haoyuan Li, Zhelun Yu, Fangxun Shu, Hao Jiang, Linchao Zhu:
Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback. AAAI 2025: 25543-25551
[c13]Hongzhe Huang, Jiang Liu, Zhewen Yu, Li Cai, Dian Jiao, Wenqiao Zhang, Siliang Tang, Juncheng Li, Hao Jiang, Haoyuan Li, Yueting Zhuang:
Align²LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation. ACL (Findings) 2025: 8759-8781
[c12]Tianwei Lin, Jiang Liu, Wenqiao Zhang, Yang Dai, Haoyuan Li, Zhelun Yu, Wanggui He, Juncheng Li, Jiannan Guo, Hao Jiang, Siliang Tang, Yueting Zhuang:
TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition. ACL (1) 2025: 13622-13637
[c11]Ziwei Huang, Wanggui He, Quanyu Long, Yandi Wang, Haoyuan Li, Zhelun Yu, Fangxun Shu, Weilong Dai, Hao Jiang, Fei Wu, Leilei Gan:
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts. ACL (1) 2025: 27501-27524
[c10]Shangzhe Di, Zhelun Yu, Guanghao Zhang, Haoyuan Li, Tao Zhong, Hao Cheng, Bolin Li, Wanggui He, Fangxun Shu, Hao Jiang:
Streaming Video Question-Answering with In-context Video KV-Cache Retrieval. ICLR 2025
[c9]Fangxun Shu, Yue Liao, Lei Zhang, Le Zhuo, Chenning Xu, Guanghao Zhang, Haonan Shi, Long Chan, Tao Zhong, Zhelun Yu, Wanggui He, Siming Fu, Haoyuan Li, Si Liu, Hongsheng Li, Hao Jiang:
LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation. ICLR 2025
[c8]Tianwei Lin, Wenqiao Zhang, Sijing Li, Yuqian Yuan, Binhe Yu, Haoyuan Li, Wanggui He, Hao Jiang, Mengze Li, Xiaohui Song, Siliang Tang, Jun Xiao, Hui Lin, Yueting Zhuang, Beng Chin Ooi:
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation. ICML 2025
[i22]Tianwei Lin, Wenqiao Zhang, Sijing Li, Yuqian Yuan, Binhe Yu, Haoyuan Li, Wanggui He, Hao Jiang, Mengze Li, Xiaohui Song, Siliang Tang, Jun Xiao, Hui Lin, Yueting Zhuang, Beng Chin Ooi:
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation. CoRR abs/2502.09838 (2025)
[i21]Shangzhe Di, Zhelun Yu, Guanghao Zhang, Haoyuan Li, Tao Zhong, Hao Cheng, Bolin Li, Wanggui He, Fangxun Shu, Hao Jiang:
Streaming Video Question-Answering with In-context Video KV-Cache Retrieval. CoRR abs/2503.00540 (2025)
[i20]Yi Wang, Mushui Liu, Wanggui He, Longxiang Zhang, Ziwei Huang, Guanghao Zhang, Fangxun Shu, Tao Zhong, Dong She, Zhelun Yu, Haoyuan Li, Weilong Dai, Mingli Song, Jie Song, Hao Jiang:
MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation. CoRR abs/2503.01298 (2025)
[i19]Guanghao Zhang, Tao Zhong, Yan Xia, Zhelun Yu, Haoyuan Li, Wanggui He, Fangxun Shu, Mushui Liu, Dong She, Yi Wang, Hao Jiang:
CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation. CoRR abs/2503.05255 (2025)
[i18]Wenyi Xiao, Leilei Gan, Weilong Dai, Wanggui He, Ziwei Huang, Haoyuan Li, Fangxun Shu, Zhelun Yu, Peng Zhang, Hao Jiang, Fei Wu:
Fast-Slow Thinking for Large Vision-Language Model Reasoning. CoRR abs/2504.18458 (2025)
[i17]Yihan Xie, Sijing Li, Tianwei Lin, Zhuonan Wang, Chenglin Yang, Yu Zhong, Wenqiao Zhang, Haoyuan Li, Hao Jiang, Fengda Zhang, Qishan Chen, Jun Xiao, Yueting Zhuang, Beng Chin Ooi:
Heartcare Suite: Multi-dimensional Understanding of ECG with Raw Multi-lead Signal Modeling. CoRR abs/2506.05831 (2025)
- 2024
[c7]Aoxiong Yin, Haoyuan Li, Kai Shen, Siliang Tang, Yueting Zhuang:
T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text. ACL (1) 2024: 3345-3356
[c6]Aoxiong Yin, Tianyun Zhong, Haoyuan Li, Siliang Tang, Zhou Zhao:
Language Model is a Branch Predictor for Simultaneous Machine Translation. ICASSP 2024: 9976-9980
[c5]Ye Wang, Jiahao Xun, Minjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong:
EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration. KDD 2024: 3245-3254
[i16]Wenqiao Zhang, Tianwei Lin, Jiang Liu, Fangxun Shu, Haoyuan Li, Lei Zhang, Wanggui He, Hao Zhou, Zheqi Lv, Hao Jiang, Juncheng Li, Siliang Tang, Yueting Zhuang:
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models. CoRR abs/2403.13447 (2024)
[i15]Wenyi Xiao, Ziwei Huang, Leilei Gan, Wanggui He, Haoyuan Li, Zhelun Yu, Hao Jiang, Fei Wu, Linchao Zhu:
Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback. CoRR abs/2404.14233 (2024)
[i14]Aoxiong Yin, Haoyuan Li, Kai Shen, Siliang Tang, Yueting Zhuang:
T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text. CoRR abs/2406.07119 (2024)
[i13]Ye Wang, Jiahao Xun, Minjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong:
EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration. CoRR abs/2406.14017 (2024)
[i12]Wanggui He, Siming Fu, Mushui Liu, Xierui Wang, Wenyi Xiao, Fangxun Shu, Yi Wang, Lei Zhang, Zhelun Yu, Haoyuan Li, Ziwei Huang, Leilei Gan, Hao Jiang:
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis. CoRR abs/2407.07614 (2024)
[i11]Tianwei Lin, Jiang Liu, Wenqiao Zhang, Zhaocheng Li, Yang Dai, Haoyuan Li, Zhelun Yu, Wanggui He, Juncheng Li, Hao Jiang, Siliang Tang, Yueting Zhuang:
TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition. CoRR abs/2408.09856 (2024)
[i10]Fangxun Shu, Yue Liao, Le Zhuo, Chenning Xu, Guanghao Zhang, Haonan Shi, Long Chen, Tao Zhong, Wanggui He, Siming Fu, Haoyuan Li, Bolin Li, Zhelun Yu, Si Liu, Hongsheng Li, Hao Jiang:
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation. CoRR abs/2408.15881 (2024)
[i9]Hongzhe Huang, Zhewen Yu, Jiang Liu, Li Cai, Dian Jiao, Wenqiao Zhang, Siliang Tang, Juncheng Li, Hao Jiang, Haoyuan Li, Yueting Zhuang:
Align2LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation. CoRR abs/2409.18541 (2024)
[i8]Ziwei Huang, Wanggui He, Quanyu Long, Yandi Wang, Haoyuan Li, Zhelun Yu, Fangxun Shu, Long Chan, Hao Jiang, Leilei Gan, Fei Wu:
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts. CoRR abs/2412.04300 (2024)
[i7]Jiang Liu, Bolin Li, Haoyuan Li, Tianwei Lin, Wenqiao Zhang, Tao Zhong, Zhelun Yu, Jinghao Wei, Hao Cheng, Hao Jiang, Zheqi Lv, Juncheng Li, Siliang Tang, Yueting Zhuang:
Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework. CoRR abs/2412.19684 (2024)
- 2023
[c4]Haoyuan Li, Hao Jiang, Tao Jin, Mengyan Li, Yan Chen, Zhijie Lin, Yang Zhao, Zhou Zhao:
DATE: Domain Adaptive Product Seeker for E-Commerce. CVPR 2023: 19315-19324
[i6]Haoyuan Li, Hao Jiang, Tao Jin, Mengyan Li, Yan Chen, Zhijie Lin, Yang Zhao, Zhou Zhao:
DATE: Domain Adaptive Product Seeker for E-commerce. CoRR abs/2304.03669 (2023)
[i5]Haoyuan Li, Hao Jiang, Tianke Zhang, Zhelun Yu, Aoxiong Yin, Hao Cheng, Siming Fu, Yuhao Zhang, Wanggui He:
TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System. CoRR abs/2311.06622 (2023)
[i4]Haoyuan Li, Zhou Zhao, Zhu Zhang, Zhijie Lin:
Weakly-Supervised Video Moment Retrieval via Regularized Two-Branch Proposal Networks with Erasing Mechanism. CoRR abs/2311.13946 (2023)
[i3]Aoxiong Yin, Tianyun Zhong, Haoyuan Li, Siliang Tang, Zhou Zhao:
Language Model is a Branch Predictor for Simultaneous Machine Translation. CoRR abs/2312.14488 (2023)
- 2022
[c3]Yan Xia, Zhou Zhao, Shangwei Ye, Yang Zhao, Haoyuan Li, Yi Ren:
Video-Guided Curriculum Learning for Spoken Video Grounding. ACM Multimedia 2022: 5191-5200
[c2]Yang Zhao, Chen Zhang, Haifeng Huang, Haoyuan Li, Zhou Zhao:
Towards Effective Multi-Modal Interchanges in Zero-Resource Sounding Object Localization. NeurIPS 2022
[i2]Yan Xia, Zhou Zhao, Shangwei Ye, Yang Zhao, Haoyuan Li, Yi Ren:
Video-Guided Curriculum Learning for Spoken Video Grounding. CoRR abs/2209.00277 (2022)
- 2021
[c1]Zhijie Lin, Zhou Zhao, Haoyuan Li, Jinglin Liu, Meng Zhang, Xingshan Zeng, Xiaofei He:
SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory. ACM Multimedia 2021: 1359-1367
[i1]Zhijie Lin, Zhou Zhao, Haoyuan Li, Jinglin Liu, Meng Zhang, Xingshan Zeng, Xiaofei He:
SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory. CoRR abs/2108.13630 (2021)
last updated on 2025-12-17 22:12 CET by the dblp team
all metadata released as open data under CC0 1.0 license