default search action

combined dblp search
author search
venue search
publication search

ask others

Haoyuan Li 0002

> Home > Persons

Person information

affiliation: Alibaba Group, Hangzhou, China
affiliation: Zhejiang University (ZJU), Hangzhou, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/HeFLWXSWZYLHGJ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/HeFLWXSWZYLHGJ25
Wanggui He, Siming Fu, Mushui Liu, Xierui Wang, Wenyi Xiao, Fangxun Shu, Yi Wang, Lei Zhang, Zhelun Yu, Haoyuan Li, Ziwei Huang, Leilei Gan, Hao Jiang:
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis. AAAI 2025: 17123-17131
[c14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/XiaoHGHLYSJZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/XiaoHGHLYSJZ25
Wenyi Xiao, Ziwei Huang, Leilei Gan, Wanggui He, Haoyuan Li, Zhelun Yu, Fangxun Shu, Hao Jiang, Linchao Zhu:
Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback. AAAI 2025: 25543-25551
[c13]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/HuangLYCJZT00LZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HuangLYCJZT00LZ25
Hongzhe Huang, Jiang Liu, Zhewen Yu, Li Cai, Dian Jiao, Wenqiao Zhang, Siliang Tang, Juncheng Li, Hao Jiang, Haoyuan Li, Yueting Zhuang:
Align²LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation. ACL (Findings) 2025: 8759-8781
[c12]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/LinLZDLYH00JTZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LinLZDLYH00JTZ25
Tianwei Lin, Jiang Liu, Wenqiao Zhang, Yang Dai, Haoyuan Li, Zhelun Yu, Wanggui He, Juncheng Li, Jiannan Guo, Hao Jiang, Siliang Tang, Yueting Zhuang:
TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition. ACL (1) 2025: 13622-13637
[c11]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/HuangHLWLYSDJ0G25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HuangHLWLYSDJ0G25
Ziwei Huang, Wanggui He, Quanyu Long, Yandi Wang, Haoyuan Li, Zhelun Yu, Fangxun Shu, Weilong Dai, Hao Jiang, Fei Wu, Leilei Gan:
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts. ACL (1) 2025: 27501-27524
[c10]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/DiYZLZCLHSJ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DiYZLZCLHSJ25
Shangzhe Di, Zhelun Yu, Guanghao Zhang, Haoyuan Li, Tao Zhong, Hao Cheng, Bolin Li, Wanggui He, Fangxun Shu, Hao Jiang:
Streaming Video Question-Answering with In-context Video KV-Cache Retrieval. ICLR 2025
[c9]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ShuLZZXZSCZYHFL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ShuLZZXZSCZYHFL25
Fangxun Shu, Yue Liao, Lei Zhang, Le Zhuo, Chenning Xu, Guanghao Zhang, Haonan Shi, Long Chan, Tao Zhong, Zhelun Yu, Wanggui He, Siming Fu, Haoyuan Li, Si Liu, Hongsheng Li, Hao Jiang:
LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation. ICLR 2025
[c8]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/LinZLYY0H00ST0L25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LinZLYY0H00ST0L25
Tianwei Lin, Wenqiao Zhang, Sijing Li, Yuqian Yuan, Binhe Yu, Haoyuan Li, Wanggui He, Hao Jiang, Mengze Li, Xiaohui Song, Siliang Tang, Jun Xiao, Hui Lin, Yueting Zhuang, Beng Chin Ooi:
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation. ICML 2025
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-09838
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-09838
Tianwei Lin, Wenqiao Zhang, Sijing Li, Yuqian Yuan, Binhe Yu, Haoyuan Li, Wanggui He, Hao Jiang, Mengze Li, Xiaohui Song, Siliang Tang, Jun Xiao, Hui Lin, Yueting Zhuang, Beng Chin Ooi:
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation. CoRR abs/2502.09838 (2025)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-00540
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-00540
Shangzhe Di, Zhelun Yu, Guanghao Zhang, Haoyuan Li, Tao Zhong, Hao Cheng, Bolin Li, Wanggui He, Fangxun Shu, Hao Jiang:
Streaming Video Question-Answering with In-context Video KV-Cache Retrieval. CoRR abs/2503.00540 (2025)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-01298
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-01298
Yi Wang, Mushui Liu, Wanggui He, Longxiang Zhang, Ziwei Huang, Guanghao Zhang, Fangxun Shu, Tao Zhong, Dong She, Zhelun Yu, Haoyuan Li, Weilong Dai, Mingli Song, Jie Song, Hao Jiang:
MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation. CoRR abs/2503.01298 (2025)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-05255
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-05255
Guanghao Zhang, Tao Zhong, Yan Xia, Zhelun Yu, Haoyuan Li, Wanggui He, Fangxun Shu, Mushui Liu, Dong She, Yi Wang, Hao Jiang:
CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation. CoRR abs/2503.05255 (2025)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2504-18458
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-18458
Wenyi Xiao, Leilei Gan, Weilong Dai, Wanggui He, Ziwei Huang, Haoyuan Li, Fangxun Shu, Zhelun Yu, Peng Zhang, Hao Jiang, Fei Wu:
Fast-Slow Thinking for Large Vision-Language Model Reasoning. CoRR abs/2504.18458 (2025)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-05831
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-05831
Yihan Xie, Sijing Li, Tianwei Lin, Zhuonan Wang, Chenglin Yang, Yu Zhong, Wenqiao Zhang, Haoyuan Li, Hao Jiang, Fengda Zhang, Qishan Chen, Jun Xiao, Yueting Zhuang, Beng Chin Ooi:
Heartcare Suite: Multi-dimensional Understanding of ECG with Raw Multi-lead Signal Modeling. CoRR abs/2506.05831 (2025)
2024
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/YinLSTZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/YinLSTZ24
Aoxiong Yin, Haoyuan Li, Kai Shen, Siliang Tang, Yueting Zhuang:
T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text. ACL (1) 2024: 3345-3356
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YinZLTZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YinZLTZ24
Aoxiong Yin, Tianyun Zhong, Haoyuan Li, Siliang Tang, Zhou Zhao:
Language Model is a Branch Predictor for Simultaneous Machine Translation. ICASSP 2024: 9976-9980
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/kdd/WangXHZ0LLLXZD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/kdd/WangXHZ0LLLXZD24
Ye Wang, Jiahao Xun, Minjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong:
EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration. KDD 2024: 3245-3254
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-13447
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-13447
Wenqiao Zhang, Tianwei Lin, Jiang Liu, Fangxun Shu, Haoyuan Li, Lei Zhang, Wanggui He, Hao Zhou, Zheqi Lv, Hao Jiang, Juncheng Li, Siliang Tang, Yueting Zhuang:
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models. CoRR abs/2403.13447 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-14233
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-14233
Wenyi Xiao, Ziwei Huang, Leilei Gan, Wanggui He, Haoyuan Li, Zhelun Yu, Hao Jiang, Fei Wu, Linchao Zhu:
Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback. CoRR abs/2404.14233 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07119
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07119
Aoxiong Yin, Haoyuan Li, Kai Shen, Siliang Tang, Yueting Zhuang:
T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text. CoRR abs/2406.07119 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-14017
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-14017
Ye Wang, Jiahao Xun, Mingjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong:
EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration. CoRR abs/2406.14017 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-07614
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-07614
Wanggui He, Siming Fu, Mushui Liu, Xierui Wang, Wenyi Xiao, Fangxun Shu, Yi Wang, Lei Zhang, Zhelun Yu, Haoyuan Li, Ziwei Huang, Leilei Gan, Hao Jiang:
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis. CoRR abs/2407.07614 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-09856
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-09856
Tianwei Lin, Jiang Liu, Wenqiao Zhang, Zhaocheng Li, Yang Dai, Haoyuan Li, Zhelun Yu, Wanggui He, Juncheng Li, Hao Jiang, Siliang Tang, Yueting Zhuang:
TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition. CoRR abs/2408.09856 (2024)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-15881
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-15881
Fangxun Shu, Yue Liao, Le Zhuo, Chenning Xu, Guanghao Zhang, Haonan Shi, Long Chen, Tao Zhong, Wanggui He, Siming Fu, Haoyuan Li, Bolin Li, Zhelun Yu, Si Liu, Hongsheng Li, Hao Jiang:
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation. CoRR abs/2408.15881 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-18541
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-18541
Hongzhe Huang, Zhewen Yu, Jiang Liu, Li Cai, Dian Jiao, Wenqiao Zhang, Siliang Tang, Juncheng Li, Hao Jiang, Haoyuan Li, Yueting Zhuang:
Align²LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation. CoRR abs/2409.18541 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-04300
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-04300
Ziwei Huang, Wanggui He, Quanyu Long, Yandi Wang, Haoyuan Li, Zhelun Yu, Fangxun Shu, Long Chan, Hao Jiang, Leilei Gan, Fei Wu:
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts. CoRR abs/2412.04300 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-19684
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-19684
Jiang Liu, Bolin Li, Haoyuan Li, Tianwei Lin, Wenqiao Zhang, Tao Zhong, Zhelun Yu, Jinghao Wei, Hao Cheng, Hao Jiang, Zheqi Lv, Juncheng Li, Siliang Tang, Yueting Zhuang:
Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework. CoRR abs/2412.19684 (2024)
2023
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/LiJJLCLZZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/LiJJLCLZZ23
Haoyuan Li, Hao Jiang, Tao Jin, Mengyan Li, Yan Chen, Zhijie Lin, Yang Zhao, Zhou Zhao:
DATE: Domain Adaptive Product Seeker for E-Commerce. CVPR 2023: 19315-19324
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-03669
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-03669
Haoyuan Li, Hao Jiang, Tao Jin, Mengyan Li, Yan Chen, Zhijie Lin, Yang Zhao, Zhou Zhao:
DATE: Domain Adaptive Product Seeker for E-commerce. CoRR abs/2304.03669 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-06622
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-06622
Haoyuan Li, Hao Jiang, Tianke Zhang, Zhelun Yu, Aoxiong Yin, Hao Cheng, Siming Fu, Yuhao Zhang, Wanggui He:
TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System. CoRR abs/2311.06622 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-13946
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-13946
Haoyuan Li, Zhou Zhao, Zhu Zhang, Zhijie Lin:
Weakly-Supervised Video Moment Retrieval via Regularized Two-Branch Proposal Networks with Erasing Mechanism. CoRR abs/2311.13946 (2023)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-14488
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-14488
Aoxiong Yin, Tianyun Zhong, Haoyuan Li, Siliang Tang, Zhou Zhao:
Language Model is a Branch Predictor for Simultaneous Machine Translation. CoRR abs/2312.14488 (2023)
2022
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/XiaZYZLR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/XiaZYZLR22
Yan Xia, Zhou Zhao, Shangwei Ye, Yang Zhao, Haoyuan Li, Yi Ren:
Video-Guided Curriculum Learning for Spoken Video Grounding. ACM Multimedia 2022: 5191-5200
[c2]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Zhao0HLZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Zhao0HLZ22
Yang Zhao, Chen Zhang, Haifeng Huang, Haoyuan Li, Zhou Zhao:
Towards Effective Multi-Modal Interchanges in Zero-Resource Sounding Object Localization. NeurIPS 2022
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-00277
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-00277
Yan Xia, Zhou Zhao, Shangwei Ye, Yang Zhao, Haoyuan Li, Yi Ren:
Video-Guided Curriculum Learning for Spoken Video Grounding. CoRR abs/2209.00277 (2022)
2021
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LinZLLZZ021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LinZLLZZ021
Zhijie Lin, Zhou Zhao, Haoyuan Li, Jinglin Liu, Meng Zhang, Xingshan Zeng, Xiaofei He:
SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory. ACM Multimedia 2021: 1359-1367
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-13630
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-13630
Zhijie Lin, Zhou Zhao, Haoyuan Li, Jinglin Liu, Meng Zhang, Xingshan Zeng, Xiaofei He:
SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory. CoRR abs/2108.13630 (2021)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.