Di Zhang 0026

Person information

affiliation: Kuaishou Technology, Beijing, China

Other persons with the same name

Di Zhang — disambiguation page
Di Zhang 0001 — Guangdong Medical College, School of Information Engineering, Dongguan, China (and 2 more)
Di Zhang 0002 — Zhengzhou University, School of Information Engineering, China (and 2 more)
Di Zhang 0003 — Waseda University, Graduate School of Advanced Science and Engineering, Tokyo, Japan
Di Zhang 0004 — University of Jyväskylä, Faculty of Information Technology, Finland
Di Zhang 0005 — Air Force Engineering University, Information and Navigation College, Xi'an, China
Di Zhang 0006 — Anhui University, College of Computer Science and Technology, Hefei, China
Di Zhang 0007 — Tsinghua University, State Key Laboratory of Hydroscience and Engineering, Beijing, China
Di Zhang 0008 — Wuhan University of Technology, National Engineering Research Centre for Water Transport Safety, Wuhan, China
Di Zhang 0009 — University of Waterloo, Ontario, Canada
Di Zhang 0010 — Beijing Jiaotong University, School of Software Engineering, China (and 1 more)
Di Zhang 0011 — Army Medical University (Third Military Medical University), Department of Information, Xinqiao Hospital, Chongqing, China (and 2 more)
Di Zhang 0012 — Naval Postgraduate School, Department of Electrical and Computer Engineering, Monterey, CA, USA
Di Zhang 0013 — University of Science and Technology of China, Hefei, China
Di Zhang 0014 — Wuhan University, School of Geodesy and Geomatics, China
Di Zhang 0015 — University of North Carolina at Charlotte, NC, USA
Di Zhang 0016 — Hikvision Digital Technology Co., Ltd., Hangzhou, China
Di Zhang 0017 — Huazhong University of Science and Technology, School of Physics, Wuhan, China
Di Zhang 0018 — University of Electronic Science and Technology of China, School of Information and Software Engineering, Chengdu, China
Di Zhang 0019 — Northwest Normal University, College of Computer Science and Engineering, Lanzhou, China
Di Zhang 0020 — China University of Mining and Technology, School of Computer Science and Technology, Xuzhou, China
Di Zhang 0021 — National University of Defense Technology, College of Meteorology and Oceanography, Changsha, China (and 1 more)
Di Zhang 0022 — East China Normal University, MoE Key Laboratory of Geographic Information Science, Shanghai, China
Di Zhang 0023 — China Jiliang University, National and Local Joint Engineering Laboratory of Disaster Monitoring Technology and Instruments, Hangzhou, China
Di Zhang 0024 — Beijing Technology and Business University, School of Computer and Artificial Intelligence, China
Di Zhang 0025 — South China University of Technology, School of Electric Power Engineering, Guangzhou, China (and 1 more)
Di Zhang 0027 — Macao Polytechnic University, Faculty of Humanities and Social Sciences, Macao
Di Zhang 0028 — Shenyang University of Technology, School of Artificial Intelligence, Liaoning, China (and 1 more)
Di Zhang 0029 — Beijing Jiaotong University, Department of Electrical Engineering, China
Di Zhang 0030 — Northwest Normal University, School of Educational Technology, Lanzhou, China
Di Zhang 0031 — Xi'an Jiaotong-Liverpool University, Suzhou, China (and 1 more)

2020 – today

2025
[j1]
persistent URL: https://dblp.org/rec/journals/pami/WangWLZZN25
Xiao Wang, Jianlong Wu, Zijia Lin, Fuzheng Zhang, Di Zhang, Liqiang Nie:
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding. IEEE Trans. Pattern Anal. Mach. Intell. 47(4): 2912-2923 (2025)
[c38]
persistent URL: https://dblp.org/rec/conf/aaai/0001ZWHLTZLYXWC25
Junxian Li, Di Zhang, Xunzhi Wang, Zeying Hao, Jingdi Lei, Qian Tan, Cai Zhou, Wei Liu, Yaotian Yang, Xinrui Xiong, Weiyun Wang, Zhe Chen, Wenhai Wang, Wei Li, Mao Su, Shufei Zhang, Wanli Ouyang, Yuqiang Li, Dongzhan Zhou:
ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area. AAAI 2025: 415-423
[c37]
persistent URL: https://dblp.org/rec/conf/acl/ChenZRGLZSLZT25
Xinlong Chen, Yuanxing Zhang, Chongling Rao, Yushuo Guan, Jiaheng Liu, Fuzheng Zhang, Chengru Song, Qiang Liu, Di Zhang, Tieniu Tan:
VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation. ACL (Findings) 2025: 8543-8563
[c36]
persistent URL: https://dblp.org/rec/conf/acl/WangHLZZWZN25
Xiao Wang, Jingyun Hua, Weihong Lin, Yuanxing Zhang, Fuzheng Zhang, Jianlong Wu, Di Zhang, Liqiang Nie:
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models. ACL (1) 2025: 10158-10181
[c35]
persistent URL: https://dblp.org/rec/conf/acl/LiSMXBXKYGZ25
Jiaze Li, Yaya Shi, Zongyang Ma, Haoran Xu, Yandong Bai, Huihui Xiao, Ruiwen Kang, Fan Yang, Tingting Gao, Di Zhang:
iMOVE: Instance-Motion-Aware Video Understanding. ACL (Findings) 2025: 23959-23975
[c34]
persistent URL: https://dblp.org/rec/conf/coling/LianCHXHD0NLZZ25
Haoran Lian, Junmin Chen, Wei Huang, Yizhe Xiong, Wenping Hu, Guiguang Ding, Hui Chen, Jianwei Niu, Zijia Lin, Fuzheng Zhang, Di Zhang:
Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models. COLING 2025: 4897-4909
[c33]
persistent URL: https://dblp.org/rec/conf/cvpr/FengHZLLWZW25
Wei-Qi Feng, Dong Han, Ze-Kang Zhou, Shunkai Li, Xiaoqiang Liu, Pengfei Wan, Di Zhang, Miao Wang:
GPAvatar: High-fidelity Head Avatars by Learning Efficient Gaussian Projections. CVPR 2025: 250-259
[c32]
persistent URL: https://dblp.org/rec/conf/cvpr/YeH00ZL25
Zixuan Ye, Huijuan Huang, Xintao Wang, Pengfei Wan, Di Zhang, Wenhan Luo:
StyleMaster: Stylize Your Video with Artistic Generation and Translation. CVPR 2025: 2630-2640
[c31]
persistent URL: https://dblp.org/rec/conf/cvpr/WangSOCLWJYZTYW25
Qiuheng Wang, Yukai Shi, Jiarong Ou, Rui Chen, Ke Lin, Jiahao Wang, Boyuan Jiang, Haotian Yang, Mingwu Zheng, Xin Tao, Fei Yang, Pengfei Wan, Di Zhang:
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content. CVPR 2025: 8428-8437
[c30]
persistent URL: https://dblp.org/rec/conf/cvpr/ZhangL0WL0LWYW025
Di Zhang, Jingdi Lei, Junxian Li, Xunzhi Wang, Yujie Liu, Zonglin Yang, Jiatong Li, Weida Wang, Suorong Yang, Jianbo Wu, Peng Ye, Wanli Ouyang, Dongzhan Zhou:
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning. CVPR 2025: 9050-9061
[c29]
persistent URL: https://dblp.org/rec/conf/cvpr/LiuYL0Z25
Zhuoman Liu, Weicai Ye, Yan Luximon, Pengfei Wan, Di Zhang:
Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation. CVPR 2025: 11016-11025
[c28]
persistent URL: https://dblp.org/rec/conf/cvpr/DuXLWWWZJ25
Shian Du, Menghan Xia, Chang Liu, Xintao Wang, Jing Wang, Pengfei Wan, Di Zhang, Xiangyang Ji:
PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution. CVPR 2025: 17799-17809
[c27]
persistent URL: https://dblp.org/rec/conf/cvpr/YinZZLOCHWT0ZYZ25
Yuanyang Yin, Yaqi Zhao, Mingwu Zheng, Ke Lin, Jiarong Ou, Rui Chen, Victor Shea-Jay Huang, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Baoqun Yin, Wentao Zhang, Kun Gai:
Towards Precise Scaling Laws for Video Diffusion Transformers. CVPR 2025: 18155-18165
[c26]
persistent URL: https://dblp.org/rec/conf/cvpr/Liu0WY0Z025
Feng-Lin Liu, Hongbo Fu, Xintao Wang, Weicai Ye, Pengfei Wan, Di Zhang, Lin Gao:
SketchVideo: Sketch-based Video Generation and Editing. CVPR 2025: 23379-23390
[c25]
persistent URL: https://dblp.org/rec/conf/iclr/BaiX0YLH0Z25
Jianhong Bai, Menghan Xia, Xintao Wang, Ziyang Yuan, Zuozhu Liu, Haoji Hu, Pengfei Wan, Di Zhang:
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints. ICLR 2025
[c24]
persistent URL: https://dblp.org/rec/conf/iclr/ChenZLDSrXW0GZ25
Jiankang Chen, Tianke Zhang, Changyi Liu, Haojie Ding, Yaya Shi, Cheng Feng, Huihui Xiao, Bin Wen, Fan Yang, Tingting Gao, Di Zhang:
TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types. ICLR 2025
[c23]
persistent URL: https://dblp.org/rec/conf/iclr/ChenZZLZZ0Z025
Hejia Chen, Haoxian Zhang, Shoulong Zhang, Xiaoqiang Liu, Sisi Zhuang, Yuan Zhang, Pengfei Wan, Di Zhang, Shuai Li:
Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control. ICLR 2025
[c22]
persistent URL: https://dblp.org/rec/conf/iclr/Fan0KYZ0TT25
Qi Fan, Xin Tao, Lei Ke, Mingqiao Ye, Di Zhang, Pengfei Wan, Yu-Wing Tai, Chi-Keung Tang:
Stable Segment Anything Model. ICLR 2025
[c21]
persistent URL: https://dblp.org/rec/conf/iclr/FuLWPXSY0ZL25
Xiao Fu, Xian Liu, Xintao Wang, Sida Peng, Menghan Xia, Xiaoyu Shi, Ziyang Yuan, Pengfei Wan, Di Zhang, Dahua Lin:
3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation. ICLR 2025
[c20]
persistent URL: https://dblp.org/rec/conf/naacl/ZhangWLCLXHZPLOZ25
Di Zhang, Jianbo Wu, Jingdi Lei, Tong Che, Jiatong Li, Tong Xie, Xiaoshui Huang, Shufei Zhang, Marco Pavone, Yuqiang Li, Wanli Ouyang, Dongzhan Zhou:
LLaMA-Berry: Pairwise Optimization for Olympiad-level Mathematical Reasoning via O1-like Monte Carlo Tree Search. NAACL (Long Papers) 2025: 7315-7337
[i101]
persistent URL: https://dblp.org/rec/journals/corr/abs-2501-04698
Yuzhou Huang, Ziyang Yuan, Quande Liu, Qiulin Wang, Xintao Wang, Ruimao Zhang, Pengfei Wan, Di Zhang, Kun Gai:
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning. CoRR abs/2501.04698 (2025)
[i100]
persistent URL: https://dblp.org/rec/journals/corr/abs-2501-08325
Jiwen Yu, Yiran Qin, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu:
GameFactory: Creating New Games with Generative Interactive Videos. CoRR abs/2501.08325 (2025)
[i99]
persistent URL: https://dblp.org/rec/journals/corr/abs-2501-13918
Jie Liu, Gongye Liu, Jiajun Liang, Ziyang Yuan, Xiaokun Liu, Mingwu Zheng, Xiele Wu, Qiulin Wang, Wenyu Qin, Menghan Xia, Xintao Wang, Xiaohong Liu, Fei Yang, Pengfei Wan, Di Zhang, Kun Gai, Yujiu Yang, Wanli Ouyang:
Improving Video Generation with Human Feedback. CoRR abs/2501.13918 (2025)
[i98]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-08639
Qinghe Wang, Yawen Luo, Xiaoyu Shi, Xu Jia, Huchuan Lu, Tianfan Xue, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai:
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation. CoRR abs/2502.08639 (2025)
[i97]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-09925
Jiankang Chen, Tianke Zhang, Changyi Liu, Haojie Ding, Yaya Shi, Feng Cheng, Huihui Xiao, Bin Wen, Fan Yang, Tingting Gao, Di Zhang:
TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types. CoRR abs/2502.09925 (2025)
[i96]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-10391
Yifan Zhang, Tao Yu, Haochen Tian, Chaoyou Fu, Peiyan Li, Jianshu Zeng, Wulin Xie, Yang Shi, Huanyu Zhang, Junkang Wu, Xue Wang, Yibo Hu, Bin Wen, Fan Yang, Zhang Zhang, Tingting Gao, Di Zhang, Liang Wang, Rong Jin, Tieniu Tan:
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment. CoRR abs/2502.10391 (2025)
[i95]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-11594
Jiaze Li, Yaya Shi, Zongyang Ma, Haoran Xu, Feng Cheng, Huihui Xiao, Ruiwen Kang, Fan Yang, Tingting Gao, Di Zhang:
iMOVE: Instance-Motion-Aware Video Understanding. CoRR abs/2502.11594 (2025)
[i94]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-12455
Minxuan Lv, Zhenpeng Su, Leiyu Pan, Yizhe Xiong, Zijia Lin, Hui Chen, Wei Zhou, Jungong Han, Guiguang Ding, Cheng Luo, Di Zhang, Kun Gai, Songlin Hu:
DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs. CoRR abs/2502.12455 (2025)
[i93]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-12782
Xinlong Chen, Yuanxing Zhang, Chongling Rao, Yushuo Guan, Jiaheng Liu, Fuzheng Zhang, Chengru Song, Qiang Liu, Di Zhang, Tieniu Tan:
VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation. CoRR abs/2502.12782 (2025)
[i92]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-12928
Leiyu Pan, Zhenpeng Su, Minxuan Lv, Yizhe Xiong, Xiangwen Zhang, Zijia Lin, Hui Chen, Jungong Han, Guiguang Ding, Cheng Luo, Di Zhang, Kun Gai, Deyi Xiong:
Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts. CoRR abs/2502.12928 (2025)
[i91]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-13472
Borui Liao, Yulong Xu, Jiao Ou, Kaiyuan Yang, Weihua Jian, Pengfei Wan, Di Zhang:
FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems. CoRR abs/2502.13472 (2025)
[i90]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-13516
Hao Yi, Qingyang Li, Yulan Hu, Fuzheng Zhang, Di Zhang, Yong Liu:
SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin. CoRR abs/2502.13516 (2025)
[i89]
persistent URL: https://dblp.org/rec/journals/corr/abs-2502-20811
Xiao Wang, Jingyun Hua, Weihong Lin, Yuanxing Zhang, Fuzheng Zhang, Jianlong Wu, Di Zhang, Liqiang Nie:
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models. CoRR abs/2502.20811 (2025)
[i88]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-02537
Zhen Yang, Guibao Shen, Liang Hou, Mushui Liu, Luozhou Wang, Xin Tao, Pengfei Wan, Di Zhang, Ying-Cong Chen:
RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification. CoRR abs/2503.02537 (2025)
[i87]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-06499
Xukun Zhou, Fengxin Li, Ming Chen, Yan Zhou, Pengfei Wan, Di Zhang, Yeying Jin, Zhaoxin Fan, Hongyan Liu, Jun He:
ExGes: Expressive Human Motion Retrieval and Modulation for Audio-Driven Gesture Synthesis. CoRR abs/2503.06499 (2025)
[i86]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-09143
Haoyu Zhang, Qiaohui Chu, Meng Liu, Yunxiao Wang, Bin Wen, Fan Yang, Tingting Gao, Di Zhang, Yaowei Wang, Liqiang Nie:
Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding. CoRR abs/2503.09143 (2025)
[i85]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-09994
Yunxiao Wang, Meng Liu, Rui Shao, Haoyu Zhang, Bin Wen, Fan Yang, Tingting Gao, Di Zhang, Liqiang Nie:
TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs. CoRR abs/2503.09994 (2025)
[i84]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-11647
Jianhong Bai, Menghan Xia, Xiao Fu, Xintao Wang, Lianrui Mu, Jinwen Cao, Zuozhu Liu, Haoji Hu, Xiang Bai, Pengfei Wan, Di Zhang:
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video. CoRR abs/2503.11647 (2025)
[i83]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-14487
Minglei Shi, Ziyang Yuan, Haotian Yang, Xintao Wang, Mingwu Zheng, Xin Tao, Wenliang Zhao, Wenzhao Zheng, Jie Zhou, Jiwen Lu, Pengfei Wan, Di Zhang, Kun Gai:
DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers. CoRR abs/2503.14487 (2025)
[i82]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-14517
Hejia Chen, Haoxian Zhang, Shoulong Zhang, Xiaoqiang Liu, Sisi Zhuang, Yuan Zhang, Pengfei Wan, Di Zhang, Shuai Li:
Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control. CoRR abs/2503.14517 (2025)
[i81]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-17359
Jiwen Yu, Yiran Qin, Haoxuan Che, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu:
Position: Interactive Generative Video as Next-Generation Game Engine. CoRR abs/2503.17359 (2025)
[i80]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-18719
Cong Liu, Liang Hou, Mingwu Zheng, Xin Tao, Pengfei Wan, Di Zhang, Kun Gai:
Boosting Resolution Generalization of Diffusion Transformers with Randomized Positional Encodings. CoRR abs/2503.18719 (2025)
[i79]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-19907
Xuan Ju, Weicai Ye, Quande Liu, Qiulin Wang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Qiang Xu:
FullDiT: Multi-Task Video Generative Foundation Model with Full Attention. CoRR abs/2503.19907 (2025)
[i78]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-20202
Nan Gao, Yihua Bao, Dongdong Weng, Jiayi Zhao, Jia Li, Yan Zhou, Pengfei Wan, Di Zhang:
SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain. CoRR abs/2503.20202 (2025)
[i77]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-23284
Feng-Lin Liu, Hongbo Fu, Xintao Wang, Weicai Ye, Pengfei Wan, Di Zhang, Lin Gao:
SketchVideo: Sketch-based Video Generation and Editing. CoRR abs/2503.23284 (2025)
[i76]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-23907
Zhichao Liao, Xiaokun Liu, Wenyu Qin, Qingyu Li, Qiulin Wang, Pengfei Wan, Di Zhang, Long Zeng, Pingfa Feng:
HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment. CoRR abs/2503.23907 (2025)
[i75]
persistent URL: https://dblp.org/rec/journals/corr/abs-2503-24379
Shengqiong Wu, Weicai Ye, Jiahao Wang, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Shuicheng Yan, Hao Fei, Tat-Seng Chua:
Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation. CoRR abs/2503.24379 (2025)
[i74]
persistent URL: https://dblp.org/rec/journals/corr/abs-2504-06122
Jingyuan Zhang, Qi Wang, Xingguang Ji, Yahui Liu, Yang Yue, Fuzheng Zhang, Di Zhang, Guorui Zhou, Kun Gai:
Leanabell-Prover: Posttraining Scaling in Formal Reasoning. CoRR abs/2504.06122 (2025)
[i73]
persistent URL: https://dblp.org/rec/journals/corr/abs-2504-08809
Wei Chen, Xin Yan, Bin Wen, Fan Yang, Tingting Gao, Di Zhang, Long Chen:
Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models. CoRR abs/2504.08809 (2025)
[i72]
persistent URL: https://dblp.org/rec/journals/corr/abs-2504-10068
Yang Shi, Jiaheng Liu, Yushuo Guan, Zhenhua Wu, Yuanxing Zhang, Zihao Wang, Weihong Lin, Jingyun Hua, Zekun Wang, Xinlong Chen, Bohan Zeng, Wentao Zhang, Fuzheng Zhang, Wenjing Yang, Di Zhang:
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model. CoRR abs/2504.10068 (2025)
[i71]
persistent URL: https://dblp.org/rec/journals/corr/abs-2504-10329
Xingyu Lu, Yuhang Hu, Yifan Zhang, Kaiyu Jiang, Changyi Liu, Tianke Zhang, Jinpeng Wang, Chun Yuan, Bin Wen, Fan Yang, Tingting Gao, Di Zhang:
InstructEngine: Instruction-driven Text-to-Image Alignment. CoRR abs/2504.10329 (2025)
[i70]
persistent URL: https://dblp.org/rec/journals/corr/abs-2504-14858
Jiaqi Wei, Hao Zhou, Xiang Zhang, Di Zhang, Zijie Qiu, Wei Wei, Jinzhe Li, Wanli Ouyang, Siqi Sun:
AlignRAG: An Adaptable Framework for Resolving Misalignments in Retrieval-Aware Reasoning of RAG. CoRR abs/2504.14858 (2025)
[i69]
persistent URL: https://dblp.org/rec/journals/corr/abs-2504-14904
Xingyu Lu, Tianke Zhang, Chang Meng, Xiaobei Wang, Jinpeng Wang, Yifan Zhang, Shisong Tang, Changyi Liu, Haojie Ding, Kaiyu Jiang, Kaiyu Tang, Bin Wen, Hai-Tao Zheng, Fan Yang, Tingting Gao, Di Zhang, Kun Gai:
VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform. CoRR abs/2504.14904 (2025)
[i68]
persistent URL: https://dblp.org/rec/journals/corr/abs-2504-21853
Jiwen Yu, Yiran Qin, Haoxuan Che, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Hao Chen, Xihui Liu:
A Survey of Interactive Generative Video. CoRR abs/2504.21853 (2025)
[i67]
persistent URL: https://dblp.org/rec/journals/corr/abs-2505-05470
Jie Liu, Gongye Liu, Jiajun Liang, Yangguang Li, Jiaheng Liu, Xintao Wang, Pengfei Wan, Di Zhang, Wanli Ouyang:
Flow-GRPO: Training Flow Matching Models via Online RL. CoRR abs/2505.05470 (2025)
[i66]
persistent URL: https://dblp.org/rec/journals/corr/abs-2505-17618
Haoran He, Jiajun Liang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Ling Pan:
Scaling Image and Video Generation via Test-Time Evolutionary Search. CoRR abs/2505.17618 (2025)
[i65]
persistent URL: https://dblp.org/rec/journals/corr/abs-2505-17873
Wanhao Liu, Zonglin Yang, Jue Wang, Lidong Bing, Di Zhang, Dongzhan Zhou, Yuqiang Li, Houqiang Li, Erik Cambria, Wanli Ouyang:
MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback. CoRR abs/2505.17873 (2025)
[i64]
persistent URL: https://dblp.org/rec/journals/corr/abs-2505-21448
Ziqiao Peng, Jiwen Liu, Haoxian Zhang, Xiaoqiang Liu, Songlin Tang, Pengfei Wan, Di Zhang, Hongyan Liu, Jun He:
OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers. CoRR abs/2505.21448 (2025)
[i63]
persistent URL: https://dblp.org/rec/journals/corr/abs-2506-00189
Di Zhang, Weida Wang, Junxian Li, Xunzhi Wang, Jiatong Li, Jianbo Wu, Jingdi Lei, Haonan He, Peng Ye, Shufei Zhang, Wanli Ouyang, Yuqiang Li, Dongzhan Zhou:
Control-R: Towards controllable test-time scaling. CoRR abs/2506.00189 (2025)
[i62]
persistent URL: https://dblp.org/rec/journals/corr/abs-2506-01943
Xiao Fu, Xintao Wang, Xian Liu, Jianhong Bai, Runsen Xu, Pengfei Wan, Di Zhang, Dahua Lin:
Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control. CoRR abs/2506.01943 (2025)
[i61]
dblp key: journals/corr/abs-2506-03140
persistent URL: https://dblp.org/rec/journals/corr/abs-2506-03140
Yawen Luo, Jianhong Bai, Xiaoyu Shi, Menghan Xia, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Tianfan Xue:
CamCloneMaster: Enabling Reference-based Camera Control for Video Generation. CoRR abs/2506.03140 (2025)
[i60]
dblp key: journals/corr/abs-2506-03141
persistent URL: https://dblp.org/rec/journals/corr/abs-2506-03141
Jiwen Yu, Jianhong Bai, Yiran Qin, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu:
Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval. CoRR abs/2506.03141 (2025)
[i59]
dblp key: journals/corr/abs-2506-04213
persistent URL: https://dblp.org/rec/journals/corr/abs-2506-04213
Xuanhua He, Quande Liu, Zixuan Ye, Weicai Ye, Qiulin Wang, Xintao Wang, Qifeng Chen, Pengfei Wan, Di Zhang, Kun Gai:
FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers. CoRR abs/2506.04213 (2025)
[i58]
dblp key: journals/corr/abs-2506-04216
persistent URL: https://dblp.org/rec/journals/corr/abs-2506-04216
Zixuan Ye, Xuanhua He, Quande Liu, Qiulin Wang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Qifeng Chen, Wenhan Luo:
UNIC: Unified In-Context Video Editing. CoRR abs/2506.04216 (2025)
[i57]
dblp key: journals/corr/abs-2506-17912
persistent URL: https://dblp.org/rec/journals/corr/abs-2506-17912
Chuhao Jin, Haosen Li, Bingzi Zhang, Che Liu, Xiting Wang, Ruihua Song, Wenbing Huang, Ying Qin, Fuzheng Zhang, Di Zhang:
PlanMoGPT: Flow-Enhanced Progressive Planning for Text to Motion Synthesis. CoRR abs/2506.17912 (2025)
[i56]
dblp key: journals/corr/abs-2506-19774
persistent URL: https://dblp.org/rec/journals/corr/abs-2506-19774
Jun Wang, Xijuan Zeng, Chunyu Qiang, Ruilong Chen, Shiyao Wang, Le Wang, Wangjing Zhou, Pengfei Cai, Jiahui Zhao, Nan Li, Zihan Li, Yuzhe Liang, Xiaopeng Wang, Haorui Zheng, Ming Wen, Kang Yin, Yiran Wang, Nan Li, Feng Deng, Liang Dong, Chen Zhang, Di Zhang, Kun Gai:
Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation. CoRR abs/2506.19774 (2025)
[i55]
dblp key: journals/corr/abs-2506-23858
persistent URL: https://dblp.org/rec/journals/corr/abs-2506-23858
Jianzong Wu, Liang Hou, Haotian Yang, Xin Tao, Ye Tian, Pengfei Wan, Di Zhang, Yunhai Tong:
VMoBA: Mixture-of-Block Attention for Video Diffusion Models. CoRR abs/2506.23858 (2025)
[i54]
dblp key: journals/corr/abs-2507-13345
persistent URL: https://dblp.org/rec/journals/corr/abs-2507-13345
Yukai Shi, Jiarong Ou, Rui Chen, Haotian Yang, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Kun Gai:
Imbalance in Balance: Online Concept Balancing in Generation Models. CoRR abs/2507.13345 (2025)
[i53]
dblp key: journals/corr/abs-2508-00733
persistent URL: https://dblp.org/rec/journals/corr/abs-2508-00733
Le Wang, Jun Wang, Chunyu Qiang, Feng Deng, Chen Zhang, Di Zhang, Kun Gai:
AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation. CoRR abs/2508.00733 (2025)
[i52]
dblp key: journals/corr/abs-2508-07926
persistent URL: https://dblp.org/rec/journals/corr/abs-2508-07926
Liang Hou, Yuan Gao, Boyuan Jiang, Xin Tao, Qi Yan, Renjie Liao, Pengfei Wan, Di Zhang, Kun Gai:
Score Augmentation for Diffusion Models. CoRR abs/2508.07926 (2025)
[i51]
dblp key: journals/corr/abs-2508-08401
persistent URL: https://dblp.org/rec/journals/corr/abs-2508-08401
Jiatong Li, Weida Wang, Qinggang Zhang, Junxian Li, Di Zhang, Changmeng Zheng, Shufei Zhang, Xiaoyong Wei, Qing Li:
Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery. CoRR abs/2508.08401 (2025)
[i50]
dblp key: journals/corr/abs-2508-14313
persistent URL: https://dblp.org/rec/journals/corr/abs-2508-14313
Can Jin, Yang Zhou, Qixin Zhang, Hongwu Peng, Di Zhang, Marco Pavone, Ligong Han, Zhang-Wei Hong, Tong Che, Dimitris N. Metaxas:
Your Reward Function for RL is Your Best PRM for Search: Unifying RL and Search-Based TTS. CoRR abs/2508.14313 (2025)
2024
[c19]
dblp key: conf/acl/LinFLLGWZ0ZG24
persistent URL: https://dblp.org/rec/conf/acl/LinFLLGWZ0ZG24
Lei Lin, Jia-Yi Fu, Pengli Liu, Qingyang Li, Yan Gong, Junchen Wan, Fuzheng Zhang, Zhongyuan Wang, Di Zhang, Kun Gai:
Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios. ACL (Findings) 2024: 3829-3852
[c18]
dblp key: conf/acl/ChenZZWZZW24
persistent URL: https://dblp.org/rec/conf/acl/ChenZZWZZW24
Zhipeng Chen, Kun Zhou, Xin Zhao, Junchen Wan, Fuzheng Zhang, Di Zhang, Ji-Rong Wen:
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint. ACL (Findings) 2024: 5694-5711
[c17]
dblp key: conf/acl/SunLZHSZZZG24
persistent URL: https://dblp.org/rec/conf/acl/SunLZHSZZZG24
Yuchong Sun, Che Liu, Kun Zhou, Jinwen Huang, Ruihua Song, Xin Zhao, Fuzheng Zhang, Di Zhang, Kun Gai:
Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models. ACL (1) 2024: 9729-9750
[c16]
dblp key: conf/coling/SunZLZZ0CSZGX24
persistent URL: https://dblp.org/rec/conf/coling/SunZLZZ0CSZGX24
Chenxi Sun, Hongzhi Zhang, Zijia Lin, Jingyuan Zhang, Fuzheng Zhang, Zhongyuan Wang, Bin Chen, Chengru Song, Di Zhang, Kun Gai, Deyi Xiong:
Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs. LREC/COLING 2024: 4476-4487
[c15]
dblp key: conf/cvpr/ZhangWWLGZ024
persistent URL: https://dblp.org/rec/conf/cvpr/ZhangWWLGZ024
Sixian Zhang, Bohan Wang, Junqiang Wu, Yan Li, Tingting Gao, Di Zhang, Zhongyuan Wang:
Learning Multi-Dimensional Human Preference for Text-to-Image Generation. CVPR 2024: 8018-8027
[c14]
dblp key: conf/emnlp/ChengLZZZZGW24
persistent URL: https://dblp.org/rec/conf/emnlp/ChengLZZZZGW24
Xiaoxue Cheng, Junyi Li, Xin Zhao, Hongzhi Zhang, Fuzheng Zhang, Di Zhang, Kun Gai, Ji-Rong Wen:
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector. EMNLP 2024: 14600-14615
[c13]
dblp key: conf/emnlp/OuWLZZG24
persistent URL: https://dblp.org/rec/conf/emnlp/OuWLZZG24
Jiao Ou, Jiayu Wu, Che Liu, Fuzheng Zhang, Di Zhang, Kun Gai:
Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues. EMNLP 2024: 17402-17431
[c12]
dblp key: conf/iclr/Jin0XCLTHCSMZOG24
persistent URL: https://dblp.org/rec/conf/iclr/Jin0XCLTHCSMZOG24
Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chengru Song, Dai Meng, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu:
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization. ICLR 2024
[c11]
dblp key: conf/icml/Jin00XCJHSLZ0GM24
persistent URL: https://dblp.org/rec/conf/icml/Jin00XCJHSLZ0GM24
Yang Jin, Zhicheng Sun, Kun Xu, Kun Xu, Liwei Chen, Hao Jiang, Quzhe Huang, Chengru Song, Yuliang Liu, Di Zhang, Yang Song, Kun Gai, Yadong Mu:
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization. ICML 2024
[c10]
dblp key: conf/mm/HuangSWQXZWZ024
persistent URL: https://dblp.org/rec/conf/mm/HuangSWQXZWZ024
Shuo Huang, Shikun Sun, Zixuan Wang, Xiaoyu Qin, Yanmin Xiong, Yuan Zhang, Pengfei Wan, Di Zhang, Jia Jia:
PlacidDreamer: Advancing Harmony in Text-to-3D Generation. ACM Multimedia 2024: 6880-6889
[c9]
dblp key: conf/naacl/OuLLTZZG24
persistent URL: https://dblp.org/rec/conf/naacl/OuLLTZZG24
Jiao Ou, Junda Lu, Che Liu, Yihong Tang, Fuzheng Zhang, Di Zhang, Kun Gai:
DialogBench: Evaluating LLMs as Human-like Dialogue Systems. NAACL-HLT 2024: 6137-6170
[c8]
dblp key: conf/nips/TianYYGDWYTWZ024
persistent URL: https://dblp.org/rec/conf/nips/TianYYGDWYTWZ024
Ye Tian, Ling Yang, Haotian Yang, Yuan Gao, Yufan Deng, Xintao Wang, Zhaochen Yu, Xin Tao, Pengfei Wan, Di Zhang, Bin Cui:
VideoTetris: Towards Compositional Text-to-Video Generation. NeurIPS 2024
[c7]
dblp key: conf/siggraph/GuoZHGDWZLHZHM24
persistent URL: https://dblp.org/rec/conf/siggraph/GuoZHGDWZLHZHM24
Xun Guo, Mingwu Zheng, Liang Hou, Yuan Gao, Yufan Deng, Pengfei Wan, Di Zhang, Yufan Liu, Weiming Hu, Zhengjun Zha, Haibin Huang, Chongyang Ma:
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models. SIGGRAPH (Conference Paper Track) 2024: 112
[c6]
dblp key: conf/siggraph/YangHHMWZC024
persistent URL: https://dblp.org/rec/conf/siggraph/YangHHMWZC024
Shiyuan Yang, Liang Hou, Haibin Huang, Chongyang Ma, Pengfei Wan, Di Zhang, Xiaodong Chen, Jing Liao:
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion. SIGGRAPH (Conference Paper Track) 2024: 113
[c5]
dblp key: conf/siggrapha/ZhengQJMHZW024
persistent URL: https://dblp.org/rec/conf/siggrapha/ZhengQJMHZW024
Yujian Zheng, Yuda Qiu, Leyang Jin, Chongyang Ma, Haibin Huang, Di Zhang, Pengfei Wan, Xiaoguang Han:
Towards Unified 3D Hair Reconstruction from Single-View Portraits. SIGGRAPH Asia 2024: 114:1-114:11
[i49]
dblp key: journals/corr/abs-2401-06081
persistent URL: https://dblp.org/rec/journals/corr/abs-2401-06081
Zhipeng Chen, Kun Zhou, Wayne Xin Zhao, Junchen Wan, Fuzheng Zhang, Di Zhang, Ji-Rong Wen:
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint. CoRR abs/2401.06081 (2024)
[i48]
dblp key: journals/corr/abs-2402-03161
persistent URL: https://dblp.org/rec/journals/corr/abs-2402-03161
Yang Jin, Zhicheng Sun, Kun Xu, Kun Xu, Liwei Chen, Hao Jiang, Quzhe Huang, Chengru Song, Yuliang Liu, Di Zhang, Yang Song, Kun Gai, Yadong Mu:
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization. CoRR abs/2402.03161 (2024)
[i47]
dblp key: journals/corr/abs-2402-03162
persistent URL: https://dblp.org/rec/journals/corr/abs-2402-03162
Shiyuan Yang, Liang Hou, Haibin Huang, Chongyang Ma, Pengfei Wan, Di Zhang, Xiaodong Chen, Jing Liao:
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion. CoRR abs/2402.03162 (2024)
[i46]
dblp key: journals/corr/abs-2402-06852
persistent URL: https://dblp.org/rec/journals/corr/abs-2402-06852
Di Zhang, Wei Liu, Qian Tan, Jingdan Chen, Hang Yan, Yuliang Yan, Jiatong Li, Weiran Huang, Xiangyu Yue, Dongzhan Zhou, Shufei Zhang, Mao Su, Hansen Zhong, Yuqiang Li, Wanli Ouyang:
ChemLLM: A Chemical Large Language Model. CoRR abs/2402.06852 (2024)
[i45]
dblp key: journals/corr/abs-2403-20193
persistent URL: https://dblp.org/rec/journals/corr/abs-2403-20193
Luozhou Wang, Guibao Shen, Yixun Liang, Xin Tao, Pengfei Wan, Di Zhang, Yijun Li, Yingcong Chen:
Motion Inversion for Video Customization. CoRR abs/2403.20193 (2024)
[i44]
dblp key: journals/corr/abs-2404-09619
persistent URL: https://dblp.org/rec/journals/corr/abs-2404-09619
Zhaokun Zhou, Qiulin Wang, Bin Lin, Yiwei Su, Rui Chen, Xin Tao, Amin Zheng, Li Yuan, Pengfei Wan, Di Zhang:
UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark. CoRR abs/2404.09619 (2024)
[i43]
dblp key: journals/corr/abs-2404-11095
persistent URL: https://dblp.org/rec/journals/corr/abs-2404-11095
Jiao Ou, Jiayu Wu, Che Liu, Fuzheng Zhang, Di Zhang, Kun Gai:
Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues. CoRR abs/2404.11095 (2024)
[i42]
dblp key: journals/corr/abs-2405-14677
persistent URL: https://dblp.org/rec/journals/corr/abs-2405-14677
Zhicheng Sun, Zhenhao Yang, Yang Jin, Haozhe Chi, Kun Xu, Kun Xu, Liwei Chen, Hao Jiang, Di Zhang, Yang Song, Kun Gai, Yadong Mu:
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance. CoRR abs/2405.14677 (2024)
[i41]
dblp key: journals/corr/abs-2405-14705
persistent URL: https://dblp.org/rec/journals/corr/abs-2405-14705
Sixian Zhang, Bohan Wang, Junqiang Wu, Yan Li, Tingting Gao, Di Zhang, Zhongyuan Wang:
Learning Multi-dimensional Human Preference for Text-to-Image Generation. CoRR abs/2405.14705 (2024)
[i40]
dblp key: journals/corr/abs-2405-15208
persistent URL: https://dblp.org/rec/journals/corr/abs-2405-15208
Chenxi Sun, Hongzhi Zhang, Zijia Lin, Jingyuan Zhang, Fuzheng Zhang, Zhongyuan Wang, Bin Chen, Chengru Song, Di Zhang, Kun Gai, Deyi Xiong:
Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs. CoRR abs/2405.15208 (2024)
[i39]
dblp key: journals/corr/abs-2406-00210
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-00210
Jinchao Zhu, Yuxuan Wang, Siyuan Pan, Pengfei Wan, Di Zhang, Gao Huang:
A-SDM: Accelerating Stable Diffusion through Model Assembly and Feature Inheritance Strategies. CoRR abs/2406.00210 (2024)
[i38]
dblp key: journals/corr/abs-2406-04277
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-04277
Ye Tian, Ling Yang, Haotian Yang, Yuan Gao, Yufan Deng, Jingmin Chen, Xintao Wang, Zhaochen Yu, Xin Tao, Pengfei Wan, Di Zhang, Bin Cui:
VideoTetris: Towards Compositional Text-to-Video Generation. CoRR abs/2406.04277 (2024)
[i37]
dblp key: journals/corr/abs-2406-07394
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-07394
Di Zhang, Xiaoshui Huang, Dongzhan Zhou, Yuqiang Li, Wanli Ouyang:
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B. CoRR abs/2406.07394 (2024)
[i36]
dblp key: journals/corr/abs-2406-11277
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-11277
Xiaoxue Cheng, Junyi Li, Wayne Xin Zhao, Hongzhi Zhang, Fuzheng Zhang, Di Zhang, Kun Gai, Ji-Rong Wen:
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector. CoRR abs/2406.11277 (2024)
[i35]
dblp key: journals/corr/abs-2406-19905
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-19905
Longrong Yang, Dong Sheng, Chaoxiang Cai, Fan Yang, Size Li, Di Zhang, Xi Li:
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model. CoRR abs/2406.19905 (2024)
[i34]
dblp key: journals/corr/abs-2407-03168
persistent URL: https://dblp.org/rec/journals/corr/abs-2407-03168
Jianzhu Guo, Dingyun Zhang, Xiaoqiang Liu, Zhizhou Zhong, Yuan Zhang, Pengfei Wan, Di Zhang:
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control. CoRR abs/2407.03168 (2024)
[i33]
dblp key: journals/corr/abs-2407-13976
persistent URL: https://dblp.org/rec/journals/corr/abs-2407-13976
Shuo Huang, Shikun Sun, Zixuan Wang, Xiaoyu Qin, Yanmin Xiong, Yuan Zhang, Pengfei Wan, Di Zhang, Jia Jia:
PlacidDreamer: Advancing Harmony in Text-to-3D Generation. CoRR abs/2407.13976 (2024)
[i32]
dblp key: journals/corr/abs-2407-14177
persistent URL: https://dblp.org/rec/journals/corr/abs-2407-14177
Kaibing Chen, Dong Shen, Hanwen Zhong, Huasong Zhong, Kui Xia, Di Xu, Wei Yuan, Yifei Hu, Bin Wen, Tianke Zhang, Changyi Liu, Dewen Fan, Huihui Xiao, Jiahong Wu, Fan Yang, Size Li, Di Zhang:
EVLM: An Efficient Vision-Language Model for Visual Understanding. CoRR abs/2407.14177 (2024)
[i31]
dblp key: journals/corr/abs-2408-06614
persistent URL: https://dblp.org/rec/journals/corr/abs-2408-06614
Liangdong Qiu, Chengxing Yu, Yanran Li, Zhao Wang, Haibin Huang, Chongyang Ma, Di Zhang, Pengfei Wan, Xiaoguang Han:
ViMo: Generating Motions from Casual Videos. CoRR abs/2408.06614 (2024)
[i30]
dblp key: journals/corr/abs-2408-07246
persistent URL: https://dblp.org/rec/journals/corr/abs-2408-07246
Junxian Li, Di Zhang, Xunzhi Wang, Zeying Hao, Jingdi Lei, Qian Tan, Cai Zhou, Wei Liu, Yaotian Yang, Xinrui Xiong, Weiyun Wang, Zhe Chen, Wenhai Wang, Wei Li, Shufei Zhang, Mao Su, Wanli Ouyang, Yuqiang Li, Dongzhan Zhou:
ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area. CoRR abs/2408.07246 (2024)
[i29]
dblp key: journals/corr/abs-2408-11813
persistent URL: https://dblp.org/rec/journals/corr/abs-2408-11813
Yuanyang Yin, Yaqi Zhao, Yajie Zhang, Ke Lin, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Baoqun Yin, Wentao Zhang:
SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs. CoRR abs/2408.11813 (2024)
[i28]
dblp key: journals/corr/abs-2409-14710
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-14710
Yihong Tang, Jiao Ou, Che Liu, Fuzheng Zhang, Di Zhang, Kun Gai:
ERABAL: Enhancing Role-Playing Agents through Boundary-Aware Learning. CoRR abs/2409.14710 (2024)
[i27]
dblp key: journals/corr/abs-2409-16863
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-16863
Yujian Zheng, Yuda Qiu, Leyang Jin, Chongyang Ma, Haibin Huang, Di Zhang, Pengfei Wan, Xiaoguang Han:
Towards Unified 3D Hair Reconstruction from Single-View Portraits. CoRR abs/2409.16863 (2024)
[i26]
dblp key: journals/corr/abs-2409-19532
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-19532
Xiao Wang, Jianlong Wu, Zijia Lin, Fuzheng Zhang, Di Zhang, Liqiang Nie:
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding. CoRR abs/2409.19532 (2024)
[i25]
dblp key: journals/corr/abs-2410-02884
persistent URL: https://dblp.org/rec/journals/corr/abs-2410-02884
Di Zhang, Jianbo Wu, Jingdi Lei, Tong Che, Jiatong Li, Tong Xie, Xiaoshui Huang, Shufei Zhang, Marco Pavone, Yuqiang Li, Wanli Ouyang, Dongzhan Zhou:
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning. CoRR abs/2410.02884 (2024)
[i24]
dblp key: journals/corr/abs-2410-08260
persistent URL: https://dblp.org/rec/journals/corr/abs-2410-08260
Qiuheng Wang, Yukai Shi, Jiarong Ou, Rui Chen, Ke Lin, Jiahao Wang, Boyuan Jiang, Haotian Yang, Mingwu Zheng, Xin Tao, Fei Yang, Pengfei Wan, Di Zhang:
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content. CoRR abs/2410.08260 (2024)
[i23]
dblp key: journals/corr/abs-2411-04799
persistent URL: https://dblp.org/rec/journals/corr/abs-2411-04799
Xingyu Lu, Yuhang Hu, Changyi Liu, Tianke Zhang, Zhenyu Yang, Zhixiang Ding, Shengsheng Qian, Meng Du, Ruiwen Kang, Kaiyu Tang, Fan Yang, Tingting Gao, Di Zhang, Hai-Tao Zheng, Bin Wen:
Kwai-STaR: Transform LLMs into State-Transition Reasoners. CoRR abs/2411.04799 (2024)
[i22]
dblp key: journals/corr/abs-2411-13154
persistent URL: https://dblp.org/rec/journals/corr/abs-2411-13154
Zhicong Li, Jiahao Wang, Zhishu Jiang, Hangyu Mao, Zhongxia Chen, Jiazhen Du, Yuanxing Zhang, Fuzheng Zhang, Di Zhang, Yong Liu:
DMQR-RAG: Diverse Multi-Query Rewriting for RAG. CoRR abs/2411.13154 (2024)
[i21]
dblp key: journals/corr/abs-2411-14423
persistent URL: https://dblp.org/rec/journals/corr/abs-2411-14423
Zhuoman Liu, Weicai Ye, Yan Luximon, Pengfei Wan, Di Zhang:
Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation. CoRR abs/2411.14423 (2024)
[i20]
dblp key: journals/corr/abs-2411-14721
persistent URL: https://dblp.org/rec/journals/corr/abs-2411-14721
Jiatong Li, Yunqing Liu, Wei Liu, Jingdi Lei, Di Zhang, Wenqi Fan, Dongzhan Zhou, Yuqiang Li, Qing Li:
MolReFlect: Towards In-Context Fine-grained Alignments between Molecules and Texts. CoRR abs/2411.14721 (2024)
[i19]
dblp key: journals/corr/abs-2411-15260
persistent URL: https://dblp.org/rec/journals/corr/abs-2411-15260
Jiahao Hu, Tianxiong Zhong, Xuebo Wang, Boyuan Jiang, Xingye Tian, Fei Yang, Pengfei Wan, Di Zhang:
VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing. CoRR abs/2411.15260 (2024)
[i18]
dblp key: journals/corr/abs-2411-16201
persistent URL: https://dblp.org/rec/journals/corr/abs-2411-16201
Hao Yi, Qingyang Li, Yulan Hu, Fuzheng Zhang, Di Zhang, Yong Liu:
Video-Text Dataset Construction from Multi-AI Feedback: Promoting Weak-to-Strong Preference Learning for Video Large Language Models. CoRR abs/2411.16201 (2024)
[i17]
dblp key: journals/corr/abs-2411-17470
persistent URL: https://dblp.org/rec/journals/corr/abs-2411-17470
Yuanyang Yin, Yaqi Zhao, Mingwu Zheng, Ke Lin, Jiarong Ou, Rui Chen, Victor Shea-Jay Huang, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Baoqun Yin, Wentao Zhang, Kun Gai:
Towards Precise Scaling Laws for Video Diffusion Transformers. CoRR abs/2411.17470 (2024)
[i16]
dblp key: journals/corr/abs-2411-18203
persistent URL: https://dblp.org/rec/journals/corr/abs-2411-18203
Di Zhang, Junxian Li, Jingdi Lei, Xunzhi Wang, Yujie Liu, Zonglin Yang, Jiatong Li, Weida Wang, Suorong Yang, Jianbo Wu, Peng Ye, Wanli Ouyang, Dongzhan Zhou:
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning. CoRR abs/2411.18203 (2024)
[i15]
dblp key: journals/corr/abs-2412-07171
persistent URL: https://dblp.org/rec/journals/corr/abs-2412-07171
Haoran Lian, Junmin Chen, Wei Huang, Yizhe Xiong, Wenping Hu, Guiguang Ding, Hui Chen, Jianwei Niu, Zijia Lin, Fuzheng Zhang, Di Zhang:
Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models. CoRR abs/2412.07171 (2024)
[i14]
dblp key: journals/corr/abs-2412-07744
persistent URL: https://dblp.org/rec/journals/corr/abs-2412-07744
Zixuan Ye, Huijuan Huang, Xintao Wang, Pengfei Wan, Di Zhang, Wenhan Luo:
StyleMaster: Stylize Your Video with Artistic Generation and Translation. CoRR abs/2412.07744 (2024)
[i13]
dblp key: journals/corr/abs-2412-07759
persistent URL: https://dblp.org/rec/journals/corr/abs-2412-07759
Xiao Fu, Xian Liu, Xintao Wang, Sida Peng, Menghan Xia, Xiaoyu Shi, Ziyang Yuan, Pengfei Wan, Di Zhang, Dahua Lin:
3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation. CoRR abs/2412.07759 (2024)
[i12]
dblp key: journals/corr/abs-2412-07760
persistent URL: https://dblp.org/rec/journals/corr/abs-2412-07760
Jianhong Bai, Menghan Xia, Xintao Wang, Ziyang Yuan, Xiao Fu, Zuozhu Liu, Haoji Hu, Pengfei Wan, Di Zhang:
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints. CoRR abs/2412.07760 (2024)
[i11]
dblp key: journals/corr/abs-2412-09600
persistent URL: https://dblp.org/rec/journals/corr/abs-2412-09600
Yuanhui Huang, Wenzhao Zheng, Yuan Gao, Xin Tao, Pengfei Wan, Di Zhang, Jie Zhou, Jiwen Lu:
Owl-1: Omni World Model for Consistent Long Video Generation. CoRR abs/2412.09600 (2024)
[i10]
dblp key: journals/corr/abs-2412-19191
persistent URL: https://dblp.org/rec/journals/corr/abs-2412-19191
Haonan He, Yuchen Ren, Yining Tang, Ziyang Xu, Junxian Li, Minghao Yang, Di Zhang, Dong Yuan, Tao Chen, Shufei Zhang, Yuqiang Li, Nanqing Dong, Wanli Ouyang, Dongzhan Zhou, Peng Ye:
Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models. CoRR abs/2412.19191 (2024)
2023
[c4]
dblp key: conf/mm/ChenYTCSZ23
persistent URL: https://dblp.org/rec/conf/mm/ChenYTCSZ23
Jue Chen, Huan Yuan, Jianchao Tan, Bin Chen, Chengru Song, Di Zhang:
Resource Constrained Model Compression via Minimax Optimization for Spiking Neural Networks. ACM Multimedia 2023: 5204-5213
[i9]
Yang Jin, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chenyi Lei, An Liu, Chengru Song, Xiaoqiang Lei, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu:
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization. CoRR abs/2309.04669 (2023)
[i8]
Yuchong Sun, Che Liu, Jinwen Huang, Ruihua Song, Fuzheng Zhang, Di Zhang, Zhongyuan Wang, Kun Gai:
Parrot: Enhancing Multi-Turn Chat Models by Learning to Ask Questions. CoRR abs/2310.07301 (2023)
[i7]
Jia-Yi Fu, Lei Lin, Xiaoyang Gao, Pengli Liu, Zhengzong Chen, Zhirui Yang, Shengnan Zhang, Xue Zheng, Yan Li, Yuliang Liu, Xucheng Ye, Yiqiao Liao, Chao Liao, Bin Chen, Chengru Song, Junchen Wan, Zijia Lin, Fuzheng Zhang, Zhongyuan Wang, Di Zhang, Kun Gai:
KwaiYiiMath: Technical Report. CoRR abs/2310.07488 (2023)
[i6]
Jiao Ou, Junda Lu, Che Liu, Yihong Tang, Fuzheng Zhang, Di Zhang, Zhongyuan Wang, Kun Gai:
DialogBench: Evaluating LLMs as Human-like Dialogue Systems. CoRR abs/2311.01677 (2023)
[i5]
Lei Lin, Jia-Yi Fu, Pengli Liu, Junchen Wan, Fuzheng Zhang, Zhongyuan Wang, Di Zhang, Kun Gai:
Ask One More Time: Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios. CoRR abs/2311.08154 (2023)
[i4]
Xun Guo, Mingwu Zheng, Liang Hou, Yuan Gao, Yufan Deng, Chongyang Ma, Weiming Hu, Zhengjun Zha, Haibin Huang, Pengfei Wan, Di Zhang:
I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models. CoRR abs/2312.16693 (2023)
2022
[c3]
Zhirong Xu, Shiyang Wen, Junshan Wang, Guojun Liu, Liang Wang, Zhi Yang, Lei Ding, Yan Zhang, Di Zhang, Jian Xu, Bo Zheng:
AMCAD: Adaptive Mixed-Curvature Representation based Advertisement Retrieval System. ICDE 2022: 3439-3452
[c2]
Yuanxing Zhang, Langshi Chen, Siran Yang, Man Yuan, Huimin Yi, Jie Zhang, Jiamang Wang, Jianbo Dong, Yunlong Xu, Yue Song, Yong Li, Di Zhang, Wei Lin, Lin Qu, Bo Zheng:
PICASSO: Unleashing the Potential of GPU-centric Training for Wide-and-deep Recommender Systems. ICDE 2022: 3453-3466
[i3]
Zhirong Xu, Shiyang Wen, Junshan Wang, Guojun Liu, Liang Wang, Zhi Yang, Lei Ding, Yan Zhang, Di Zhang, Jian Xu, Bo Zheng:
AMCAD: Adaptive Mixed-Curvature Representation based Advertisement Retrieval System. CoRR abs/2203.14683 (2022)
[i2]
Yuanxing Zhang, Langshi Chen, Siran Yang, Man Yuan, Huimin Yi, Jie Zhang, Jiamang Wang, Jianbo Dong, Yunlong Xu, Yue Song, Yong Li, Di Zhang, Wei Lin, Lin Qu, Bo Zheng:
PICASSO: Unleashing the Potential of GPU-centric Training for Wide-and-deep Recommender Systems. CoRR abs/2204.04903 (2022)
2021
[c1]
Shiyang Wen, Yiran Chen, Zhi Yang, Yan Zhang, Di Zhang, Liang Wang, Bo Zheng:
SMAD: Scalable Multi-view Ad Retrieval System for E-Commerce Sponsored Search. CIKM 2021: 3543-3547
[i1]
An Yang, Junyang Lin, Rui Men, Chang Zhou, Le Jiang, Xianyan Jia, Ang Wang, Jie Zhang, Jiamang Wang, Yong Li, Di Zhang, Wei Lin, Lin Qu, Jingren Zhou, Hongxia Yang:
Exploring Sparse Expert Models and Beyond. CoRR abs/2105.15082 (2021)
