


Di Zhang 0026
Person information
- affiliation: Kuaishou Technology, Beijing, China
Other persons with the same name
- Di Zhang — disambiguation page
- Di Zhang 0001 — Guangdong Medical College, School of Information Engineering, Dongguan, China (and 2 more)
- Di Zhang 0002 — Zhengzhou University, School of Information Engineering, China (and 2 more)
- Di Zhang 0003 — Waseda University, Graduate School of Advanced Science and Engineering, Tokyo, Japan
- Di Zhang 0004 — University of Jyväskylä, Faculty of Information Technology, Finland
- Di Zhang 0005 — Air Force Engineering University, Information and Navigation College, Xi'an, China
- Di Zhang 0006 — Anhui University, College of Computer Science and Technology, Hefei, China
- Di Zhang 0007 — Tsinghua University, State Key Laboratory of Hydroscience and Engineering, Beijing, China
- Di Zhang 0008 — Wuhan University of Technology, National Engineering Research Centre for Water Transport Safety, Wuhan, China
- Di Zhang 0009 — University of Waterloo, Ontario, Canada
- Di Zhang 0010 — Beijing Jiaotong University, School of Software Engineering, China (and 1 more)
- Di Zhang 0011 — Army Medical University (Third Military Medical University), Department of Information, Xinqiao Hospital, Chongqing, China (and 2 more)
- Di Zhang 0012 — Naval Postgraduate School, Department of Electrical and Computer Engineering, Monterey, CA, USA
- Di Zhang 0013 — University of Science and Technology of China, Hefei, China
- Di Zhang 0014 — Wuhan University, School of Geodesy and Geomatics, China
- Di Zhang 0015 — University of North Carolina at Charlotte, NC, USA
- Di Zhang 0016 — Hikvision Digital Technology Co., Ltd., Hangzhou, China
- Di Zhang 0017 — Huazhong University of Science and Technology, School of Physics, Wuhan, China
- Di Zhang 0018 — University of Electronic Science and Technology of China, School of Information and Software Engineering, Chengdu, China
- Di Zhang 0019 — Northwest Normal University, College of Computer Science and Engineering, Lanzhou, China
- Di Zhang 0020 — China University of Mining and Technology, School of Computer Science and Technology, Xuzhou, China
- Di Zhang 0021 — National University of Defense Technology, College of Meteorology and Oceanography, Changsha, China (and 1 more)
- Di Zhang 0022 — East China Normal University, MoE Key Laboratory of Geographic Information Science, Shanghai, China
- Di Zhang 0023 — China Jiliang University, National and Local Joint Engineering Laboratory of Disaster Monitoring Technology and Instruments, Hangzhou, China
- Di Zhang 0024 — Beijing Technology and Business University, School of Computer and Artificial Intelligent, China
- Di Zhang 0025 — South China University of Technology, School of Electric Power Engineering, Guangzhou, China (and 1 more)
- Di Zhang 0027 — Macao Polytechnic University, Faculty of Humanities and Social Sciences, Macao
- Di Zhang 0028 — Shenyang University of Technology, School of Artificial Intelligence, Liaoning, China (and 1 more)
- Di Zhang 0029 — Beijing Jiaotong University, Department of Electrical Engineering, China
- Di Zhang 0030 — Northwest Normal University, School of Educational Technology, Lanzhou, China
- Di Zhang 0031 — Xi'an Jiaotong-Liverpool University, Suzhou, China (and 1 more)
2020 – today
- 2025
- [j1]Xiao Wang, Jianlong Wu, Zijia Lin, Fuzheng Zhang, Di Zhang, Liqiang Nie:
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding. IEEE Trans. Pattern Anal. Mach. Intell. 47(4): 2912-2923 (2025) - [c38]Junxian Li, Di Zhang, Xunzhi Wang, Zeying Hao, Jingdi Lei, Qian Tan, Cai Zhou, Wei Liu, Yaotian Yang, Xinrui Xiong, Weiyun Wang, Zhe Chen, Wenhai Wang, Wei Li, Mao Su, Shufei Zhang, Wanli Ouyang, Yuqiang Li, Dongzhan Zhou:
ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area. AAAI 2025: 415-423 - [c37]Xinlong Chen, Yuanxing Zhang, Chongling Rao, Yushuo Guan, Jiaheng Liu, Fuzheng Zhang, Chengru Song, Qiang Liu, Di Zhang, Tieniu Tan:
VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation. ACL (Findings) 2025: 8543-8563 - [c36]Xiao Wang, Jingyun Hua, Weihong Lin, Yuanxing Zhang, Fuzheng Zhang, Jianlong Wu, Di Zhang, Liqiang Nie:
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models. ACL (1) 2025: 10158-10181 - [c35]Jiaze Li, Yaya Shi, Zongyang Ma, Haoran Xu, Yandong Bai, Huihui Xiao, Ruiwen Kang, Fan Yang, Tingting Gao, Di Zhang:
iMOVE: Instance-Motion-Aware Video Understanding. ACL (Findings) 2025: 23959-23975 - [c34]Haoran Lian, Junmin Chen, Wei Huang, Yizhe Xiong, Wenping Hu, Guiguang Ding, Hui Chen, Jianwei Niu, Zijia Lin, Fuzheng Zhang, Di Zhang:
Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models. COLING 2025: 4897-4909 - [c33]Wei-Qi Feng, Dong Han, Ze-Kang Zhou, Shunkai Li, Xiaoqiang Liu, Pengfei Wan, Di Zhang, Miao Wang:
GPAvatar: High-fidelity Head Avatars by Learning Efficient Gaussian Projections. CVPR 2025: 250-259 - [c32]Zixuan Ye, Huijuan Huang, Xintao Wang, Pengfei Wan, Di Zhang, Wenhan Luo:
StyleMaster: Stylize Your Video with Artistic Generation and Translation. CVPR 2025: 2630-2640 - [c31]Qiuheng Wang, Yukai Shi, Jiarong Ou, Rui Chen, Ke Lin, Jiahao Wang, Boyuan Jiang, Haotian Yang, Mingwu Zheng, Xin Tao, Fei Yang, Pengfei Wan, Di Zhang:
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content. CVPR 2025: 8428-8437 - [c30]Di Zhang, Jingdi Lei, Junxian Li, Xunzhi Wang, Yujie Liu, Zonglin Yang, Jiatong Li, Weida Wang, Suorong Yang, Jianbo Wu, Peng Ye, Wanli Ouyang, Dongzhan Zhou:
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning. CVPR 2025: 9050-9061 - [c29]Zhuoman Liu, Weicai Ye, Yan Luximon, Pengfei Wan, Di Zhang:
Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation. CVPR 2025: 11016-11025 - [c28]Shian Du, Menghan Xia, Chang Liu, Xintao Wang, Jing Wang, Pengfei Wan, Di Zhang, Xiangyang Ji:
PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution. CVPR 2025: 17799-17809 - [c27]Yuanyang Yin, Yaqi Zhao, Mingwu Zheng, Ke Lin, Jiarong Ou, Rui Chen, Victor Shea-Jay Huang, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Baoqun Yin, Wentao Zhang, Kun Gai:
Towards Precise Scaling Laws for Video Diffusion Transformers. CVPR 2025: 18155-18165 - [c26]Feng-Lin Liu, Hongbo Fu, Xintao Wang, Weicai Ye, Pengfei Wan, Di Zhang, Lin Gao:
SketchVideo: Sketch-based Video Generation and Editing. CVPR 2025: 23379-23390 - [c25]Jianhong Bai, Menghan Xia, Xintao Wang, Ziyang Yuan, Zuozhu Liu, Haoji Hu, Pengfei Wan, Di Zhang:
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints. ICLR 2025 - [c24]Jiankang Chen, Tianke Zhang, Changyi Liu, Haojie Ding, Yaya Shi, Cheng Feng, Huihui Xiao, Bin Wen, Fan Yang, Tingting Gao, Di Zhang:
TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types. ICLR 2025 - [c23]Hejia Chen, Haoxian Zhang, Shoulong Zhang, Xiaoqiang Liu, Sisi Zhuang, Yuan Zhang, Pengfei Wan, Di Zhang, Shuai Li:
Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control. ICLR 2025 - [c22]Qi Fan, Xin Tao, Lei Ke, Mingqiao Ye, Di Zhang, Pengfei Wan, Yu-Wing Tai, Chi-Keung Tang:
Stable Segment Anything Model. ICLR 2025 - [c21]Xiao Fu, Xian Liu, Xintao Wang, Sida Peng, Menghan Xia, Xiaoyu Shi, Ziyang Yuan, Pengfei Wan, Di Zhang, Dahua Lin:
3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation. ICLR 2025 - [c20]Di Zhang, Jianbo Wu, Jingdi Lei, Tong Che, Jiatong Li, Tong Xie, Xiaoshui Huang, Shufei Zhang, Marco Pavone, Yuqiang Li, Wanli Ouyang, Dongzhan Zhou:
LLaMA-Berry: Pairwise Optimization for Olympiad-level Mathematical Reasoning via O1-like Monte Carlo Tree Search. NAACL (Long Papers) 2025: 7315-7337 - [i101]Yuzhou Huang, Ziyang Yuan, Quande Liu, Qiulin Wang, Xintao Wang, Ruimao Zhang, Pengfei Wan, Di Zhang, Kun Gai:
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning. CoRR abs/2501.04698 (2025) - [i100]Jiwen Yu, Yiran Qin, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu:
GameFactory: Creating New Games with Generative Interactive Videos. CoRR abs/2501.08325 (2025) - [i99]Jie Liu, Gongye Liu, Jiajun Liang, Ziyang Yuan, Xiaokun Liu, Mingwu Zheng, Xiele Wu, Qiulin Wang, Wenyu Qin, Menghan Xia, Xintao Wang, Xiaohong Liu, Fei Yang, Pengfei Wan, Di Zhang, Kun Gai, Yujiu Yang, Wanli Ouyang:
Improving Video Generation with Human Feedback. CoRR abs/2501.13918 (2025) - [i98]Qinghe Wang, Yawen Luo, Xiaoyu Shi, Xu Jia, Huchuan Lu, Tianfan Xue, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai:
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation. CoRR abs/2502.08639 (2025) - [i97]Jiankang Chen, Tianke Zhang, Changyi Liu, Haojie Ding, Yaya Shi, Feng Cheng, Huihui Xiao, Bin Wen, Fan Yang, Tingting Gao, Di Zhang:
TaskGalaxy: Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types. CoRR abs/2502.09925 (2025) - [i96]Yifan Zhang, Tao Yu, Haochen Tian, Chaoyou Fu, Peiyan Li, Jianshu Zeng, Wulin Xie, Yang Shi, Huanyu Zhang, Junkang Wu, Xue Wang, Yibo Hu, Bin Wen, Fan Yang, Zhang Zhang, Tingting Gao, Di Zhang, Liang Wang, Rong Jin, Tieniu Tan:
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment. CoRR abs/2502.10391 (2025) - [i95]Jiaze Li, Yaya Shi, Zongyang Ma, Haoran Xu, Feng Cheng, Huihui Xiao, Ruiwen Kang, Fan Yang, Tingting Gao, Di Zhang:
iMOVE: Instance-Motion-Aware Video Understanding. CoRR abs/2502.11594 (2025) - [i94]Minxuan Lv, Zhenpeng Su, Leiyu Pan, Yizhe Xiong, Zijia Lin, Hui Chen, Wei Zhou, Jungong Han, Guiguang Ding, Cheng Luo, Di Zhang, Kun Gai, Songlin Hu:
DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs. CoRR abs/2502.12455 (2025) - [i93]Xinlong Chen, Yuanxing Zhang, Chongling Rao, Yushuo Guan, Jiaheng Liu, Fuzheng Zhang, Chengru Song, Qiang Liu, Di Zhang, Tieniu Tan:
VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation. CoRR abs/2502.12782 (2025) - [i92]Leiyu Pan, Zhenpeng Su, Minxuan Lv, Yizhe Xiong, Xiangwen Zhang, Zijia Lin, Hui Chen, Jungong Han, Guiguang Ding, Cheng Luo, Di Zhang, Kun Gai, Deyi Xiong:
Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts. CoRR abs/2502.12928 (2025) - [i91]Borui Liao, Yulong Xu, Jiao Ou, Kaiyuan Yang, Weihua Jian, Pengfei Wan, Di Zhang:
FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems. CoRR abs/2502.13472 (2025) - [i90]Hao Yi, Qingyang Li, Yulan Hu, Fuzheng Zhang, Di Zhang, Yong Liu:
SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin. CoRR abs/2502.13516 (2025) - [i89]Xiao Wang, Jingyun Hua, Weihong Lin, Yuanxing Zhang, Fuzheng Zhang, Jianlong Wu, Di Zhang, Liqiang Nie:
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models. CoRR abs/2502.20811 (2025) - [i88]Zhen Yang, Guibao Shen, Liang Hou, Mushui Liu, Luozhou Wang, Xin Tao, Pengfei Wan, Di Zhang, Ying-Cong Chen:
RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification. CoRR abs/2503.02537 (2025) - [i87]Xukun Zhou, Fengxin Li, Ming Chen, Yan Zhou, Pengfei Wan, Di Zhang, Yeying Jin, Zhaoxin Fan, Hongyan Liu, Jun He:
ExGes: Expressive Human Motion Retrieval and Modulation for Audio-Driven Gesture Synthesis. CoRR abs/2503.06499 (2025) - [i86]Haoyu Zhang, Qiaohui Chu, Meng Liu, Yunxiao Wang, Bin Wen, Fan Yang, Tingting Gao, Di Zhang, Yaowei Wang, Liqiang Nie:
Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding. CoRR abs/2503.09143 (2025) - [i85]Yunxiao Wang, Meng Liu, Rui Shao, Haoyu Zhang, Bin Wen, Fan Yang, Tingting Gao, Di Zhang, Liqiang Nie:
TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs. CoRR abs/2503.09994 (2025) - [i84]Jianhong Bai, Menghan Xia, Xiao Fu, Xintao Wang, Lianrui Mu, Jinwen Cao, Zuozhu Liu, Haoji Hu, Xiang Bai, Pengfei Wan, Di Zhang:
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video. CoRR abs/2503.11647 (2025) - [i83]Minglei Shi, Ziyang Yuan, Haotian Yang, Xintao Wang, Mingwu Zheng, Xin Tao, Wenliang Zhao, Wenzhao Zheng, Jie Zhou, Jiwen Lu, Pengfei Wan, Di Zhang, Kun Gai:
DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers. CoRR abs/2503.14487 (2025) - [i82]Hejia Chen, Haoxian Zhang, Shoulong Zhang, Xiaoqiang Liu, Sisi Zhuang, Yuan Zhang, Pengfei Wan, Di Zhang, Shuai Li:
Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control. CoRR abs/2503.14517 (2025) - [i81]Jiwen Yu, Yiran Qin, Haoxuan Che, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu:
Position: Interactive Generative Video as Next-Generation Game Engine. CoRR abs/2503.17359 (2025) - [i80]Cong Liu, Liang Hou, Mingwu Zheng, Xin Tao, Pengfei Wan, Di Zhang, Kun Gai:
Boosting Resolution Generalization of Diffusion Transformers with Randomized Positional Encodings. CoRR abs/2503.18719 (2025) - [i79]Xuan Ju, Weicai Ye, Quande Liu, Qiulin Wang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Qiang Xu:
FullDiT: Multi-Task Video Generative Foundation Model with Full Attention. CoRR abs/2503.19907 (2025) - [i78]Nan Gao, Yihua Bao, Dongdong Weng, Jiayi Zhao, Jia Li, Yan Zhou, Pengfei Wan, Di Zhang:
SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain. CoRR abs/2503.20202 (2025) - [i77]Feng-Lin Liu, Hongbo Fu, Xintao Wang, Weicai Ye, Pengfei Wan, Di Zhang, Lin Gao:
SketchVideo: Sketch-based Video Generation and Editing. CoRR abs/2503.23284 (2025) - [i76]Zhichao Liao, Xiaokun Liu, Wenyu Qin, Qingyu Li, Qiulin Wang, Pengfei Wan, Di Zhang, Long Zeng, Pingfa Feng:
HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment. CoRR abs/2503.23907 (2025) - [i75]Shengqiong Wu, Weicai Ye, Jiahao Wang, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Shuicheng Yan, Hao Fei, Tat-Seng Chua:
Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation. CoRR abs/2503.24379 (2025) - [i74]Jingyuan Zhang, Qi Wang, Xingguang Ji, Yahui Liu, Yang Yue, Fuzheng Zhang, Di Zhang, Guorui Zhou, Kun Gai:
Leanabell-Prover: Posttraining Scaling in Formal Reasoning. CoRR abs/2504.06122 (2025) - [i73]Wei Chen, Xin Yan, Bin Wen, Fan Yang, Tingting Gao, Di Zhang, Long Chen:
Decoupling Contrastive Decoding: Robust Hallucination Mitigation in Multimodal Large Language Models. CoRR abs/2504.08809 (2025) - [i72]Yang Shi, Jiaheng Liu, Yushuo Guan, Zhenhua Wu, Yuanxing Zhang, Zihao Wang, Weihong Lin, Jingyun Hua, Zekun Wang, Xinlong Chen, Bohan Zeng, Wentao Zhang, Fuzheng Zhang, Wenjing Yang, Di Zhang:
Mavors: Multi-granularity Video Representation for Multimodal Large Language Model. CoRR abs/2504.10068 (2025) - [i71]Xingyu Lu, Yuhang Hu, Yifan Zhang, Kaiyu Jiang, Changyi Liu, Tianke Zhang, Jinpeng Wang, Chun Yuan, Bin Wen, Fan Yang, Tingting Gao, Di Zhang:
InstructEngine: Instruction-driven Text-to-Image Alignment. CoRR abs/2504.10329 (2025) - [i70]Jiaqi Wei, Hao Zhou, Xiang Zhang, Di Zhang, Zijie Qiu, Wei Wei, Jinzhe Li, Wanli Ouyang, Siqi Sun:
AlignRAG: An Adaptable Framework for Resolving Misalignments in Retrieval-Aware Reasoning of RAG. CoRR abs/2504.14858 (2025) - [i69]Xingyu Lu, Tianke Zhang, Chang Meng, Xiaobei Wang, Jinpeng Wang, Yifan Zhang, Shisong Tang, Changyi Liu, Haojie Ding, Kaiyu Jiang, Kaiyu Tang, Bin Wen, Hai-Tao Zheng, Fan Yang, Tingting Gao, Di Zhang, Kun Gai:
VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform. CoRR abs/2504.14904 (2025) - [i68]Jiwen Yu, Yiran Qin, Haoxuan Che, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Hao Chen, Xihui Liu:
A Survey of Interactive Generative Video. CoRR abs/2504.21853 (2025) - [i67]Jie Liu, Gongye Liu, Jiajun Liang, Yangguang Li, Jiaheng Liu, Xintao Wang, Pengfei Wan, Di Zhang, Wanli Ouyang:
Flow-GRPO: Training Flow Matching Models via Online RL. CoRR abs/2505.05470 (2025) - [i66]Haoran He, Jiajun Liang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Ling Pan:
Scaling Image and Video Generation via Test-Time Evolutionary Search. CoRR abs/2505.17618 (2025) - [i65]Wanhao Liu, Zonglin Yang, Jue Wang, Lidong Bing, Di Zhang, Dongzhan Zhou, Yuqiang Li, Houqiang Li, Erik Cambria, Wanli Ouyang:
MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback. CoRR abs/2505.17873 (2025) - [i64]Ziqiao Peng, Jiwen Liu, Haoxian Zhang, Xiaoqiang Liu, Songlin Tang, Pengfei Wan, Di Zhang, Hongyan Liu, Jun He:
OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers. CoRR abs/2505.21448 (2025) - [i63]Di Zhang, Weida Wang, Junxian Li, Xunzhi Wang, Jiatong Li, Jianbo Wu, Jingdi Lei, Haonan He, Peng Ye, Shufei Zhang, Wanli Ouyang, Yuqiang Li, Dongzhan Zhou:
Control-R: Towards controllable test-time scaling. CoRR abs/2506.00189 (2025) - [i62]Xiao Fu, Xintao Wang, Xian Liu, Jianhong Bai, Runsen Xu, Pengfei Wan, Di Zhang, Dahua Lin:
Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control. CoRR abs/2506.01943 (2025) - [i61]Yawen Luo, Jianhong Bai, Xiaoyu Shi, Menghan Xia, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Tianfan Xue:
CamCloneMaster: Enabling Reference-based Camera Control for Video Generation. CoRR abs/2506.03140 (2025) - [i60]Jiwen Yu, Jianhong Bai, Yiran Qin, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu:
Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval. CoRR abs/2506.03141 (2025) - [i59]Xuanhua He, Quande Liu, Zixuan Ye, Weicai Ye, Qiulin Wang, Xintao Wang, Qifeng Chen, Pengfei Wan, Di Zhang, Kun Gai:
FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers. CoRR abs/2506.04213 (2025) - [i58]Zixuan Ye, Xuanhua He, Quande Liu, Qiulin Wang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Qifeng Chen, Wenhan Luo:
UNIC: Unified In-Context Video Editing. CoRR abs/2506.04216 (2025) - [i57]Chuhao Jin, Haosen Li, Bingzi Zhang, Che Liu, Xiting Wang, Ruihua Song, Wenbing Huang, Ying Qin, Fuzheng Zhang, Di Zhang:
PlanMoGPT: Flow-Enhanced Progressive Planning for Text to Motion Synthesis. CoRR abs/2506.17912 (2025) - [i56]Jun Wang, Xijuan Zeng, Chunyu Qiang, Ruilong Chen, Shiyao Wang, Le Wang, Wangjing Zhou, Pengfei Cai, Jiahui Zhao, Nan Li, Zihan Li, Yuzhe Liang, Xiaopeng Wang, Haorui Zheng, Ming Wen, Kang Yin, Yiran Wang, Nan Li, Feng Deng, Liang Dong, Chen Zhang, Di Zhang, Kun Gai:
Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation. CoRR abs/2506.19774 (2025) - [i55]Jianzong Wu, Liang Hou, Haotian Yang, Xin Tao, Ye Tian, Pengfei Wan, Di Zhang, Yunhai Tong:
VMoBA: Mixture-of-Block Attention for Video Diffusion Models. CoRR abs/2506.23858 (2025) - [i54]Yukai Shi, Jiarong Ou, Rui Chen, Haotian Yang, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Kun Gai:
Imbalance in Balance: Online Concept Balancing in Generation Models. CoRR abs/2507.13345 (2025) - [i53]Le Wang, Jun Wang, Chunyu Qiang, Feng Deng, Chen Zhang, Di Zhang, Kun Gai:
AudioGen-Omni: A Unified Multimodal Diffusion Transformer for Video-Synchronized Audio, Speech, and Song Generation. CoRR abs/2508.00733 (2025) - [i52]Liang Hou, Yuan Gao, Boyuan Jiang, Xin Tao, Qi Yan, Renjie Liao, Pengfei Wan, Di Zhang, Kun Gai:
Score Augmentation for Diffusion Models. CoRR abs/2508.07926 (2025) - [i51]Jiatong Li, Weida Wang, Qinggang Zhang, Junxian Li, Di Zhang, Changmeng Zheng, Shufei Zhang, Xiaoyong Wei, Qing Li:
Mol-R1: Towards Explicit Long-CoT Reasoning in Molecule Discovery. CoRR abs/2508.08401 (2025) - [i50]Can Jin, Yang Zhou, Qixin Zhang, Hongwu Peng, Di Zhang, Marco Pavone, Ligong Han, Zhang-Wei Hong, Tong Che, Dimitris N. Metaxas:
Your Reward Function for RL is Your Best PRM for Search: Unifying RL and Search-Based TTS. CoRR abs/2508.14313 (2025)
- 2024
- [c19]Lei Lin, Jia-Yi Fu, Pengli Liu, Qingyang Li, Yan Gong, Junchen Wan, Fuzheng Zhang, Zhongyuan Wang, Di Zhang, Kun Gai:
Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios. ACL (Findings) 2024: 3829-3852 - [c18]Zhipeng Chen, Kun Zhou, Xin Zhao, Junchen Wan, Fuzheng Zhang, Di Zhang, Ji-Rong Wen:
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint. ACL (Findings) 2024: 5694-5711 - [c17]Yuchong Sun, Che Liu, Kun Zhou, Jinwen Huang, Ruihua Song, Xin Zhao, Fuzheng Zhang, Di Zhang, Kun Gai:
Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models. ACL (1) 2024: 9729-9750 - [c16]Chenxi Sun, Hongzhi Zhang, Zijia Lin, Jingyuan Zhang, Fuzheng Zhang, Zhongyuan Wang, Bin Chen, Chengru Song, Di Zhang, Kun Gai, Deyi Xiong:
Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs. LREC/COLING 2024: 4476-4487 - [c15]Sixian Zhang, Bohan Wang, Junqiang Wu, Yan Li, Tingting Gao, Di Zhang, Zhongyuan Wang:
Learning Multi-Dimensional Human Preference for Text-to-Image Generation. CVPR 2024: 8018-8027 - [c14]Xiaoxue Cheng, Junyi Li, Xin Zhao, Hongzhi Zhang, Fuzheng Zhang, Di Zhang, Kun Gai, Ji-Rong Wen:
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector. EMNLP 2024: 14600-14615 - [c13]Jiao Ou, Jiayu Wu, Che Liu, Fuzheng Zhang, Di Zhang, Kun Gai:
Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues. EMNLP 2024: 17402-17431 - [c12]Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chengru Song, Dai Meng, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu:
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization. ICLR 2024 - [c11]Yang Jin, Zhicheng Sun, Kun Xu, Kun Xu, Liwei Chen, Hao Jiang, Quzhe Huang, Chengru Song, Yuliang Liu, Di Zhang, Yang Song, Kun Gai, Yadong Mu:
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization. ICML 2024 - [c10]Shuo Huang, Shikun Sun, Zixuan Wang, Xiaoyu Qin, Yanmin Xiong, Yuan Zhang, Pengfei Wan, Di Zhang, Jia Jia:
PlacidDreamer: Advancing Harmony in Text-to-3D Generation. ACM Multimedia 2024: 6880-6889 - [c9]Jiao Ou, Junda Lu, Che Liu, Yihong Tang, Fuzheng Zhang, Di Zhang, Kun Gai:
DialogBench: Evaluating LLMs as Human-like Dialogue Systems. NAACL-HLT 2024: 6137-6170 - [c8]Ye Tian, Ling Yang, Haotian Yang, Yuan Gao, Yufan Deng, Xintao Wang, Zhaochen Yu, Xin Tao, Pengfei Wan, Di Zhang, Bin Cui:
VideoTetris: Towards Compositional Text-to-Video Generation. NeurIPS 2024 - [c7]Xun Guo, Mingwu Zheng, Liang Hou, Yuan Gao, Yufan Deng, Pengfei Wan, Di Zhang, Yufan Liu, Weiming Hu, Zhengjun Zha, Haibin Huang, Chongyang Ma:
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models. SIGGRAPH (Conference Paper Track) 2024: 112 - [c6]Shiyuan Yang, Liang Hou, Haibin Huang, Chongyang Ma, Pengfei Wan, Di Zhang, Xiaodong Chen, Jing Liao:
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion. SIGGRAPH (Conference Paper Track) 2024: 113 - [c5]Yujian Zheng, Yuda Qiu, Leyang Jin, Chongyang Ma, Haibin Huang, Di Zhang, Pengfei Wan, Xiaoguang Han:
Towards Unified 3D Hair Reconstruction from Single-View Portraits. SIGGRAPH Asia 2024: 114:1-114:11 - [i49]Zhipeng Chen, Kun Zhou, Wayne Xin Zhao, Junchen Wan, Fuzheng Zhang, Di Zhang, Ji-Rong Wen:
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint. CoRR abs/2401.06081 (2024) - [i48]Yang Jin, Zhicheng Sun, Kun Xu, Kun Xu, Liwei Chen, Hao Jiang, Quzhe Huang, Chengru Song, Yuliang Liu, Di Zhang, Yang Song, Kun Gai, Yadong Mu:
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization. CoRR abs/2402.03161 (2024) - [i47]Shiyuan Yang, Liang Hou, Haibin Huang, Chongyang Ma, Pengfei Wan, Di Zhang, Xiaodong Chen, Jing Liao:
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion. CoRR abs/2402.03162 (2024) - [i46]Di Zhang, Wei Liu, Qian Tan, Jingdan Chen, Hang Yan, Yuliang Yan, Jiatong Li, Weiran Huang, Xiangyu Yue, Dongzhan Zhou, Shufei Zhang, Mao Su, Hansen Zhong, Yuqiang Li, Wanli Ouyang:
ChemLLM: A Chemical Large Language Model. CoRR abs/2402.06852 (2024) - [i45]Luozhou Wang, Guibao Shen, Yixun Liang, Xin Tao, Pengfei Wan, Di Zhang, Yijun Li, Yingcong Chen:
Motion Inversion for Video Customization. CoRR abs/2403.20193 (2024) - [i44]Zhaokun Zhou, Qiulin Wang, Bin Lin, Yiwei Su, Rui Chen, Xin Tao, Amin Zheng, Li Yuan, Pengfei Wan, Di Zhang:
UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark. CoRR abs/2404.09619 (2024) - [i43]Jiao Ou, Jiayu Wu, Che Liu, Fuzheng Zhang, Di Zhang, Kun Gai:
Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues. CoRR abs/2404.11095 (2024) - [i42]Zhicheng Sun, Zhenhao Yang, Yang Jin, Haozhe Chi, Kun Xu, Kun Xu, Liwei Chen, Hao Jiang, Di Zhang, Yang Song, Kun Gai, Yadong Mu:
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance. CoRR abs/2405.14677 (2024) - [i41]Sixian Zhang, Bohan Wang, Junqiang Wu, Yan Li, Tingting Gao, Di Zhang, Zhongyuan Wang:
Learning Multi-dimensional Human Preference for Text-to-Image Generation. CoRR abs/2405.14705 (2024) - [i40]Chenxi Sun, Hongzhi Zhang, Zijia Lin, Jingyuan Zhang, Fuzheng Zhang, Zhongyuan Wang, Bin Chen, Chengru Song, Di Zhang, Kun Gai, Deyi Xiong:
Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs. CoRR abs/2405.15208 (2024) - [i39]Jinchao Zhu, Yuxuan Wang, Siyuan Pan, Pengfei Wan, Di Zhang, Gao Huang:
A-SDM: Accelerating Stable Diffusion through Model Assembly and Feature Inheritance Strategies. CoRR abs/2406.00210 (2024) - [i38]Ye Tian, Ling Yang, Haotian Yang, Yuan Gao, Yufan Deng, Jingmin Chen, Xintao Wang, Zhaochen Yu, Xin Tao, Pengfei Wan, Di Zhang, Bin Cui:
VideoTetris: Towards Compositional Text-to-Video Generation. CoRR abs/2406.04277 (2024) - [i37]Di Zhang, Xiaoshui Huang, Dongzhan Zhou, Yuqiang Li, Wanli Ouyang:
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B. CoRR abs/2406.07394 (2024) - [i36]Xiaoxue Cheng, Junyi Li, Wayne Xin Zhao, Hongzhi Zhang, Fuzheng Zhang, Di Zhang, Kun Gai, Ji-Rong Wen:
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector. CoRR abs/2406.11277 (2024) - [i35]Longrong Yang, Dong Sheng, Chaoxiang Cai, Fan Yang, Size Li, Di Zhang, Xi Li:
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model. CoRR abs/2406.19905 (2024) - [i34]Jianzhu Guo, Dingyun Zhang, Xiaoqiang Liu, Zhizhou Zhong, Yuan Zhang, Pengfei Wan, Di Zhang:
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control. CoRR abs/2407.03168 (2024) - [i33]Shuo Huang, Shikun Sun, Zixuan Wang, Xiaoyu Qin, Yanmin Xiong, Yuan Zhang, Pengfei Wan, Di Zhang, Jia Jia:
PlacidDreamer: Advancing Harmony in Text-to-3D Generation. CoRR abs/2407.13976 (2024) - [i32]Kaibing Chen, Dong Shen, Hanwen Zhong, Huasong Zhong, Kui Xia, Di Xu, Wei Yuan, Yifei Hu, Bin Wen, Tianke Zhang, Changyi Liu, Dewen Fan, Huihui Xiao, Jiahong Wu, Fan Yang, Size Li, Di Zhang:
EVLM: An Efficient Vision-Language Model for Visual Understanding. CoRR abs/2407.14177 (2024) - [i31]Liangdong Qiu, Chengxing Yu, Yanran Li, Zhao Wang, Haibin Huang, Chongyang Ma, Di Zhang, Pengfei Wan, Xiaoguang Han:
ViMo: Generating Motions from Casual Videos. CoRR abs/2408.06614 (2024) - [i30]Junxian Li, Di Zhang, Xunzhi Wang, Zeying Hao, Jingdi Lei, Qian Tan, Cai Zhou, Wei Liu, Yaotian Yang, Xinrui Xiong, Weiyun Wang, Zhe Chen, Wenhai Wang, Wei Li, Shufei Zhang, Mao Su, Wanli Ouyang, Yuqiang Li, Dongzhan Zhou:
ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area. CoRR abs/2408.07246 (2024) - [i29]Yuanyang Yin, Yaqi Zhao, Yajie Zhang, Ke Lin, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Baoqun Yin, Wentao Zhang:
SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs. CoRR abs/2408.11813 (2024) - [i28]Yihong Tang, Jiao Ou, Che Liu, Fuzheng Zhang, Di Zhang, Kun Gai:
ERABAL: Enhancing Role-Playing Agents through Boundary-Aware Learning. CoRR abs/2409.14710 (2024) - [i27]Yujian Zheng, Yuda Qiu, Leyang Jin, Chongyang Ma, Haibin Huang, Di Zhang, Pengfei Wan, Xiaoguang Han:
Towards Unified 3D Hair Reconstruction from Single-View Portraits. CoRR abs/2409.16863 (2024) - [i26]Xiao Wang, Jianlong Wu, Zijia Lin, Fuzheng Zhang, Di Zhang, Liqiang Nie:
Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding. CoRR abs/2409.19532 (2024) - [i25]Di Zhang, Jianbo Wu, Jingdi Lei, Tong Che, Jiatong Li, Tong Xie, Xiaoshui Huang, Shufei Zhang, Marco Pavone, Yuqiang Li, Wanli Ouyang, Dongzhan Zhou:
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning. CoRR abs/2410.02884 (2024) - [i24]Qiuheng Wang, Yukai Shi, Jiarong Ou, Rui Chen, Ke Lin, Jiahao Wang, Boyuan Jiang, Haotian Yang, Mingwu Zheng, Xin Tao, Fei Yang, Pengfei Wan, Di Zhang:
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content. CoRR abs/2410.08260 (2024) - [i23]Xingyu Lu, Yuhang Hu, Changyi Liu, Tianke Zhang, Zhenyu Yang, Zhixiang Ding, Shengsheng Qian, Meng Du, Ruiwen Kang, Kaiyu Tang, Fan Yang, Tingting Gao, Di Zhang, Hai-Tao Zheng, Bin Wen:
Kwai-STaR: Transform LLMs into State-Transition Reasoners. CoRR abs/2411.04799 (2024) - [i22]Zhicong Li, Jiahao Wang, Zhishu Jiang, Hangyu Mao, Zhongxia Chen, Jiazhen Du, Yuanxing Zhang, Fuzheng Zhang, Di Zhang, Yong Liu:
DMQR-RAG: Diverse Multi-Query Rewriting for RAG. CoRR abs/2411.13154 (2024) - [i21]Zhuoman Liu, Weicai Ye, Yan Luximon, Pengfei Wan, Di Zhang:
Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation. CoRR abs/2411.14423 (2024) - [i20]Jiatong Li, Yunqing Liu, Wei Liu, Jingdi Lei, Di Zhang, Wenqi Fan, Dongzhan Zhou, Yuqiang Li, Qing Li:
MolReFlect: Towards In-Context Fine-grained Alignments between Molecules and Texts. CoRR abs/2411.14721 (2024) - [i19]Jiahao Hu, Tianxiong Zhong, Xuebo Wang, Boyuan Jiang, Xingye Tian, Fei Yang, Pengfei Wan, Di Zhang:
VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing. CoRR abs/2411.15260 (2024) - [i18]Hao Yi, Qingyang Li, Yulan Hu, Fuzheng Zhang, Di Zhang, Yong Liu:
Video-Text Dataset Construction from Multi-AI Feedback: Promoting Weak-to-Strong Preference Learning for Video Large Language Models. CoRR abs/2411.16201 (2024) - [i17]Yuanyang Yin, Yaqi Zhao, Mingwu Zheng, Ke Lin, Jiarong Ou, Rui Chen, Victor Shea-Jay Huang, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Baoqun Yin, Wentao Zhang, Kun Gai:
Towards Precise Scaling Laws for Video Diffusion Transformers. CoRR abs/2411.17470 (2024) - [i16]Di Zhang, Junxian Li, Jingdi Lei, Xunzhi Wang, Yujie Liu, Zonglin Yang, Jiatong Li, Weida Wang, Suorong Yang, Jianbo Wu, Peng Ye, Wanli Ouyang, Dongzhan Zhou:
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning. CoRR abs/2411.18203 (2024) - [i15]Haoran Lian, Junmin Chen, Wei Huang, Yizhe Xiong, Wenping Hu, Guiguang Ding, Hui Chen, Jianwei Niu, Zijia Lin, Fuzheng Zhang, Di Zhang:
Breaking the Stage Barrier: A Novel Single-Stage Approach to Long Context Extension for Large Language Models. CoRR abs/2412.07171 (2024) - [i14]Zixuan Ye, Huijuan Huang, Xintao Wang, Pengfei Wan, Di Zhang, Wenhan Luo:
StyleMaster: Stylize Your Video with Artistic Generation and Translation. CoRR abs/2412.07744 (2024) - [i13]Xiao Fu, Xian Liu, Xintao Wang, Sida Peng, Menghan Xia, Xiaoyu Shi, Ziyang Yuan, Pengfei Wan, Di Zhang, Dahua Lin:
3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation. CoRR abs/2412.07759 (2024) - [i12]Jianhong Bai, Menghan Xia, Xintao Wang, Ziyang Yuan, Xiao Fu, Zuozhu Liu, Haoji Hu, Pengfei Wan, Di Zhang:
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints. CoRR abs/2412.07760 (2024) - [i11]Yuanhui Huang, Wenzhao Zheng, Yuan Gao, Xin Tao, Pengfei Wan, Di Zhang, Jie Zhou, Jiwen Lu:
Owl-1: Omni World Model for Consistent Long Video Generation. CoRR abs/2412.09600 (2024) - [i10]Haonan He, Yuchen Ren, Yining Tang, Ziyang Xu, Junxian Li, Minghao Yang, Di Zhang, Dong Yuan, Tao Chen, Shufei Zhang, Yuqiang Li, Nanqing Dong, Wanli Ouyang, Dongzhan Zhou, Peng Ye:
Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models. CoRR abs/2412.19191 (2024) - 2023
- [c4]Jue Chen, Huan Yuan, Jianchao Tan, Bin Chen, Chengru Song, Di Zhang:
Resource Constrained Model Compression via Minimax Optimization for Spiking Neural Networks. ACM Multimedia 2023: 5204-5213
- [i9]Yang Jin, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chenyi Lei, An Liu, Chengru Song, Xiaoqiang Lei, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu:
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization. CoRR abs/2309.04669 (2023)
- [i8]Yuchong Sun, Che Liu, Jinwen Huang, Ruihua Song, Fuzheng Zhang, Di Zhang, Zhongyuan Wang, Kun Gai:
Parrot: Enhancing Multi-Turn Chat Models by Learning to Ask Questions. CoRR abs/2310.07301 (2023)
- [i7]Jia-Yi Fu, Lei Lin, Xiaoyang Gao, Pengli Liu, Zhengzong Chen, Zhirui Yang, Shengnan Zhang, Xue Zheng, Yan Li, Yuliang Liu, Xucheng Ye, Yiqiao Liao, Chao Liao, Bin Chen, Chengru Song, Junchen Wan, Zijia Lin, Fuzheng Zhang, Zhongyuan Wang, Di Zhang, Kun Gai:
KwaiYiiMath: Technical Report. CoRR abs/2310.07488 (2023)
- [i6]Jiao Ou, Junda Lu, Che Liu, Yihong Tang, Fuzheng Zhang, Di Zhang, Zhongyuan Wang, Kun Gai:
DialogBench: Evaluating LLMs as Human-like Dialogue Systems. CoRR abs/2311.01677 (2023)
- [i5]Lei Lin, Jia-Yi Fu, Pengli Liu, Junchen Wan, Fuzheng Zhang, Zhongyuan Wang, Di Zhang, Kun Gai:
Ask One More Time: Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios. CoRR abs/2311.08154 (2023)
- [i4]Xun Guo, Mingwu Zheng, Liang Hou, Yuan Gao, Yufan Deng, Chongyang Ma, Weiming Hu, Zhengjun Zha, Haibin Huang, Pengfei Wan, Di Zhang:
I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models. CoRR abs/2312.16693 (2023)
- 2022
- [c3]Zhirong Xu, Shiyang Wen, Junshan Wang, Guojun Liu, Liang Wang, Zhi Yang, Lei Ding, Yan Zhang, Di Zhang, Jian Xu, Bo Zheng:
AMCAD: Adaptive Mixed-Curvature Representation based Advertisement Retrieval System. ICDE 2022: 3439-3452
- [c2]Yuanxing Zhang, Langshi Chen, Siran Yang, Man Yuan, Huimin Yi, Jie Zhang, Jiamang Wang, Jianbo Dong, Yunlong Xu, Yue Song, Yong Li, Di Zhang, Wei Lin, Lin Qu, Bo Zheng:
PICASSO: Unleashing the Potential of GPU-centric Training for Wide-and-deep Recommender Systems. ICDE 2022: 3453-3466
- [i3]Zhirong Xu, Shiyang Wen, Junshan Wang, Guojun Liu, Liang Wang, Zhi Yang, Lei Ding, Yan Zhang, Di Zhang, Jian Xu, Bo Zheng:
AMCAD: Adaptive Mixed-Curvature Representation based Advertisement Retrieval System. CoRR abs/2203.14683 (2022)
- [i2]Yuanxing Zhang, Langshi Chen, Siran Yang, Man Yuan, Huimin Yi, Jie Zhang, Jiamang Wang, Jianbo Dong, Yunlong Xu, Yue Song, Yong Li, Di Zhang, Wei Lin, Lin Qu, Bo Zheng:
PICASSO: Unleashing the Potential of GPU-centric Training for Wide-and-deep Recommender Systems. CoRR abs/2204.04903 (2022)
- 2021
- [c1]Shiyang Wen, Yiran Chen, Zhi Yang, Yan Zhang, Di Zhang, Liang Wang, Bo Zheng:
SMAD: Scalable Multi-view Ad Retrieval System for E-Commerce Sponsored Search. CIKM 2021: 3543-3547
- [i1]An Yang, Junyang Lin, Rui Men, Chang Zhou, Le Jiang, Xianyan Jia, Ang Wang, Jie Zhang, Jiamang Wang, Yong Li, Di Zhang, Wei Lin, Lin Qu, Jingren Zhou, Hongxia Yang:
Exploring Sparse Expert Models and Beyond. CoRR abs/2105.15082 (2021)
last updated on 2025-10-07 23:25 CEST by the dblp team
all metadata released as open data under CC0 1.0 license