default search action
Weinan Zhang 0001
Person information
- affiliation: Shanghai Jiao Tong University, John Hopcroft Center for Computer Science, China
- affiliation: University College London, Department of Computer Science, UK
Other persons with the same name
- Weinan Zhang
- Weinan Zhang 0002 — University of Missouri, Department of Electrical Engineer and Computer Science, Columbia, USA
- Weinan Zhang 0003 (aka: Wei-Nan Zhang 0003) — Harbin Institute of Technology, Research Center for Social Computing and Information Retrieval, China (and 1 more)
- Weinan Zhang 0004 — Harbin Institute of Technology, School of Mathematics, China (and 1 more)
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j40]Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu:
A survey on model-based reinforcement learning. Sci. China Inf. Sci. 67(2) (2024) - [j39]Yan Song, He Jiang, Zheng Tian, Haifeng Zhang, Yingping Zhang, Jiangcheng Zhu, Zonghong Dai, Weinan Zhang, Jun Wang:
An Empirical Study on Google Research Football Multi-agent Scenarios. Mach. Intell. Res. 21(3): 549-570 (2024) - [j38]Weitong Ou, Bo Chen, Xinyi Dai, Weinan Zhang, Weiwen Liu, Ruiming Tang, Yong Yu:
A Survey on Bid Optimization in Real-Time Bidding Display Advertising. ACM Trans. Knowl. Discov. Data 18(3): 58:1-58:31 (2024) - [j37]Mingxing Duan, Kenli Li, Weinan Zhang, Jiarui Qin, Bin Xiao:
Attacking Click-through Rate Predictors via Generating Realistic Fake Samples. ACM Trans. Knowl. Discov. Data 18(5): 110:1-110:24 (2024) - [j36]Yunjia Xi, Weiwen Liu, Xinyi Dai, Ruiming Tang, Qing Liu, Weinan Zhang, Yong Yu:
Utility-Oriented Reranking with Counterfactual Context. ACM Trans. Knowl. Discov. Data 18(8): 193:1-193:22 (2024) - [j35]Zhengbang Zhu, Rongjun Qin, Junjie Huang, Xinyi Dai, Yang Yu, Yong Yu, Weinan Zhang:
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems. ACM Trans. Inf. Syst. 42(4): 90:1-90:32 (2024) - [j34]Lei Zheng, Huacan Chai, Xianyu Chen, Jiarui Jin, Weinan Zhang, Yong Yu, Xiaodong Guo, Can Ge, Ziming Feng:
Search-based Time-aware Graph-enhanced Recommendation with Sequential Behavior Data. Trans. Recomm. Syst. 2(4): 26:1-26:29 (2024) - [c235]Yan Song, He Jiang, Haifeng Zhang, Zheng Tian, Weinan Zhang, Jun Wang:
Boosting Studies of Multi-Agent Reinforcement Learning on Google Research Football Environment: The Past, Present, and Future. AAMAS 2024: 1772-1781 - [c234]Zhenyu Mu, Jianghao Lin, Xiaoyu Zhu, Weinan Zhang, Yong Yu:
Invariant Graph Contrastive Learning for Mitigating Neighborhood Bias in Graph Neural Network Based Recommender Systems. ICANN (5) 2024: 143-158 - [c233]Xinghang Li, Minghuan Liu, Hanbo Zhang, Cunjun Yu, Jie Xu, Hongtao Wu, Chilam Cheang, Ya Jing, Weinan Zhang, Huaping Liu, Hang Li, Tao Kong:
Vision-Language Foundation Models as Effective Robot Imitators. ICLR 2024 - [c232]Liyuan Mao, Haoran Xu, Weinan Zhang, Xianyuan Zhan:
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update. ICLR 2024 - [c231]Yuxuan Mu, Shihao Zou, Kangning Yin, Zheng Tian, Li Cheng, Weinan Zhang, Jun Wang:
RACon: Retrieval-Augmented Simulated Character Locomotion Control. ICME 2024: 1-6 - [c230]Guanghe Li, Yixiang Shan, Zhengbang Zhu, Ting Long, Weinan Zhang:
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching. ICML 2024 - [c229]Ziyu Wan, Xidong Feng, Muning Wen, Stephen Marcus McAleer, Ying Wen, Weinan Zhang, Jun Wang:
AlphaZero-Like Tree-Search can Guide Large Language Model Decoding and Training. ICML 2024 - [c228]Kounianhua Du, Jizheng Chen, Jianghao Lin, Yunjia Xi, Hangyu Wang, Xinyi Dai, Bo Chen, Ruiming Tang, Weinan Zhang:
DisCo: Towards Harmonious Disentanglement and Collaboration between Tabular and Semantic Space for Recommendation. KDD 2024: 666-676 - [c227]Yifan Liu, Weiwen Liu, Wei Xia, Jieming Zhu, Weinan Zhang, Zhenhua Dong, Yang Wang, Ruiming Tang, Rui Zhang, Yong Yu:
Multi-sourced Integrated Ranking with Exposure Fairness. PAKDD (5) 2024: 207-218 - [c226]Longchao Da, Chen Chu, Weinan Zhang, Hua Wei:
CityFlowER: An Efficient and Realistic Traffic Simulator with Embedded Machine Learning Models. ECML/PKDD (8) 2024: 368-373 - [c225]Yunjia Xi, Weiwen Liu, Jianghao Lin, Xiaoling Cai, Hong Zhu, Jieming Zhu, Bo Chen, Ruiming Tang, Weinan Zhang, Yong Yu:
Towards Open-World Recommendation with Knowledge Augmentation from Large Language Models. RecSys 2024: 12-22 - [c224]Hangyu Wang, Jianghao Lin, Xiangyang Li, Bo Chen, Chenxu Zhu, Ruiming Tang, Weinan Zhang, Yong Yu:
FLIP: Fine-grained Alignment between ID-based Models and Pretrained Language Models for CTR Prediction. RecSys 2024: 94-104 - [c223]Ruiwen Zhou, Yingxuan Yang, Muning Wen, Ying Wen, Wenhao Wang, Chunling Xi, Guoqiang Xu, Yong Yu, Weinan Zhang:
TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision. SIGIR 2024: 3-13 - [c222]Qingpeng Cai, Xiangyu Zhao, Ling Pan, Xin Xin, Jin Huang, Weinan Zhang, Li Zhao, Dawei Yin, Grace Hui Yang:
AgentIR: 1st Workshop on Agent-based Information Retrieval. SIGIR 2024: 3025-3028 - [c221]Han Zhang, Quan Gan, David Wipf, Weinan Zhang:
GFS: Graph-based Feature Synthesis for Prediction over Relational Databases. VLDB Workshops 2024 - [c220]Minjie Wang, Quan Gan, David Wipf, Zhenkun Cai, Ning Li, Jianheng Tang, Yanlin Zhang, Zizhao Zhang, Zunyao Mao, Yakun Song, Yanbo Wang, Jiahang Li, Han Zhang, Guang Yang, Xiao Qin, Chuan Lei, Muhan Zhang, Weinan Zhang, Christos Faloutsos, Zheng Zhang:
4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on RDBs. VLDB Workshops 2024 - [c219]Cheng Deng, Tianhang Zhang, Zhongmou He, Qiyuan Chen, Yuanyuan Shi, Yi Xu, Luoyi Fu, Weinan Zhang, Xinbing Wang, Chenghu Zhou, Zhouhan Lin, Junxian He:
K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization. WSDM 2024: 161-170 - [c218]Yifan Liu, Wei Xia, Weiwen Liu, Menghui Zhu, Weinan Zhang, Ruiming Tang, Yong Yu:
HiFI: Hierarchical Fairness-aware Integrated Ranking with Constrained Reinforcement Learning. WWW (Companion Volume) 2024: 196-205 - [c217]Junjie Huang, Guohao Cai, Jieming Zhu, Zhenhua Dong, Ruiming Tang, Weinan Zhang, Yong Yu:
Recall-Augmented Ranking: Enhancing Click-Through Rate Prediction Accuracy with Cross-Stage Data. WWW (Companion Volume) 2024: 830-833 - [c216]Jiarui Jin, Zexue He, Mengyue Yang, Weinan Zhang, Yong Yu, Jun Wang, Julian J. McAuley:
InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization. WWW 2024: 1350-1361 - [c215]Jianghao Lin, Bo Chen, Hangyu Wang, Yunjia Xi, Yanru Qu, Xinyi Dai, Kangning Zhang, Ruiming Tang, Yong Yu, Weinan Zhang:
ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction. WWW 2024: 3319-3330 - [c214]Jianghao Lin, Rong Shan, Chenxu Zhu, Kounianhua Du, Bo Chen, Shigang Quan, Ruiming Tang, Yong Yu, Weinan Zhang:
ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation. WWW 2024: 3497-3508 - [c213]Jiachen Zhu, Yichao Wang, Jianghao Lin, Jiarui Qin, Ruiming Tang, Weinan Zhang, Yong Yu:
M-scan: A Multi-Scenario Causal-driven Adaptive Network for Recommendation. WWW 2024: 3844-3853 - [i233]Zhouhan Lin, Cheng Deng, Le Zhou, Tianhang Zhang, Yi Xu, Yutong Xu, Zhongmou He, Yuanyuan Shi, Beiya Dai, Yunchong Song, Boyi Zeng, Qiyuan Chen, Tao Shi, Tianyu Huang, Yiwei Xu, Shu Wang, Luoyi Fu, Weinan Zhang, Junxian He, Chao Ma, Yunqiang Zhu, Xinbing Wang, Chenghu Zhou:
GeoGalactica: A Scientific Large Language Model in Geoscience. CoRR abs/2401.00434 (2024) - [i232]Qingyao Li, Lingyue Fu, Weiming Zhang, Xianyu Chen, Jingwei Yu, Wei Xia, Weinan Zhang, Ruiming Tang, Yong Yu:
Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges. CoRR abs/2401.08664 (2024) - [i231]Jiarui Qin, Weiwen Liu, Ruiming Tang, Weinan Zhang, Yong Yu:
D2K: Turning Historical Data into Retrievable Knowledge for Recommender Systems. CoRR abs/2401.11478 (2024) - [i230]Jiarui Jin, Zexue He, Mengyue Yang, Weinan Zhang, Yong Yu, Jun Wang, Julian J. McAuley:
InfoRank: Unbiased Learning-to-Rank via Conditional Mutual Information Minimization. CoRR abs/2401.12553 (2024) - [i229]Liyuan Mao, Haoran Xu, Weinan Zhang, Xianyuan Zhan:
ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-gradient Update. CoRR abs/2402.00348 (2024) - [i228]Guanghe Li, Yixiang Shan, Zhengbang Zhu, Ting Long, Weinan Zhang:
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching. CoRR abs/2402.02439 (2024) - [i227]Yixiang Shan, Zhengbang Zhu, Ting Long, Qifan Liang, Yi Chang, Weinan Zhang, Liang Yin:
Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning. CoRR abs/2402.02772 (2024) - [i226]Longchao Da, Chen Chu, Weinan Zhang, Hua Wei:
CityFlowER: An Efficient and Realistic Traffic Simulator with Embedded Machine Learning Models. CoRR abs/2402.06127 (2024) - [i225]Muning Wen, Cheng Deng, Jun Wang, Weinan Zhang, Ying Wen:
Entropy-Regularized Token-Level Policy Optimization for Large Language Models. CoRR abs/2402.06700 (2024) - [i224]Haoran He, Chenjia Bai, Ling Pan, Weinan Zhang, Bin Zhao, Xuelong Li:
Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning. CoRR abs/2402.14407 (2024) - [i223]Jingxiao Chen, Weiji Xie, Weinan Zhang, Yong Yu, Ying Wen:
Offline Fictitious Self-Play for Competitive Games. CoRR abs/2403.00841 (2024) - [i222]Hangyu Wang, Jianghao Lin, Bo Chen, Yang Yang, Ruiming Tang, Weinan Zhang, Yong Yu:
Towards Efficient and Effective Unlearning of Large Language Models for Recommendation. CoRR abs/2403.03536 (2024) - [i221]Jingxiao Chen, Ziqin Gong, Minghuan Liu, Jun Wang, Yong Yu, Weinan Zhang:
Looking Ahead to Avoid Being Late: Solving Hard-Constrained Traveling Salesman Problem. CoRR abs/2403.05318 (2024) - [i220]Ruiwen Zhou, Yingxuan Yang, Muning Wen, Ying Wen, Wenhao Wang, Chunling Xi, Guoqiang Xu, Yong Yu, Weinan Zhang:
TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision. CoRR abs/2403.06221 (2024) - [i219]Yifan Liu, Kangning Zhang, Xiangyuan Ren, Yanhua Huang, Jiarui Jin, Yingjie Qin, Ruilong Su, Ruiwen Xu, Weinan Zhang:
An Aligning and Training Framework for Multimodal Recommendations. CoRR abs/2403.12384 (2024) - [i218]Yunjia Xi, Weiwen Liu, Jianghao Lin, Chuhan Wu, Bo Chen, Ruiming Tang, Weinan Zhang, Yong Yu:
Play to Your Strengths: Collaborative Intelligence of Conventional Recommender Models and Large Language Models. CoRR abs/2403.16378 (2024) - [i217]Jiachen Zhu, Yichao Wang, Jianghao Lin, Jiarui Qin, Ruiming Tang, Weinan Zhang, Yong Yu:
M-scan: A Multi-Scenario Causal-driven Adaptive Network for Recommendation. CoRR abs/2404.07581 (2024) - [i216]Junjie Huang, Guohao Cai, Jieming Zhu, Zhenhua Dong, Ruiming Tang, Weinan Zhang, Yong Yu:
Recall-Augmented Ranking: Enhancing Click-Through Rate Prediction Accuracy with Cross-Stage Data. CoRR abs/2404.09578 (2024) - [i215]Kangning Zhang, Yingjie Qin, Ruilong Su, Yifan Liu, Jiarui Jin, Weinan Zhang, Yong Yu:
DRepMRec: A Dual Representation Learning Framework for Multimodal Recommendation. CoRR abs/2404.11119 (2024) - [i214]Lei Zheng, Ning Li, Weinan Zhang, Yong Yu:
Retrieval and Distill: A Temporal Data Shift-Free Paradigm for Online Recommendation System. CoRR abs/2404.15678 (2024) - [i213]Minjie Wang, Quan Gan, David Wipf, Zhenkun Cai, Ning Li, Jianheng Tang, Yanlin Zhang, Zizhao Zhang, Zunyao Mao, Yakun Song, Yanbo Wang, Jiahang Li, Han Zhang, Guang Yang, Xiao Qin, Chuan Lei, Muhan Zhang, Weinan Zhang, Christos Faloutsos, Zheng Zhang:
4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on Relational DBs. CoRR abs/2404.18209 (2024) - [i212]Kounianhua Du, Renting Rui, Huacan Chai, Lingyue Fu, Wei Xia, Yasheng Wang, Ruiming Tang, Yong Yu, Weinan Zhang:
CodeGRAG: Extracting Composed Syntax Graphs for Retrieval Augmented Cross-Lingual Code Generation. CoRR abs/2405.02355 (2024) - [i211]Qingyao Li, Wei Xia, Kounianhua Du, Qiji Zhang, Weinan Zhang, Ruiming Tang, Yong Yu:
Learning Structure and Knowledge Aware Representation with Large Language Models for Concept Recommendation. CoRR abs/2405.12442 (2024) - [i210]Lei Zheng, Ning Li, Yanhuan Huang, Ruiwen Xu, Weinan Zhang, Yong Yu:
Look into the Future: Deep Contextualized Sequential Recommendation. CoRR abs/2405.14359 (2024) - [i209]Muning Wen, Ziyu Wan, Weinan Zhang, Jun Wang, Ying Wen:
Reinforcing Language Agents via Policy Optimization with Action Decomposition. CoRR abs/2405.15821 (2024) - [i208]Shutong Ding, Ke Hu, Zhenhao Zhang, Kan Ren, Weinan Zhang, Jingyi Yu, Jingya Wang, Ye Shi:
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization. CoRR abs/2405.16173 (2024) - [i207]Hanye Zhao, Xiaoshen Han, Zhengbang Zhu, Minghuan Liu, Yong Yu, Weinan Zhang:
Diffusion-based Dynamics Models for Long-Horizon Rollout in Offline Reinforcement Learning. CoRR abs/2405.19189 (2024) - [i206]Kounianhua Du, Jizheng Chen, Jianghao Lin, Yunjia Xi, Hangyu Wang, Xinyi Dai, Bo Chen, Ruiming Tang, Weinan Zhang:
DisCo: Towards Harmonious Disentanglement and Collaboration between Tabular and Semantic Space for Recommendation. CoRR abs/2406.00011 (2024) - [i205]Jianghao Lin, Xinyi Dai, Rong Shan, Bo Chen, Ruiming Tang, Yong Yu, Weinan Zhang:
Large Language Models Make Sample-Efficient Recommender Systems. CoRR abs/2406.02368 (2024) - [i204]Tairan He, Zhengyi Luo, Xialin He, Wenli Xiao, Chong Zhang, Weinan Zhang, Kris Kitani, Changliu Liu, Guanya Shi:
OmniH2O: Universal and Dexterous Human-to-Humanoid Whole-Body Teleoperation and Learning. CoRR abs/2406.08858 (2024) - [i203]Yuxuan Mu, Shihao Zou, Kangning Yin, Zheng Tian, Li Cheng, Weinan Zhang, Jun Wang:
RACon: Retrieval-Augmented Simulated Character Locomotion Control. CoRR abs/2406.17795 (2024) - [i202]Jizheng Chen, Kounianhua Du, Jianghao Lin, Bo Chen, Ruiming Tang, Weinan Zhang:
ELCoRec: Enhance Language Understanding with Co-Propagation of Numerical and Categorical Features for Recommendation. CoRR abs/2406.18825 (2024) - [i201]Lingyue Fu, Hao Guan, Kounianhua Du, Jianghao Lin, Wei Xia, Weinan Zhang, Ruiming Tang, Yasheng Wang, Yong Yu:
SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language Model. CoRR abs/2407.01245 (2024) - [i200]Yunjia Xi, Weiwen Liu, Jianghao Lin, Bo Chen, Ruiming Tang, Weinan Zhang, Yong Yu:
MemoCRS: Memory-enhanced Sequential Conversational Recommender Systems with Large Language Models. CoRR abs/2407.04960 (2024) - [i199]Liyuan Mao, Haoran Xu, Weinan Zhang, Xianyuan Zhan, Amy Zhang:
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning. CoRR abs/2407.20109 (2024) - [i198]Junjie Huang, Jizheng Chen, Jianghao Lin, Jiarui Qin, Ziming Feng, Weinan Zhang, Yong Yu:
A Comprehensive Survey on Retrieval Methods in Recommender Systems. CoRR abs/2407.21022 (2024) - [i197]Jiachen Zhu, Jianghao Lin, Xinyi Dai, Bo Chen, Rong Shan, Jieming Zhu, Ruiming Tang, Yong Yu, Weinan Zhang:
Lifelong Personalized Low-Rank Adaptation of Large Language Models for Recommendation. CoRR abs/2408.03533 (2024) - [i196]Yingxuan Yang, Huayi Wang, Muning Wen, Weinan Zhang:
P3: A Policy-Driven, Pace-Adaptive, and Diversity-Promoted Framework for Optimizing LLM Training. CoRR abs/2408.05541 (2024) - [i195]Yunjia Xi, Hangyu Wang, Bo Chen, Jianghao Lin, Menghui Zhu, Weiwen Liu, Ruiming Tang, Weinan Zhang, Yong Yu:
A Decoding Acceleration Framework for Industrial Deployable LLM-based Recommender Systems. CoRR abs/2408.05676 (2024) - [i194]Yunjia Xi, Weiwen Liu, Jianghao Lin, Muyan Weng, Xiaoling Cai, Hong Zhu, Jieming Zhu, Bo Chen, Ruiming Tang, Yong Yu, Weinan Zhang:
Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models. CoRR abs/2408.10520 (2024) - [i193]Jianghao Lin, Jiaqi Liu, Jiachen Zhu, Yunjia Xi, Chengkai Liu, Yangtian Zhang, Yong Yu, Weinan Zhang:
A Survey on Diffusion Models for Recommender Systems. CoRR abs/2409.05033 (2024) - [i192]Shao Zhang, Xihuai Wang, Wenhao Zhang, Yongshan Chen, Landi Gao, Dakuo Wang, Weinan Zhang, Xinbing Wang, Ying Wen:
Mutual Theory of Mind in Human-AI Collaboration: An Empirical Study with LLM-driven AI Agents in a Real-time Shared Workspace Task. CoRR abs/2409.08811 (2024) - [i191]Yiwei Shi, Muning Wen, Qi Zhang, Weinan Zhang, Cunjia Liu, Weiru Liu:
Autonomous Goal Detection and Cessation in Reinforcement Learning: A Case Study on Source Term Estimation. CoRR abs/2409.09541 (2024) - [i190]Qingyao Li, Wei Xia, Kounianhua Du, Xinyi Dai, Ruiming Tang, Yasheng Wang, Yong Yu, Weinan Zhang:
RethinkMCTS: Refining Erroneous Thoughts in Monte Carlo Tree Search for Code Generation. CoRR abs/2409.09584 (2024) - [i189]Hang Lai, Jiahang Cao, Jiafeng Xu, Hongtao Wu, Yunfeng Lin, Tao Kong, Yong Yu, Weinan Zhang:
World Model-based Perception for Visual Legged Locomotion. CoRR abs/2409.16784 (2024) - [i188]Naming Liu, Mingzhi Wang, Xihuai Wang, Weinan Zhang, Yaodong Yang, Youzhi Zhang, Bo An, Ying Wen:
Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games. CoRR abs/2410.01575 (2024) - 2023
- [j33]Qingyao Ai, Ting Bai, Zhao Cao, Yi Chang, Jiawei Chen, Zhumin Chen, Zhiyong Cheng, Shoubin Dong, Zhicheng Dou, Fuli Feng, Shen Gao, Jiafeng Guo, Xiangnan He, Yanyan Lan, Chenliang Li, Yiqun Liu, Ziyu Lyu, Weizhi Ma, Jun Ma, Zhaochun Ren, Pengjie Ren, Zhiqiang Wang, Mingwen Wang, Ji-Rong Wen, Le Wu, Xin Xin, Jun Xu, Dawei Yin, Peng Zhang, Fan Zhang, Weinan Zhang, Min Zhang, Xiaofei Zhu:
Information Retrieval meets Large Language Models: A strategic report from Chinese IR community. AI Open 4: 80-90 (2023) - [j32]Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Hai-Feng Zhang, Weinan Zhang:
Large sequence models for sequential decision-making: a survey. Frontiers Comput. Sci. 17(6): 176349 (2023) - [j31]Linghui Meng, Muning Wen, Chenyang Le, Xiyun Li, Dengpeng Xing, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Yaodong Yang, Bo Xu:
Offline Pre-trained Multi-agent Decision Transformer. Mach. Intell. Res. 20(2): 233-248 (2023) - [j30]Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Yong Yu, Jun Wang, Weinan Zhang:
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning. J. Mach. Learn. Res. 24: 150:1-150:12 (2023) - [j29]Jian Shen, Hang Lai, Minghuan Liu, Han Zhao, Yong Yu, Weinan Zhang:
Adaptation Augmented Model-based Policy Optimization. J. Mach. Learn. Res. 24: 218:1-218:35 (2023) - [j28]Haoran Zhao, Yuchen Fang, Yuxiang Zhao, Zheng Tian, Weinan Zhang, Xidong Feng, Li Yu, Wei Li, Hulei Fan, Tiema Mu:
Time-Series Representation Learning in Topology Prediction for Passive Optical Network of Telecom Operators. Sensors 23(6): 3345 (2023) - [j27]Chenxu Zhu, Bo Chen, Weinan Zhang, Jincai Lai, Ruiming Tang, Xiuqiang He, Zhenguo Li, Yong Yu:
AIM: Automatic Interaction Machine for Click-Through Rate Prediction. IEEE Trans. Knowl. Data Eng. 35(4): 3389-3403 (2023) - [j26]Haokun Chen, Chenxu Zhu, Ruiming Tang, Weinan Zhang, Xiuqiang He, Yong Yu:
Large-Scale Interactive Recommendation With Tree-Structured Reinforcement Learning. IEEE Trans. Knowl. Data Eng. 35(4): 4018-4032 (2023) - [j25]Jiarui Qin, Weinan Zhang, Rong Su, Zhirong Liu, Weiwen Liu, Guangpeng Zhao, Hao Li, Ruiming Tang, Xiuqiang He, Yong Yu:
Learning to Retrieve User Behaviors for Click-through Rate Estimation. ACM Trans. Inf. Syst. 41(4): 98:1-98:31 (2023) - [c212]Xianyu Chen, Jian Shen, Wei Xia, Jiarui Jin, Yakun Song, Weinan Zhang, Weiwen Liu, Menghui Zhu, Ruiming Tang, Kai Dong, Dingyin Xia, Yong Yu:
Set-to-Sequence Ranking-Based Concept-Aware Learning Path Recommendation. AAAI 2023: 5027-5035 - [c211]Yuchen Fang, Kan Ren, Caihua Shan, Yifei Shen, You Li, Weinan Zhang, Yong Yu, Dongsheng Li:
Learning Decomposed Spatial Relations for Multi-Variate Time-Series Modeling. AAAI 2023: 7530-7538 - [c210]Jiarui Jin, Xianyu Chen, Weinan Zhang, Mengyue Yang, Yang Wang, Yali Du, Yong Yu, Jun Wang:
Replace Scoring with Arrangement: A Contextual Set-to-Arrangement Framework for Learning-to-Rank. CIKM 2023: 1004-1013 - [c209]Qingyao Li, Wei Xia, Li'ang Yin, Jian Shen, Renting Rui, Weinan Zhang, Xianyu Chen, Ruiming Tang, Yong Yu:
Graph Enhanced Hierarchical Reinforcement Learning for Goal-oriented Learning Path Recommendation. CIKM 2023: 1318-1327 - [c208]Weitong Ou, Bo Chen, Weiwen Liu, Xinyi Dai, Weinan Zhang, Wei Xia, Xuan Li, Ruiming Tang, Yong Yu:
Optimal Real-Time Bidding Strategy for Position Auctions in Online Advertising. CIKM 2023: 4766-4772 - [c207]Xin Xin, Xiangyu Zhao, Jin Huang, Weinan Zhang, Li Zhao, Dawei Yin, Grace Hui Yang:
DRL4IR: 4th Workshop on Deep Reinforcement Learning for Information Retrieval. CIKM 2023: 5304-5307 - [c206]Jiexing Qi, Shuhao Li, Zhixin Guo, Yusheng Huang, Chenghu Zhou, Weinan Zhang, Xinbing Wang, Zhouhan Lin:
Text Classification In The Wild: A Large-Scale Long-Tailed Name Normalization Dataset. ICASSP 2023: 1-5 - [c205]Weiwen Liu, Yunjia Xi, Jiarui Qin, Xinyi Dai, Ruiming Tang, Shuai Li, Weinan Zhang, Rui Zhang:
Personalized Diversification for Neural Re-ranking in Recommendation. ICDE 2023: 802-815 - [c204]Minghuan Liu, Tairan He, Weinan Zhang, Shuicheng Yan, Zhongwen Xu:
Visual Imitation Learning with Patch Rewards. ICLR 2023 - [c203]Xihuai Wang, Zheng Tian, Ziyu Wan, Ying Wen, Jun Wang, Weinan Zhang:
Order Matters: Agent-by-agent Policy Optimization. ICLR 2023 - [c202]Hanjing Wang, Man-Kit Sit, Congjie He, Ying Wen, Weinan Zhang, Jun Wang, Yaodong Yang, Luo Mai:
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models. ICML 2023: 36380-36390 - [c201]Hang Lai, Weinan Zhang, Xialin He, Chen Yu, Zheng Tian, Yong Yu, Jun Wang:
Sim-to-Real Transfer for Quadrupedal Locomotion via Terrain Transformer. ICRA 2023: 5141-5147 - [c200]Chen Yu, Weinan Zhang, Hang Lai, Zheng Tian, Laurent Kneip, Jun Wang:
Multi-embodiment Legged Robot Control as a Sequence Modeling Problem. ICRA 2023: 7250-7257 - [c199]Weinan Zhang:
Large Decision Models. IJCAI 2023: 7062-7067 - [c198]Jianghao Lin, Yanru Qu, Wei Guo, Xinyi Dai, Ruiming Tang, Yong Yu, Weinan Zhang:
MAP: A Model-agnostic Pretraining Framework for Click-through Rate Prediction. KDD 2023: 1384-1395 - [c197]Hangyu Wang, Ting Long, Liang Yin, Weinan Zhang, Wei Xia, Qichen Hong, Dingyin Xia, Ruiming Tang, Yong Yu:
GMOCAT: A Graph-Enhanced Multi-Objective Method for Computerized Adaptive Testing. KDD 2023: 2279-2289 - [c196]