


Yali Du 0001
Person information
- affiliation: King's College London, London, UK
- affiliation (PhD 2019): University of Technology Sydney, Faculty of Engineering and Information Technology, Ultimo, NSW, Australia
Other persons with the same name
- Yali Du
- Yali Du 0002 — Nanjing University, China
- Yali Du 0003 — Peking University Third Hospital, Peking University, Beijing, China
- Yali Du 0004 — Xi'an Jiaotong University, Shaanxi Engineering Research Center of Nondestructive Testing and Structural Integrity Evaluation, China
- Yali Du 0005 — Hebei University of Technology, Tianjin, China
- Yali Du 0006 — Xi'an Polytechnic University, Xi'an, China
- Yali Du 0008 — Shandong University, Qingdao, China
2020 – today
- 2024
- [j13]Ming Yang, Kaiyan Zhao, Yiming Wang, Renzhi Dong, Yali Du, Furui Liu, Mingliang Zhou, Leong Hou U:
Team-wise effective communication in multi-agent reinforcement learning. Auton. Agents Multi Agent Syst. 38(2): 36 (2024) - [j12]Richard Willis, Yali Du, Joel Z. Leibo, Michael Luck:
Resolving social dilemmas with minimal reward transfer. Auton. Agents Multi Agent Syst. 38(2): 49 (2024) - [j11]Yang Li, Shao Zhang, Jichen Sun, Wenhao Zhang, Yali Du, Ying Wen, Xinbing Wang, Wei Pan:
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination. J. Artif. Intell. Res. 80: 1139-1185 (2024) - [j10]Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Alois Knoll:
A Review of Safe Reinforcement Learning: Methods, Theories, and Applications. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 11216-11235 (2024) - [j9]Xingzhou Lou, Junge Zhang, Yali Du, Chao Yu, Zhaofeng He, Kaiqi Huang:
Leveraging Joint-Action Embedding in Multiagent Reinforcement Learning for Cooperative Games. IEEE Trans. Games 16(2): 470-482 (2024) - [c52]Sirui Chen, Zhaowei Zhang, Yaodong Yang, Yali Du:
STAS: Spatial-Temporal Return Decomposition for Solving Sparse Rewards Problems in Multi-agent Reinforcement Learning. AAAI 2024: 17337-17345 - [c51]Xingzhou Lou, Junge Zhang, Timothy J. Norman, Kaiqi Huang, Yali Du:
TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient. AAAI 2024: 17496-17504 - [c50]Zijing Shi, Meng Fang, Ling Chen, Yali Du, Jun Wang:
Human-Guided Moral Decision Making in Text-Based Games. AAAI 2024: 21574-21582 - [c49]Xingzhou Lou, Junge Zhang, Ziyan Wang, Kaiqi Huang, Yali Du:
Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models. AAMAS 2024: 1274-1282 - [c48]Stefan Roesch, Stefanos Leonardos, Yali Du:
The Selfishness Level of Social Dilemmas. AAMAS 2024: 2441-2443 - [c47]Mark Towers, Yali Du, Christopher T. Freeman, Timothy J. Norman:
Explaining an Agent's Future Beliefs Through Temporally Decomposing Future Reward Estimators. ECAI 2024: 2790-2797 - [c46]Mark Towers, Yali Du, Christopher T. Freeman, Timothy J. Norman:
Temporal Explanations of Deep Reinforcement Learning Agents. EXTRAAMAS 2024: 99-115 - [c45]Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li:
PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation. ICML 2024 - [c44]Wenxi Wu, Fabio Pierazzi, Yali Du, Martim Brandão:
Characterizing Physical Adversarial Attacks on Robot Motion Planners. ICRA 2024: 14319-14325 - [c43]Jinyu Cai, Yunhe Zhang, Jicong Fan, Yali Du, Wenzhong Guo:
Dual Contrastive Graph-Level Clustering with Multiple Cluster Perspectives Alignment. IJCAI 2024: 3770-3779 - [c42]Ruiqing Chen, Xiaoyuan Zhang, Yali Du, Yifan Zhong, Zheng Tian, Fanglei Sun, Yaodong Yang:
Off-Agent Trust Region Policy Optimization. IJCAI 2024: 3798-3806 - [c41]Xiong-Hui Chen, Ziyan Wang, Yali Du, Shengyi Jiang, Meng Fang, Yang Yu, Jun Wang:
Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting. NeurIPS 2024 - [c40]Zangir Iklassov, Yali Du, Farkhad Akimov, Martin Takác:
Self-Guiding Exploration for Combinatorial Problems. NeurIPS 2024 - [c39]Xuanfa Jin, Ziyan Wang, Yali Du, Meng Fang, Haifeng Zhang, Jun Wang:
Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf. NeurIPS 2024 - [c38]Yang Li, Wenhao Zhang, Jianhong Wang, Shao Zhang, Yali Du, Ying Wen, Wei Pan:
Aligning Individual and Collective Objectives in Multi-Agent Cooperation. NeurIPS 2024 - [c37]Nam Phuong Tran, The-Anh Ta, Shuqing Shi, Debmalya Mandal, Yali Du, Long Tran-Thanh:
Learning the Expected Core of Strictly Convex Stochastic Cooperative Games. NeurIPS 2024 - [i46]Xingzhou Lou, Junge Zhang, Ziyan Wang, Kaiqi Huang, Yali Du:
Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models. CoRR abs/2401.07553 (2024) - [i45]Nam Phuong Tran, The-Anh Ta, Shuqing Shi, Debmalya Mandal, Yali Du, Long Tran-Thanh:
Learning the Expected Core of Strictly Convex Stochastic Cooperative Games. CoRR abs/2402.07067 (2024) - [i44]Xidong Feng, Ziyu Wan, Mengyue Yang, Ziyan Wang, Girish A. Koushik, Yali Du, Ying Wen, Jun Wang:
Natural Language Reinforcement Learning. CoRR abs/2402.07157 (2024) - [i43]Zhixun Chen, Yali Du, David Mguni:
All Language Models Large and Small. CoRR abs/2402.12061 (2024) - [i42]Yang Li, Wenhao Zhang, Jianhong Wang, Shao Zhang, Yali Du, Ying Wen, Wei Pan:
Aligning Individual and Collective Objectives in Multi-Agent Cooperation. CoRR abs/2402.12416 (2024) - [i41]Zangir Iklassov, Yali Du, Farkhad Akimov, Martin Takác:
Self-Guiding Exploration for Combinatorial Problems. CoRR abs/2405.17950 (2024) - [i40]Xuanfa Jin, Ziyan Wang, Yali Du, Meng Fang, Haifeng Zhang, Jun Wang:
Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf. CoRR abs/2405.19946 (2024) - [i39]Ziyan Wang, Meng Fang, Tristan Tomilin, Fei Fang, Yali Du:
Safe Multi-agent Reinforcement Learning with Natural Language Constraints. CoRR abs/2405.20018 (2024) - [i38]Mark Towers, Yali Du, Christopher T. Freeman, Timothy J. Norman:
Explaining an Agent's Future Beliefs through Temporally Decomposing Future Reward Estimators. CoRR abs/2408.08230 (2024) - [i37]Ruiqi Zhang, Jing Hou, Florian Walter, Shangding Gu, Jiayi Guan, Florian Röhrbein, Yali Du, Panpan Cai, Guang Chen, Alois Knoll:
Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey. CoRR abs/2408.09675 (2024) - [i36]Yudi Zhang, Pei Xiao, Lu Wang, Chaoyun Zhang, Meng Fang, Yali Du, Yevgeniy Puzyrev, Randolph Yao, Si Qin, Qingwei Lin, Mykola Pechenizkiy, Dongmei Zhang, Saravan Rajmohan, Qi Zhang:
RuAG: Learned-rule-augmented Generation for Large Language Models. CoRR abs/2411.03349 (2024) - [i35]Fengshuo Bai, Runze Liu, Yali Du, Ying Wen, Yaodong Yang:
RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors. CoRR abs/2412.10713 (2024)
- 2023
- [j8]Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Gangyan Xu, Chengqi Zhang:
Shared dynamics learning for large-scale traveling salesman problem. Adv. Eng. Informatics 56: 102005 (2023) - [j7]Shangding Gu, Jakub Grudzien Kuba, Yuanpei Chen, Yali Du, Long Yang, Alois C. Knoll, Yaodong Yang:
Safe multi-agent reinforcement learning for multi-robot control. Artif. Intell. 319: 103905 (2023) - [j6]Shangding Gu, Alap Kshirsagar, Yali Du, Guang Chen, Jan Peters, Alois Knoll:
A human-centered safe robot reinforcement learning framework with interactive behaviors. Frontiers Neurorobotics 17 (2023) - [c36]Yali Du:
Cooperative Multi-Agent Learning in a Complex World: Challenges and Solutions. AAAI 2023: 15436 - [c35]Zhijian Duan, Wenhan Huang, Dinghuai Zhang, Yali Du, Jun Wang, Yaodong Yang, Xiaotie Deng:
Is Nash Equilibrium Approximator Learnable? AAMAS 2023: 233-241 - [c34]Xingzhou Lou, Jiaxian Guo, Junge Zhang, Jun Wang, Kaiqi Huang, Yali Du:
PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination. AAMAS 2023: 679-688 - [c33]Jiarui Jin, Xianyu Chen, Weinan Zhang, Mengyue Yang, Yang Wang, Yali Du, Yong Yu, Jun Wang:
Replace Scoring with Arrangement: A Contextual Set-to-Arrangement Framework for Learning-to-Rank. CIKM 2023: 1004-1013 - [c32]Ming Yang, Renzhi Dong, Yiming Wang, Furui Liu, Yali Du, Mingliang Zhou, Leong Hou U:
TieComm: Learning a Hierarchical Communication Topology Based on Tie Theory. DASFAA (1) 2023: 604-613 - [c31]Zijing Shi, Meng Fang, Yunqiu Xu, Ling Chen, Yali Du:
Stay Moral and Explore: Learn to Behave Morally in Text-based Games. ICLR 2023 - [c30]Yang Li, Shao Zhang, Jichen Sun, Yali Du, Ying Wen, Xinbing Wang, Wei Pan:
Cooperative Open-ended Learning Framework for Zero-Shot Coordination. ICML 2023: 20470-20484 - [c29]Yabin Zhang, Weiqi Shao, Xu Chen, Yali Du, Xiaoxiao Xu, Dong Zheng, Changhua Pei, Shuai Zhang, Peng Jiang, Kun Gai:
A Multi-Agent Framework for Recommendation with Heterogeneous Sources. IJCNN 2023: 1-8 - [c28]Yudi Zhang, Yali Du, Biwei Huang, Ziyan Wang, Jun Wang, Meng Fang, Mykola Pechenizkiy:
Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach. NeurIPS 2023 - [c27]Shutong Ding, Jingya Wang, Yali Du, Ye Shi:
Reduced Policy Optimization for Continuous Control with Hard Constraints. NeurIPS 2023 - [c26]Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang:
ChessGPT: Bridging Policy Learning and Language Modeling. NeurIPS 2023 - [c25]Xue Yan, Jiaxian Guo, Xingzhou Lou, Jun Wang, Haifeng Zhang, Yali Du:
An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination. NeurIPS 2023 - [c24]Mengyue Yang, Yonggang Zhang, Zhen Fang, Yali Du, Furui Liu, Jean-Francois Ton, Jianhong Wang, Jun Wang:
Invariant Learning via Probability of Sufficient and Necessary Causes. NeurIPS 2023 - [i34]Xingzhou Lou, Jiaxian Guo, Junge Zhang, Jun Wang, Kaiqi Huang, Yali Du:
PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination. CoRR abs/2301.06387 (2023) - [i33]Lukas Schäfer, Oliver Slumbers, Stephen McAleer, Yali Du, Stefano V. Albrecht, David Mguni:
Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning. CoRR abs/2302.03439 (2023) - [i32]Yang Li, Shao Zhang, Jichen Sun, Yali Du, Ying Wen, Xinbing Wang, Wei Pan:
Cooperative Open-ended Learning Framework for Zero-shot Coordination. CoRR abs/2302.04831 (2023) - [i31]Shangding Gu, Alap Kshirsagar, Yali Du, Guang Chen, Yaodong Yang, Jan Peters, Alois C. Knoll:
A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors. CoRR abs/2302.13137 (2023) - [i30]Sirui Chen, Zhaowei Zhang, Yali Du, Yaodong Yang:
STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning. CoRR abs/2304.07520 (2023) - [i29]Liting Chen, Lu Wang, Hang Dong, Yali Du, Jie Yan, Fangkai Yang, Shuang Li, Pu Zhao, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Introspective Tips: Large Language Model for In-Context Decision Making. CoRR abs/2305.11598 (2023) - [i28]Yudi Zhang, Yali Du, Biwei Huang, Ziyan Wang, Jun Wang, Meng Fang, Mykola Pechenizkiy:
GRD: A Generative Approach for Interpretable Reward Redistribution in Reinforcement Learning. CoRR abs/2305.18427 (2023) - [i27]Yang Li, Shao Zhang, Jichen Sun, Wenhao Zhang, Yali Du, Ying Wen, Xinbing Wang, Wei Pan:
Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination. CoRR abs/2306.03034 (2023) - [i26]Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li:
Zero-shot Preference Learning for Offline RL via Optimal Transport. CoRR abs/2306.03615 (2023) - [i25]Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang:
ChessGPT: Bridging Policy Learning and Language Modeling. CoRR abs/2306.09200 (2023) - [i24]Jiarui Jin, Xianyu Chen, Weinan Zhang, Mengyue Yang, Yang Wang, Yali Du, Yong Yu, Jun Wang:
Replace Scoring with Arrangement: A Contextual Set-to-Arrangement Framework for Learning-to-Rank. CoRR abs/2308.02860 (2023) - [i23]Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jun Wang:
Invariant Learning via Probability of Sufficient and Necessary Causes. CoRR abs/2309.12559 (2023) - [i22]Shutong Ding, Jingya Wang, Yali Du, Ye Shi:
Reduced Policy Optimization for Continuous Control with Hard Constraints. CoRR abs/2310.09574 (2023) - [i21]Richard Willis, Yali Du, Joel Z. Leibo, Michael Luck:
Resolving social dilemmas with minimal reward transfer. CoRR abs/2310.12928 (2023) - [i20]Ziyan Wang, Yali Du, Yudi Zhang, Meng Fang, Biwei Huang:
MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit Assignment. CoRR abs/2312.03644 (2023) - [i19]Yali Du, Joel Z. Leibo, Usman Islam, Richard Willis, Peter Sunehag:
A Review of Cooperation in Multi-agent Learning. CoRR abs/2312.05162 (2023) - [i18]Xingzhou Lou, Junge Zhang, Timothy J. Norman, Kaiqi Huang, Yali Du:
TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient. CoRR abs/2312.15667 (2023) - [i17]Zijing Shi, Meng Fang, Shunfeng Zheng, Shilong Deng, Ling Chen, Yali Du:
Cooperation on the Fly: Exploring Language Agents for Ad Hoc Teamwork in the Avalon Game. CoRR abs/2312.17515 (2023)
- 2022
- [j5]Elizabeth Black, Martim Brandão, Oana Cocarascu, Bart de Keijzer, Yali Du, Derek Long, Michael Luck, Peter McBurney, Albert Meroño-Peñuela, Simon Miles, Sanjay Modgil, Luc Moreau, Maria Polukarov, Odinaldo Rodrigues, Carmine Ventre:
Reasoning and interaction for social artificial intelligence. AI Commun. 35(4): 309-325 (2022) - [j4]Tianhong Dai, Yali Du, Meng Fang, Anil Anthony Bharath:
Diversity-augmented intrinsic motivation for deep reinforcement learning. Neurocomputing 468: 396-406 (2022) - [j3]Yunqiu Xu, Meng Fang, Ling Chen, Gangyan Xu, Yali Du, Chengqi Zhang:
Reinforcement Learning With Multiple Relational Attention for Solving Vehicle Routing Problems. IEEE Trans. Cybern. 52(10): 11107-11120 (2022) - [c23]Xue Yan, Yali Du, Binxin Ru, Jun Wang, Haifeng Zhang, Xu Chen:
Learning to Identify Top Elo Ratings: A Dueling Bandits Approach. AAAI 2022: 8797-8805 - [c22]Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Joey Tianyi Zhou, Chengqi Zhang:
Perceiving the World: Question-guided Reinforcement Learning for Text-based Games. ACL (1) 2022: 538-560 - [c21]Jingqing Ruan, Yali Du, Xuantang Xiong, Dengpeng Xing, Xiyun Li, Linghui Meng, Haifeng Zhang, Jun Wang, Bo Xu:
GCS: Graph-Based Coordination Strategy for Multi-Agent Reinforcement Learning. AAMAS 2022: 1128-1136 - [c20]Ilias Kazantzidis, Timothy J. Norman, Yali Du, Christopher T. Freeman:
How to Train Your Agent: Active Learning from Human Preferences and Justifications in Safety-critical Environments. AAMAS 2022: 1654-1656 - [c19]Rui Yang, Yiming Lu, Wenzhe Li, Hao Sun, Meng Fang, Yali Du, Xiu Li, Lei Han, Chongjie Zhang:
Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL. ICLR 2022 - [c18]Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang:
Scalable Model-based Policy Optimization for Decentralized Networked Systems. IROS 2022: 9019-9026 - [c17]Runze Liu, Fengshuo Bai, Yali Du, Yaodong Yang:
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning. NeurIPS 2022 - [i16]Xue Yan, Yali Du, Binxin Ru, Jun Wang, Haifeng Zhang, Xu Chen:
Learning to Identify Top Elo Ratings: A Dueling Bandits Approach. CoRR abs/2201.04480 (2022) - [i15]Jingqing Ruan, Yali Du, Xuantang Xiong, Dengpeng Xing, Xiyun Li, Linghui Meng, Haifeng Zhang, Jun Wang, Bo Xu:
GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning. CoRR abs/2201.06257 (2022) - [i14]Rui Yang, Yiming Lu, Wenzhe Li, Hao Sun, Meng Fang, Yali Du, Xiu Li, Lei Han, Chongjie Zhang:
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL. CoRR abs/2202.04478 (2022) - [i13]Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Joey Tianyi Zhou, Chengqi Zhang:
Perceiving the World: Question-guided Reinforcement Learning for Text-based Games. CoRR abs/2204.09597 (2022) - [i12]Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Yaodong Yang, Alois C. Knoll:
A Review of Safe Reinforcement Learning: Methods, Theory and Applications. CoRR abs/2205.10330 (2022) - [i11]Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang:
Fully Decentralized Model-based Policy Optimization for Networked Systems. CoRR abs/2207.06559 (2022) - [i10]Runji Lin, Ye Li, Xidong Feng, Zhaowei Zhang, Xian Hong Wu Fung, Haifeng Zhang, Jun Wang, Yali Du, Yaodong Yang:
Contextual Transformer for Offline Meta Reinforcement Learning. CoRR abs/2211.08016 (2022) - [i9]Yang Yang, Hongjian Sun, Jialei Gong, Yali Du, Di Yu:
Interpretable Dimensionality Reduction by Feature Preserving Manifold Approximation and Projection. CoRR abs/2211.09321 (2022)
- 2021
- [c16]Yali Du, Bo Liu, Vincent Moens, Ziqi Liu, Zhicheng Ren, Jun Wang, Xu Chen, Haifeng Zhang:
Learning Correlated Communication Topology in Multi-Agent Reinforcement learning. AAMAS 2021: 456-464 - [c15]Liheng Chen, Hongyi Guo, Yali Du, Fei Fang, Haifeng Zhang, Weinan Zhang, Yong Yu:
Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning. DAI 2021: 185-205 - [c14]Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Chengqi Zhang:
Generalization in Text-based Games via Hierarchical Reinforcement Learning. EMNLP (Findings) 2021: 1343-1353 - [c13]Yali Du, Xue Yan, Xu Chen, Jun Wang, Haifeng Zhang:
Estimating α-Rank from A Few Entries with Low Rank Matrix Completion. ICML 2021: 2870-2879 - [c12]David Henry Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang:
Learning in Nonzero-Sum Stochastic Games with Potentials. ICML 2021: 7688-7699 - [c11]Xiaoqiang Wang, Yali Du, Shengyu Zhu, Liangjun Ke, Zhitang Chen, Jianye Hao, Jun Wang:
Ordering-Based Causal Discovery with Reinforcement Learning. IJCAI 2021: 3566-3573 - [c10]Xu Chen, Yali Du, Long Xia, Jun Wang:
Reinforcement Recommendation with User Multi-aspect Preference. WWW 2021: 425-435 - [i8]David Mguni, Jianhong Wang, Taher Jafferjee, Nicolas Perez Nieves, Wenbin Song, Yaodong Yang, Feifei Tong, Hui Chen, Jiangcheng Zhu, Yali Du, Jun Wang:
Learning to Shape Rewards using a Game of Switching Controls. CoRR abs/2103.09159 (2021) - [i7]David Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang:
Learning in Nonzero-Sum Stochastic Games with Potentials. CoRR abs/2103.09284 (2021) - [i6]Xiaoqiang Wang, Yali Du, Shengyu Zhu, Liangjun Ke, Zhitang Chen, Jianye Hao, Jun Wang:
Ordering-Based Causal Discovery with Reinforcement Learning. CoRR abs/2105.06631 (2021) - [i5]Rui Yang, Meng Fang, Lei Han, Yali Du, Feng Luo, Xiu Li:
MHER: Model-based Hindsight Experience Replay. CoRR abs/2107.00306 (2021) - [i4]Zhijian Duan, Yali Du, Jun Wang, Xiaotie Deng:
Learning to Compute Approximate Nash Equilibrium for Normal-form Games. CoRR abs/2108.07472 (2021) - [i3]Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Chengqi Zhang:
Generalization in Text-based Games via Hierarchical Reinforcement Learning. CoRR abs/2109.09968 (2021)
- 2020
- [c9]Yifan Zhao, Gangyan Xu, Yali Du, Meng Fang:
Learning Multi-Agent Communication with Policy Fingerprints for Adaptive Traffic Signal Control. CASE 2020: 266-273 - [c8]Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Joey Tianyi Zhou, Chengqi Zhang:
Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games. NeurIPS 2020 - [i2]Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Joey Tianyi Zhou, Chengqi Zhang:
Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games. CoRR abs/2010.11655 (2020)
2010 – 2019
- 2019
- [b1]Yali Du:
Design and evaluation of factorization-based algorithms for user preference analysis. University of Technology Sydney, Australia, 2019 - [j2]Yali Du, Meng Fang, Jinfeng Yi, Chang Xu, Jun Cheng, Dacheng Tao:
Enhancing the Robustness of Neural Collaborative Filtering Systems Under Malicious Attacks. IEEE Trans. Multim. 21(3): 555-565 (2019) - [c7]Lei Han, Peng Sun, Yali Du, Jiechao Xiong, Qing Wang, Xinghai Sun, Han Liu, Tong Zhang:
Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI. ICML 2019: 2576-2585 - [c6]Xun Wang, Yali Du, Leimin Zhang, Xirong Li, Miao Zhang, Jianfeng Dong:
Exploring Content-based Video Relevance for Video Click-Through Rate Prediction. ACM Multimedia 2019: 2602-2606 - [c5]Yali Du, Lei Han, Meng Fang, Ji Liu, Tianhong Dai, Dacheng Tao:
LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning. NeurIPS 2019: 4405-4416 - [c4]Meng Fang, Tianyi Zhou, Yali Du, Lei Han, Zhengyou Zhang:
Curriculum-guided Hindsight Experience Replay. NeurIPS 2019: 12602-12613
- 2018
- [j1]Yali Du, Chang Xu, Dacheng Tao:
Matrix Factorization for Collaborative Budget Allocation. IEEE Trans Autom. Sci. Eng. 15(4): 1471-1482 (2018) - [c3]Yali Du, Meng Fang, Jinfeng Yi, Jun Cheng, Dacheng Tao:
Towards Query Efficient Black-box Attacks: An Input-free Perspective. AISec@CCS 2018: 13-24 - [i1]Yali Du, Meng Fang, Jinfeng Yi, Jun Cheng, Dacheng Tao:
Towards Query Efficient Black-box Attacks: An Input-free Perspective. CoRR abs/1809.02918 (2018)
- 2017
- [c2]Yali Du, Chang Xu, Dacheng Tao:
Privileged Matrix Factorization for Collaborative Filtering. IJCAI 2017: 1610-1616 - [c1]Yali Du, Chang Xu, Dacheng Tao:
Collaborative Rating Allocation. IJCAI 2017: 1617-1623