Yaodong Yang 0001

杨耀东

Person information

unicode name: 杨耀东
affiliation: Peking University, Institute for AI, Beijing, China
affiliation (former): King's College London, UK
affiliation (former): Huawei Technologies, Noah's Ark Lab, UK
affiliation (PhD): University College London, UK

2020 – today

2024
[j14]
Yuanpei Chen, Yiran Geng, Fangwei Zhong, Jiaming Ji, Jiechuang Jiang, Zongqing Lu, Hao Dong, Yaodong Yang:
Bi-DexHands: Towards Human-Level Bimanual Dexterous Manipulation. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 2804-2818 (2024)
[j13]
Chenguang Wang, Zhouliang Yu, Stephen McAleer, Tianshu Yu, Yaodong Yang:
ASP: Learn a Universal Neural Solver! IEEE Trans. Pattern Anal. Mach. Intell. 46(6): 4102-4114 (2024)
[j12]
Yuyang Li, Bo Liu, Yiran Geng, Puhao Li, Yaodong Yang, Yixin Zhu, Tengyu Liu, Siyuan Huang:
Grasp Multiple Objects With One Hand. IEEE Robotics Autom. Lett. 9(5): 4027-4034 (2024)
[j11]
Yang Li, Fanglei Sun, Jingchen Hu, Chang Liu, Fan Wu, Kai Li, Ying Wen, Zheng Tian, Yaodong Yang, Jiangcheng Zhu, Zhifeng Chen, Jun Wang, Yang Yang:
Self-Supervised MAFENN for Classifying Low-Labeled Distorted Images Over Mobile Fading Channels. IEEE Trans. Mob. Comput. 23(8): 8077-8091 (2024)
[c61]
Yinmin Zhang, Jie Liu, Chuming Li, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang:
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning. AAAI 2024: 16908-16916
[c60]
Sirui Chen, Zhaowei Zhang, Yaodong Yang, Yali Du:
STAS: Spatial-Temporal Return Decomposition for Solving Sparse Rewards Problems in Multi-agent Reinforcement Learning. AAAI 2024: 17337-17345
[c59]
Ceyao Zhang, Kaijie Yang, Siyi Hu, Zihao Wang, Guanghe Li, Yihang Sun, Cheng Zhang, Zhaowei Zhang, Anji Liu, Song-Chun Zhu, Xiaojun Chang, Junge Zhang, Feng Yin, Yitao Liang, Yaodong Yang:
ProAgent: Building Proactive Cooperative Agents with Large Language Models. AAAI 2024: 17591-17599
[c58]
Le Cong Dinh, David Henry Mguni, Long Tran-Thanh, Jun Wang, Yaodong Yang:
A Summary of Online Markov Decision Processes with Non-oblivious Strategic Adversary. AAMAS 2024: 2830-2832
[i104]
Siyuan Qi, Shuo Chen, Yexin Li, Xiangyu Kong, Junqi Wang, Bangcheng Yang, Pring Wong, Yifan Zhong, Xiaoyuan Zhang, Zhaowei Zhang, Nian Liu, Wei Wang, Yaodong Yang, Song-Chun Zhu:
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents. CoRR abs/2401.10568 (2024)
[i103]
Yifan Zhong, Chengdong Ma, Xiaoyuan Zhang, Ziran Yang, Qingfu Zhang, Siyuan Qi, Yaodong Yang:
Panacea: Pareto Alignment via Preference Adaptation for LLMs. CoRR abs/2402.02030 (2024)
[i102]
Jiaming Ji, Boyuan Chen, Hantao Lou, Donghai Hong, Borong Zhang, Xuehai Pan, Juntao Dai, Yaodong Yang:
Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction. CoRR abs/2402.02416 (2024)
[i101]
Tianyi Qiu, Fanzhi Zeng, Jiaming Ji, Dong Yan, Kaile Wang, Jiayi Zhou, Han Yang, Josef Dai, Xuehai Pan, Yaodong Yang:
Rethinking Information Structures in RLHF: Reward Generalization from a Graph Theory Perspective. CoRR abs/2402.10184 (2024)
[i100]
Zhaowei Zhang, Fengshuo Bai, Mingzhi Wang, Haoyang Ye, Chengdong Ma, Yaodong Yang:
Incentive Compatibility for AI Alignment in Sociotechnical Systems: Positions and Prospects. CoRR abs/2402.12907 (2024)
[i99]
Naming Liu, Mingzhi Wang, Youzhi Zhang, Yaodong Yang, Bo An, Ying Wen:
Leveraging Team Correlation for Approximating Equilibrium in Two-Team Zero-Sum Games. CoRR abs/2403.00255 (2024)
[i98]
Tianhao Wu, Yunchong Gan, Mingdong Wu, Jingbo Cheng, Yaodong Yang, Yixin Zhu, Hao Dong:
UniDexFPM: Universal Dexterous Functional Pre-grasp Manipulation Via Diffusion Policy. CoRR abs/2403.12421 (2024)
[i97]
Jieming Cui, Tengyu Liu, Nian Liu, Yaodong Yang, Yixin Zhu, Siyuan Huang:
AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents. CoRR abs/2403.12835 (2024)
[i96]
Zhiyu Zhao, Ning Yang, Xue Yan, Haifeng Zhang, Jun Wang, Yaodong Yang:
Correlated Mean Field Imitation Learning. CoRR abs/2404.09324 (2024)
[i95]
Fengshuo Bai, Rui Zhao, Hongming Zhang, Sijia Cui, Ying Wen, Yaodong Yang, Bo Xu, Lei Han:
Efficient Preference-based Reinforcement Learning via Aligned Experience Estimation. CoRR abs/2405.18688 (2024)
[i94]
Fengshuo Bai, Mingzhi Wang, Zhaowei Zhang, Boyuan Chen, Yinda Xu, Ying Wen, Yaodong Yang:
Efficient Model-agnostic Alignment via Bayesian Persuasion. CoRR abs/2405.18718 (2024)
[i93]
Jiesong Lian, Yucong Huang, Mingzhi Wang, Chengdong Ma, Yixue Hao, Ying Wen, Yaodong Yang:
Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles. CoRR abs/2405.21027 (2024)
[i92]
Jiaming Ji, Kaile Wang, Tianyi Qiu, Boyuan Chen, Jiayi Zhou, Changye Li, Hantao Lou, Yaodong Yang:
Language Models Resist Alignment. CoRR abs/2406.06144 (2024)
[i91]
Yizhe Huang, Anji Liu, Fanqi Kong, Yaodong Yang, Song-Chun Zhu, Xue Feng:
Efficient Adaptation in Mixed-Motive Environments via Hierarchical Opponent Modeling and Planning. CoRR abs/2406.08002 (2024)
[i90]
Josef Dai, Tianle Chen, Xuyao Wang, Ziran Yang, Taiye Chen, Jiaming Ji, Yaodong Yang:
SafeSora: Towards Safety Alignment of Text2Video Generation via a Human Preference Dataset. CoRR abs/2406.14477 (2024)
[i89]
Jiaming Ji, Donghai Hong, Borong Zhang, Boyuan Chen, Josef Dai, Boren Zheng, Tianyi Qiu, Boxun Li, Yaodong Yang:
PKU-SafeRLHF: A Safety Alignment Preference Dataset for Llama Family Models. CoRR abs/2406.15513 (2024)
[i88]
Tianyi Qiu, Yang Zhang, Xuchuan Huang, Jasmine Xinze Li, Jiaming Ji, Yaodong Yang:
ProgressGym: Alignment with a Millennium of Moral Progress. CoRR abs/2406.20087 (2024)
2023
[j10]
Le Cong Dinh, David Henry Mguni, Long Tran-Thanh, Jun Wang, Yaodong Yang:
Online Markov decision processes with non-oblivious strategic adversary. Auton. Agents Multi Agent Syst. 37(1): 15 (2023)
[j9]
Shangding Gu, Jakub Grudzien Kuba, Yuanpei Chen, Yali Du, Long Yang, Alois C. Knoll, Yaodong Yang:
Safe multi-agent reinforcement learning for multi-robot control. Artif. Intell. 319: 103905 (2023)
[j8]
Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Hai-Feng Zhang, Weinan Zhang:
Large sequence models for sequential decision-making: a survey. Frontiers Comput. Sci. 17(6): 176349 (2023)
[j7]
Linghui Meng, Muning Wen, Chenyang Le, Xiyun Li, Dengpeng Xing, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Yaodong Yang, Bo Xu:
Offline Pre-trained Multi-agent Decision Transformer. Mach. Intell. Res. 20(2): 233-248 (2023)
[j6]
Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Yong Yu, Jun Wang, Weinan Zhang:
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning. J. Mach. Learn. Res. 24: 150:1-150:12 (2023)
[c57]
Chuming Li, Jie Liu, Yinmin Zhang, Yuhong Wei, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang:
ACE: Cooperative Multi-Agent Q-learning with Bidirectional Action-Dependency. AAAI 2023: 8536-8544
[c56]
David Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez Nieves, Wenbin Song, Feifei Tong, Matthew E. Taylor, Tianpei Yang, Zipeng Dai, Hui Chen, Jiangcheng Zhu, Kun Shao, Jun Wang, Yaodong Yang:
Learning to Shape Rewards Using a Game of Two Partners. AAAI 2023: 11604-11612
[c55]
Pei Xu, Junge Zhang, Qiyue Yin, Chao Yu, Yaodong Yang, Kaiqi Huang:
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks. AAAI 2023: 11717-11725
[c54]
Zhijian Duan, Wenhan Huang, Dinghuai Zhang, Yali Du, Jun Wang, Yaodong Yang, Xiaotie Deng:
Is Nash Equilibrium Approximator Learnable? AAMAS 2023: 233-241
[c53]
Binghao Huang, Yuanpei Chen, Tianyu Wang, Yuzhe Qin, Yaodong Yang, Nikolay Atanasov, Xiaolong Wang:
Dynamic Handover: Throw and Catch with Bimanual Hands. CoRL 2023: 1887-1902
[c52]
Chuming Li, Ruonan Jia, Jie Liu, Yinmin Zhang, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang:
Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning. ECAI 2023: 1381-1388
[c51]
Weikang Wan, Haoran Geng, Yun Liu, Zikang Shan, Yaodong Yang, Li Yi, He Wang:
UniDexGrasp++: Improving Dexterous Grasping Policy Learning via Geometry-aware Curriculum and Iterative Generalist-Specialist Learning. ICCV 2023: 3868-3879
[c50]
Shuang Wu, Jian Yao, Haobo Fu, Ye Tian, Chao Qian, Yaodong Yang, Qiang Fu, Wei Yang:
Quality-Similar Diversity via Population Based Reinforcement Learning. ICLR 2023
[c49]
David Henry Mguni, Haojun Chen, Taher Jafferjee, Jianhong Wang, Longfei Yue, Xidong Feng, Stephen Marcus McAleer, Feifei Tong, Jun Wang, Yaodong Yang:
MANSA: Learning Fast and Slow in Multi-Agent Systems. ICML 2023: 24631-24658
[c48]
Oliver Slumbers, David Henry Mguni, Stefano B. Blumberg, Stephen Marcus McAleer, Yaodong Yang, Jun Wang:
A Game-Theoretic Framework for Managing Risk in Multi-Agent Systems. ICML 2023: 32059-32087
[c47]
Xiaohang Tang, Le Cong Dinh, Stephen Marcus McAleer, Yaodong Yang:
Regret-Minimizing Double Oracle for Extensive-Form Games. ICML 2023: 33599-33615
[c46]
Hanjing Wang, Man-Kit Sit, Congjie He, Ying Wen, Weinan Zhang, Jun Wang, Yaodong Yang, Luo Mai:
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models. ICML 2023: 36380-36390
[c45]
Yiran Geng, Boshi An, Haoran Geng, Yuanpei Chen, Yaodong Yang, Hao Dong:
RLAfford: End-to-End Affordance Learning for Robotic Manipulation. ICRA 2023: 5880-5886
[c44]
Puhao Li, Tengyu Liu, Yuyang Li, Yiran Geng, Yixin Zhu, Yaodong Yang, Siyuan Huang:
GenDexGrasp: Generalizable Dexterous Grasping. ICRA 2023: 8068-8074
[c43]
Jiaming Ji, Mickel Liu, Josef Dai, Xuehai Pan, Chi Zhang, Ce Bian, Boyuan Chen, Ruiyang Sun, Yizhou Wang, Yaodong Yang:
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset. NeurIPS 2023
[c42]
Jiaming Ji, Borong Zhang, Jiayi Zhou, Xuehai Pan, Weidong Huang, Ruiyang Sun, Yiran Geng, Yifan Zhong, Josef Dai, Yaodong Yang:
Safety Gymnasium: A Unified Safe Reinforcement Learning Benchmark. NeurIPS 2023
[c41]
Stephen McAleer, Gabriele Farina, Gaoyue Zhou, Mingzhi Wang, Yaodong Yang, Tuomas Sandholm:
Team-PSRO for Learning Approximate TMECor in Large Team Games via Cooperative Reinforcement Learning. NeurIPS 2023
[c40]
Mingyu Yang, Yaodong Yang, Zhenbo Lu, Wengang Zhou, Houqiang Li:
Hierarchical Multi-Agent Skill Discovery. NeurIPS 2023
[c39]
Jian Yao, Weiming Liu, Haobo Fu, Yaodong Yang, Stephen McAleer, Qiang Fu, Wei Yang:
Policy Space Diversity for Non-Transitive Games. NeurIPS 2023
[c38]
Youpeng Zhao, Yaodong Yang, Zhenbo Lu, Wengang Zhou, Houqiang Li:
Multi-Agent First Order Constrained Optimization in Policy Space. NeurIPS 2023
[c37]
Huanzhou Zhu, Bo Zhao, Gang Chen, Weifeng Chen, Yijie Chen, Liang Shi, Yaodong Yang, Peter R. Pietzuch, Lei Chen:
MSRL: Distributed Reinforcement Learning with Dataflow Fragments. USENIX ATC 2023: 977-993
[i87]
David Mguni, Taher Jafferjee, Haojun Chen, Jianhong Wang, Long Fei, Xidong Feng, Stephen McAleer, Feifei Tong, Jun Wang, Yaodong Yang:
MANSA: Learning Fast and Slow in Multi-Agent Systems. CoRR abs/2302.05910 (2023)
[i86]
Shangding Gu, Alap Kshirsagar, Yali Du, Guang Chen, Yaodong Yang, Jan Peters, Alois C. Knoll:
A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors. CoRR abs/2302.13137 (2023)
[i85]
Chenguang Wang, Zhouliang Yu, Stephen McAleer, Tianshu Yu, Yaodong Yang:
ASP: Learn a Universal Neural Solver! CoRR abs/2303.00466 (2023)
[i84]
Weikang Wan, Haoran Geng, Yun Liu, Zikang Shan, Yaodong Yang, Li Yi, He Wang:
UniDexGrasp++: Improving Dexterous Grasping Policy Learning via Geometry-aware Curriculum and Iterative Generalist-Specialist Learning. CoRR abs/2304.00464 (2023)
[i83]
Sirui Chen, Zhaowei Zhang, Yali Du, Yaodong Yang:
STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning. CoRR abs/2304.07520 (2023)
[i82]
Yifan Zhong, Jakub Grudzien Kuba, Siyi Hu, Jiaming Ji, Yaodong Yang:
Heterogeneous-Agent Reinforcement Learning. CoRR abs/2304.09870 (2023)
[i81]
Xiaohang Tang, Le Cong Dinh, Stephen Marcus McAleer, Yaodong Yang:
Regret-Minimizing Double Oracle for Extensive-Form Games. CoRR abs/2304.10498 (2023)
[i80]
Jiaming Ji, Jiayi Zhou, Borong Zhang, Juntao Dai, Xuehai Pan, Ruiyang Sun, Weidong Huang, Yiran Geng, Mickel Liu, Yaodong Yang:
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research. CoRR abs/2305.09304 (2023)
[i79]
Simin Li, Jun Guo, Jingqiao Xiu, Xini Yu, Jiakai Wang, Aishan Liu, Yaodong Yang, Xianglong Liu:
Byzantine Robust Cooperative Multi-Agent Reinforcement Learning as a Bayesian Game. CoRR abs/2305.12872 (2023)
[i78]
Zhaowei Zhang, Nian Liu, Siyuan Qi, Ceyao Zhang, Ziqi Rong, Song-Chun Zhu, Shuguang Cui, Yaodong Yang:
Heterogeneous Value Evaluation for Large Language Models. CoRR abs/2305.17147 (2023)
[i77]
Yonggang Jin, Chenxu Wang, Liuyu Xiang, Yaodong Yang, Jie Fu, Zhaofeng He:
Deep Reinforcement Learning with Multitask Episodic Memory Based on Task-Conditioned Hypernetwork. CoRR abs/2306.10698 (2023)
[i76]
Jiarong Liu, Yifan Zhong, Siyi Hu, Haobo Fu, Qiang Fu, Xiaojun Chang, Yaodong Yang:
Maximum Entropy Heterogeneous-Agent Mirror Learning. CoRR abs/2306.10715 (2023)
[i75]
Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Haifeng Zhang, Weinan Zhang:
Large Sequence Models for Sequential Decision-Making: A Survey. CoRR abs/2306.13945 (2023)
[i74]
Jian Yao, Weiming Liu, Haobo Fu, Yaodong Yang, Stephen McAleer, Qiang Fu, Wei Yang:
Policy Space Diversity for Non-Transitive Games. CoRR abs/2306.16884 (2023)
[i73]
Jiaming Ji, Mickel Liu, Juntao Dai, Xuehai Pan, Chi Zhang, Ce Bian, Boyuan Zhang, Ruiyang Sun, Yizhou Wang, Yaodong Yang:
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset. CoRR abs/2307.04657 (2023)
[i72]
Weidong Huang, Jiaming Ji, Borong Zhang, Chunhe Xia, Yaodong Yang:
Safe DreamerV3: Safe Reinforcement Learning with World Models. CoRR abs/2307.07176 (2023)
[i71]
Chuming Li, Ruonan Jia, Jie Liu, Yinmin Zhang, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang:
Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning. CoRR abs/2307.12933 (2023)
[i70]
Yang Li, Kun Xiong, Yingping Zhang, Jiangcheng Zhu, Stephen McAleer, Wei Pan, Jun Wang, Zonghong Dai, Yaodong Yang:
JiangJun: Mastering Xiangqi by Tackling Non-Transitivity in Two-Player Zero-Sum Games. CoRR abs/2308.04719 (2023)
[i69]
Ceyao Zhang, Kaijie Yang, Siyi Hu, Zihao Wang, Guanghe Li, Yihang Sun, Cheng Zhang, Zhaowei Zhang, Anji Liu, Song-Chun Zhu, Xiaojun Chang, Junge Zhang, Feng Yin, Yitao Liang, Yaodong Yang:
ProAgent: Building Proactive Cooperative AI with Large Language Models. CoRR abs/2308.11339 (2023)
[i68]
  dblp key:
  - journals/corr/abs-2308-15116
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-15116
Jingbang Chen, Yian Wang, Xingwei Qu, Shuangjia Zheng, Yaodong Yang, Hao Dong, Jie Fu:
Mixup-Augmented Meta-Learning for Sample-Efficient Fine-Tuning of Protein Simulators. CoRR abs/2308.15116 (2023)
[i67]
  dblp key:
  - journals/corr/abs-2309-05655
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-05655
Binghao Huang, Yuanpei Chen, Tianyu Wang, Yuzhe Qin, Yaodong Yang, Nikolay Atanasov, Xiaolong Wang:
Dynamic Handover: Throw and Catch with Bimanual Hands. CoRR abs/2309.05655 (2023)
[i66]
  dblp key:
  - journals/corr/abs-2310-00322
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-00322
Chengdong Ma, Ziran Yang, Minquan Gao, Hai Ci, Jun Gao, Xuehai Pan, Yaodong Yang:
Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models. CoRR abs/2310.00322 (2023)
[i65]
  dblp key:
  - journals/corr/abs-2310-00378
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-00378
Zhaowei Zhang, Fengshuo Bai, Jun Gao, Yaodong Yang:
Measuring Value Understanding in Language Models through Discriminator-Critique Gap. CoRR abs/2310.00378 (2023)
[i64]
  dblp key:
  - journals/corr/abs-2310-05205
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-05205
Hanjing Wang, Man-Kit Sit, Congjie He, Ying Wen, Weinan Zhang, Jun Wang, Yaodong Yang, Luo Mai:
GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models. CoRR abs/2310.05205 (2023)
[i63]
  dblp key:
  - journals/corr/abs-2310-09833
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-09833
Simin Li, Ruixiao Xu, Jun Guo, Pu Feng, Jiakai Wang, Aishan Liu, Yaodong Yang, Xianglong Liu, Weifeng Lv:
MIR2: Towards Provably Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization. CoRR abs/2310.09833 (2023)
[i62]
  dblp key:
  - journals/corr/abs-2310-11846
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-11846
Jie Liu, Yinmin Zhang, Chuming Li, Chao Yang, Yaodong Yang, Yu Liu, Wanli Ouyang:
Masked Pretraining for Multi-Agent Decision Making. CoRR abs/2310.11846 (2023)
[i61]
  dblp key:
  - journals/corr/abs-2310-12567
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-12567
Jiaming Ji, Borong Zhang, Jiayi Zhou, Xuehai Pan, Weidong Huang, Ruiyang Sun, Yiran Geng, Yifan Zhong, Juntao Dai, Yaodong Yang:
Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark. CoRR abs/2310.12567 (2023)
[i60]
  dblp key:
  - journals/corr/abs-2310-12773
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-12773
Josef Dai, Xuehai Pan, Ruiyang Sun, Jiaming Ji, Xinbo Xu, Mickel Liu, Yizhou Wang, Yaodong Yang:
Safe RLHF: Safe Reinforcement Learning from Human Feedback. CoRR abs/2310.12773 (2023)
[i59]
  dblp key:
  - journals/corr/abs-2310-15599
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-15599
Yuyang Li, Bo Liu, Yiran Geng, Puhao Li, Yaodong Yang, Yixin Zhu, Tengyu Liu, Siyuan Huang:
Grasp Multiple Objects with One Hand. CoRR abs/2310.15599 (2023)
[i58]
  dblp key:
  - journals/corr/abs-2310-19852
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-19852
Jiaming Ji, Tianyi Qiu, Boyuan Chen, Borong Zhang, Hantao Lou, Kaile Wang, Yawen Duan, Zhonghao He, Jiayi Zhou, Zhaowei Zhang, Fanzhi Zeng, Kwan Yee Ng, Juntao Dai, Xuehai Pan, Aidan O'Gara, Yingshan Lei, Hua Xu, Brian Tse, Jie Fu, Stephen McAleer, Yaodong Yang, Yizhou Wang, Song-Chun Zhu, Yike Guo, Wen Gao:
AI Alignment: A Comprehensive Survey. CoRR abs/2310.19852 (2023)
[i57]
  dblp key:
  - journals/corr/abs-2311-05997
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-05997
Zihao Wang, Shaofei Cai, Anji Liu, Yonggang Jin, Jinbing Hou, Bowei Zhang, Haowei Lin, Zhaofeng He, Zilong Zheng, Yaodong Yang, Xiaojian Ma, Yitao Liang:
JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models. CoRR abs/2311.05997 (2023)
[i56]
  dblp key:
  - journals/corr/abs-2312-07685
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-07685
Yinmin Zhang, Jie Liu, Chuming Li, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang:
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning. CoRR abs/2312.07685 (2023)
2022
[j5]
  dblp key:
  - journals/algorithms/SanjayaWY22
  persistent URL:
  - https://dblp.org/rec/journals/algorithms/SanjayaWY22
Ricky Sanjaya, Jun Wang, Yaodong Yang:
Measuring the Non-Transitivity in Chess. Algorithms 15(5): 152 (2022)
[j4]
  dblp key:
  - journals/jossac/ZengZLY22
  persistent URL:
  - https://dblp.org/rec/journals/jossac/ZengZLY22
Qingduo Zeng, Qiang Zhang, Shancun Liu, Yaodong Yang:
Illiquidity Comovement and Market Crisis. J. Syst. Sci. Complex. 35(5): 1863-1874 (2022)
[j3]
  dblp key:
  - journals/tmlr/DinhMTNSMWBY22
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/DinhMTNSMWBY22
Le Cong Dinh, Stephen Marcus McAleer, Zheng Tian, Nicolas Perez Nieves, Oliver Slumbers, David Henry Mguni, Jun Wang, Haitham Bou-Ammar, Yaodong Yang:
Online Double Oracle. Trans. Mach. Learn. Res. 2022 (2022)
[c36]
  dblp key:
  - conf/aaai/TangMHCGLYML0TW22
  persistent URL:
  - https://dblp.org/rec/conf/aaai/TangMHCGLYML0TW22
Hongyao Tang, Zhaopeng Meng, Jianye Hao, Chen Chen, Daniel Graves, Dong Li, Changmin Yu, Hangyu Mao, Wulong Liu, Yaodong Yang, Wenyuan Tao, Li Wang:
What about Inputting Policy in Value Function: Policy Representation and Policy-Extended Value Function Approximator. AAAI 2022: 8441-8449
[c35]
  dblp key:
  - conf/dai2/WenCYLTCW22
  persistent URL:
  - https://dblp.org/rec/conf/dai2/WenCYLTCW22
Ying Wen, Hui Chen, Yaodong Yang, Minne Li, Zheng Tian, Xu Chen, Jun Wang:
A Game-Theoretic Approach to Multi-agent Trust Region Optimization. DAI 2022: 74-87
[c34]
  dblp key:
  - conf/iclr/KubaCWWSW022
  persistent URL:
  - https://dblp.org/rec/conf/iclr/KubaCWWSW022
Jakub Grudzien Kuba, Ruiqing Chen, Muning Wen, Ying Wen, Fanglei Sun, Jun Wang, Yaodong Yang:
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning. ICLR 2022
[c33]
  dblp key:
  - conf/iclr/MguniJWNSTLZ0W22
  persistent URL:
  - https://dblp.org/rec/conf/iclr/MguniJWNSTLZ0W22
David Henry Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez Nieves, Oliver Slumbers, Feifei Tong, Yang Li, Jiangcheng Zhu, Yaodong Yang, Jun Wang:
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning. ICLR 2022
[c32]
  dblp key:
  - conf/ijcai/0001DLM0Y022
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/0001DLM0Y022
Yurong Chen, Xiaotie Deng, Chenchen Li, David Mguni, Jun Wang, Xiang Yan, Yaodong Yang:
On the Convergence of Fictitious Play: A Decomposition Approach. IJCAI 2022: 179-185
[c31]
  dblp key:
  - conf/iros/DuMLLDW022
  persistent URL:
  - https://dblp.org/rec/conf/iros/DuMLLDW022
Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang:
Scalable Model-based Policy Optimization for Decentralized Networked Systems. IROS 2022: 9019-9026
[c30]
  dblp key:
  - conf/nips/0001CWHHH22
  persistent URL:
  - https://dblp.org/rec/conf/nips/0001CWHHH22
Yaodong Yang, Guangyong Chen, Weixun Wang, Xiaotian Hao, Jianye Hao, Pheng-Ann Heng:
Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing. NeurIPS 2022
[c29]
  dblp key:
  - conf/nips/0039FRMZ00022
  persistent URL:
  - https://dblp.org/rec/conf/nips/0039FRMZ00022
Bo Liu, Xidong Feng, Jie Ren, Luo Mai, Rui Zhu, Haifeng Zhang, Jun Wang, Yaodong Yang:
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning. NeurIPS 2022
[c28]
  dblp key:
  - conf/nips/ChenWWFJLMDZY22
  persistent URL:
  - https://dblp.org/rec/conf/nips/ChenWWFJLMDZY22
Yuanpei Chen, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuan Jiang, Zongqing Lu, Stephen McAleer, Hao Dong, Song-Chun Zhu, Yaodong Yang:
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning. NeurIPS 2022
[c27]
  dblp key:
  - conf/nips/LiuBD022
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuBD022
Runze Liu, Fengshuo Bai, Yali Du, Yaodong Yang:
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning. NeurIPS 2022
[c26]
  dblp key:
  - conf/nips/PanLZ0Z022
  persistent URL:
  - https://dblp.org/rec/conf/nips/PanLZ0Z022
Xuehai Pan, Mickel Liu, Fangwei Zhong, Yaodong Yang, Song-Chun Zhu, Yizhou Wang:
MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control. NeurIPS 2022
[c25]
  dblp key:
  - conf/nips/WenKL000022
  persistent URL:
  - https://dblp.org/rec/conf/nips/WenKL000022
Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang:
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem. NeurIPS 2022
[c24]
  dblp key:
  - conf/nips/YangJDZZL0022
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangJDZZL0022
Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan:
Constrained Update Projection Approach to Safe Policy Optimization. NeurIPS 2022
[c23]
  dblp key:
  - conf/wise/ZhuSWYX22
  persistent URL:
  - https://dblp.org/rec/conf/wise/ZhuSWYX22
Zhitao Zhu, Shijing Si, Jianzong Wang, Yaodong Yang, Jing Xiao:
Debias the Black-Box: A Fair Ranking Framework via Knowledge Distillation. WISE 2022: 395-405
[i55]
  dblp key:
  - journals/corr/abs-2202-00633
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-00633
Ming Zhou, Jingxiao Chen, Ying Wen, Weinan Zhang, Yaodong Yang, Yong Yu:
Efficient Policy Space Response Oracles. CoRR abs/2202.00633 (2022)
[i54]
  dblp key:
  - journals/corr/abs-2202-04862
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-04862
Juliusz Krysztof Ziomek, Jun Wang, Yaodong Yang:
Settling the Communication Complexity for Distributed Offline Reinforcement Learning. CoRR abs/2202.04862 (2022)
[i53]
  dblp key:
  - journals/corr/abs-2202-04868
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-04868
Zehao Dou, Jakub Grudzien Kuba, Yaodong Yang:
Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2202.04868 (2022)
[i52]
  dblp key:
  - journals/corr/abs-2205-01469
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-01469
Yurong Chen, Xiaotie Deng, Chenchen Li, David Mguni, Jun Wang, Xiang Yan, Yaodong Yang:
On the Convergence of Fictitious Play: A Decomposition Approach. CoRR abs/2205.01469 (2022)
[i51]
  dblp key:
  - journals/corr/abs-2205-10330
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-10330
Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Yaodong Yang, Alois C. Knoll:
A Review of Safe Reinforcement Learning: Methods, Theory and Applications. CoRR abs/2205.10330 (2022)
[i50]
  dblp key:
  - journals/corr/abs-2205-14953
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-14953
Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang:
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem. CoRR abs/2205.14953 (2022)
[i49]
  dblp key:
  - journals/corr/abs-2205-15434
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-15434
Oliver Slumbers, David Henry Mguni, Stephen McAleer, Jun Wang, Yaodong Yang:
Learning Risk-Averse Equilibria in Multi-Agent Systems. CoRR abs/2205.15434 (2022)
[i48]
  dblp key:
  - journals/corr/abs-2206-08686
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-08686
Yuanpei Chen, Yaodong Yang, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuang Jiang, Stephen Marcus McAleer, Hao Dong, Zongqing Lu, Song-Chun Zhu:
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning. CoRR abs/2206.08686 (2022)
[i47]
  dblp key:
  - journals/corr/abs-2207-06559
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-06559
Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang:
Fully Decentralized Model-based Policy Optimization for Networked Systems. CoRR abs/2207.06559 (2022)
[i46]
  dblp key:
  - journals/corr/abs-2208-01682
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-01682
Jakub Grudzien Kuba, Xidong Feng, Shiyao Ding, Hao Dong, Jun Wang, Yaodong Yang:
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL. CoRR abs/2208.01682 (2022)
[i45]
  dblp key:
  - journals/corr/abs-2208-11628
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-11628
Zhitao Zhu, Shijing Si, Jianzong Wang, Yaodong Yang, Jing Xiao:
Debias the Black-box: A Fair Ranking Framework via Knowledge Distillation. CoRR abs/2208.11628 (2022)
[i44]
  dblp key:
  - journals/corr/abs-2209-07089
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-07089
Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan:
Constrained Update Projection Approach to Safe Policy Optimization. CoRR abs/2209.07089 (2022)
[i43]
  dblp key:
  - journals/corr/abs-2209-12941
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-12941
Yiran Geng, Boshi An, Haoran Geng, Yuanpei Chen, Yaodong Yang, Hao Dong:
End-to-End Affordance Learning for Robotic Manipulation. CoRR abs/2209.12941 (2022)
[i42]
  dblp key:
  - journals/corr/abs-2210-00722
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-00722
Puhao Li, Tengyu Liu, Yuyang Li, Yiran Geng, Yixin Zhu, Yaodong Yang, Siyuan Huang:
GenDexGrasp: Generalizable Dexterous Grasping. CoRR abs/2210.00722 (2022)
[i41]
  dblp key:
  - journals/corr/abs-2210-00882
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-00882
Huanzhou Zhu, Bo Zhao, Gang Chen, Weifeng Chen, Yijie Chen, Liang Shi, Yaodong Yang, Peter R. Pietzuch, Lei Chen:
MSRL: Distributed Reinforcement Learning with Dataflow Fragments. CoRR abs/2210.00882 (2022)
[i40]
  dblp key:
  - journals/corr/abs-2210-13708
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-13708
Siyi Hu, Yifan Zhong, Minquan Gao, Weixun Wang, Hao Dong, Zhihui Li, Xiaodan Liang, Xiaojun Chang, Yaodong Yang:
MARLlib: Extending RLlib for Multi-agent Reinforcement Learning. CoRR abs/2210.13708 (2022)
[i39]
  dblp key:
  - journals/corr/abs-2211-06934
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-06934
Jie Ren, Xidong Feng, Bo Liu, Xuehai Pan, Yao Fu, Luo Mai, Yaodong Yang:
TorchOpt: An Efficient Library for Differentiable Optimization. CoRR abs/2211.06934 (2022)
[i38]
  dblp key:
  - journals/corr/abs-2211-08016
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-08016
Runji Lin, Ye Li, Xidong Feng, Zhaowei Zhang, Xian Hong Wu Fung, Haifeng Zhang, Jun Wang, Yali Du, Yaodong Yang:
Contextual Transformer for Offline Meta Reinforcement Learning. CoRR abs/2211.08016 (2022)
[i37]
  dblp key:
  - journals/corr/abs-2211-16068
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-16068
Chuming Li, Jie Liu, Yinmin Zhang, Yuhong Wei, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang:
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency. CoRR abs/2211.16068 (2022)
2021
[b1]
  dblp key:
  - phd/ethos/Yang21a
  persistent URL:
  - https://dblp.org/rec/phd/ethos/Yang21a
Yaodong Yang:
Many-agent reinforcement learning. University College London (University of London), UK, 2021
[c22]
  dblp key:
  - conf/atal/YangLWSGBWT21
  persistent URL:
  - https://dblp.org/rec/conf/atal/YangLWSGBWT21
Yaodong Yang, Jun Luo, Ying Wen, Oliver Slumbers, Daniel Graves, Haitham Bou-Ammar, Jun Wang, Matthew E. Taylor:
Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems. AAMAS 2021: 51-56
[c21]
  dblp key:
  - conf/icml/MguniWDYWLWJW21
  persistent URL:
  - https://dblp.org/rec/conf/icml/MguniWDYWLWJW21
David Henry Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang:
Learning in Nonzero-Sum Stochastic Games with Potentials. ICML 2021: 7688-7699
[c20]
  dblp key:
  - conf/icml/NievesYSMWW21
  persistent URL:
  - https://dblp.org/rec/conf/icml/NievesYSMWW21
Nicolas Perez Nieves, Yaodong Yang, Oliver Slumbers, David Henry Mguni, Ying Wen, Jun Wang:
Modelling Behavioural Diversity for Learning in Open-Ended Games. ICML 2021: 8514-8524
[c19]
  dblp key:
  - conf/nips/CaggianoDWCMTPPSMHGAZJCDYSDKPHTMSSSK21
  persistent URL:
  - https://dblp.org/rec/conf/nips/CaggianoDWCMTPPSMHGAZJCDYSDKPHTMSSSK21
Vittorio Caggiano, Guillaume Durandau, Huawei Wang, Alberto Silvio Chiappa, Alexander Mathis, Pablo Tano, Nisheet Patel, Alexandre Pouget, Pierre Schumacher, Georg Martius, Daniel F. B. Haeufle, Yiran Geng, Boshi An, Yifan Zhong, Jiaming Ji, Yuanpei Chen, Hao Dong, Yaodong Yang, Rahul Siripurapu, Luis Eduardo Ferro Diez, Michael Kopp, Vihang Patil, Sepp Hochreiter, Yuval Tassa, Josh Merel, Randy Schultheis, Seungmoon Song, Massimo Sartori, Vikash Kumar:
MyoChallenge 2022: Learning contact-rich manipulation using a musculoskeletal hand. NeurIPS (Competition and Demos) 2021: 233-250
[c18]
  dblp key:
  - conf/nips/LiuJWHCFHY21
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuJWHCFHY21
Xiangyu Liu, Hangtian Jia, Ying Wen, Yujing Hu, Yingfeng Chen, Changjie Fan, Zhipeng Hu, Yaodong Yang:
Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games. NeurIPS 2021: 941-952
[c17]
  dblp key:
  - conf/nips/FengSWLMWWY21
  persistent URL:
  - https://dblp.org/rec/conf/nips/FengSWLMWWY21
Xidong Feng, Oliver Slumbers, Ziyu Wan, Bo Liu, Stephen McAleer, Ying Wen, Jun Wang, Yaodong Yang:
Neural Auto-Curricula in Two-Player Zero-Sum Games. NeurIPS 2021: 3504-3517
[c16]
  dblp key:
  - conf/nips/KubaWMGZMWY21
  persistent URL:
  - https://dblp.org/rec/conf/nips/KubaWMGZMWY21
Jakub Grudzien Kuba, Muning Wen, Linghui Meng, Shangding Gu, Haifeng Zhang, David Mguni, Jun Wang, Yaodong Yang:
Settling the Variance of Multi-Agent Policy Gradients. NeurIPS 2021: 13458-13470
[i36]
  dblp key:
  - journals/corr/abs-2102-07659
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-07659
Yaodong Yang, Jun Luo, Ying Wen, Oliver Slumbers, Daniel Graves, Haitham Bou-Ammar, Jun Wang, Matthew E. Taylor:
Diverse Auto-Curriculum is Critical for Successful Real-World Multiagent Learning Systems. CoRR abs/2102.07659 (2021)
[i35]
  dblp key:
  - journals/corr/abs-2103-07780
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-07780
Le Cong Dinh, Yaodong Yang, Zheng Tian, Nicolas Perez Nieves, Oliver Slumbers, David Henry Mguni, Haitham Bou-Ammar, Jun Wang:
Online Double Oracle. CoRR abs/2103.07780 (2021)
[i34]
  dblp key:
  - journals/corr/abs-2103-07927
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-07927
Nicolas Perez Nieves, Yaodong Yang, Oliver Slumbers, David Henry Mguni, Jun Wang:
Modelling Behavioural Diversity for Learning in Open-Ended Games. CoRR abs/2103.07927 (2021)
[i33]
  dblp key:
  - journals/corr/abs-2103-09159
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-09159
David Mguni, Jianhong Wang, Taher Jafferjee, Nicolas Perez Nieves, Wenbin Song, Yaodong Yang, Feifei Tong, Hui Chen, Jiangcheng Zhu, Yali Du, Jun Wang:
Learning to Shape Rewards using a Game of Switching Controls. CoRR abs/2103.09159 (2021)
[i32]
  dblp key:
  - journals/corr/abs-2103-09284
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-09284
David Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang:
Learning in Nonzero-Sum Stochastic Games with Potentials. CoRR abs/2103.09284 (2021)
[i31]
  dblp key:
  - journals/corr/abs-2106-02745
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-02745
Xidong Feng, Oliver Slumbers, Yaodong Yang, Ziyu Wan, Bo Liu, Stephen McAleer, Ying Wen, Jun Wang:
Discovering Multi-Agent Auto-Curricula in Two-Player Zero-Sum Games. CoRR abs/2106.02745 (2021)
[i30]
  dblp key:
  - journals/corr/abs-2106-04958
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-04958
Xiangyu Liu, Hangtian Jia, Ying Wen, Yaodong Yang, Yujing Hu, Yingfeng Chen, Changjie Fan, Zhipeng Hu:
Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games. CoRR abs/2106.04958 (2021)
[i29]
  dblp key:
  - journals/corr/abs-2106-06828
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-06828
Ying Wen, Hui Chen, Yaodong Yang, Zheng Tian, Minne Li, Xu Chen, Jun Wang:
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization. CoRR abs/2106.06828 (2021)
[i28]
  dblp key:
  - journals/corr/abs-2106-07551
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-07551
Ming Zhou, Ziyu Wan, Hanjing Wang, Muning Wen, Runzhe Wu, Ying Wen, Yaodong Yang, Weinan Zhang, Jun Wang:
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning. CoRR abs/2106.07551 (2021)
[i27]
  dblp key:
  - journals/corr/abs-2108-08612
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-08612
Jakub Grudzien Kuba, Muning Wen, Yaodong Yang, Linghui Meng, Shangding Gu, Haifeng Zhang, David Henry Mguni, Jun Wang:
Settling the Variance of Multi-Agent Policy Gradients. CoRR abs/2108.08612 (2021)
[i26]
  dblp key:
  - journals/corr/abs-2109-01795
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-01795
Xiaotie Deng, Yuhao Li, David Henry Mguni, Jun Wang, Yaodong Yang:
On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games. CoRR abs/2109.01795 (2021)
[i25]
  dblp key:
  - journals/corr/abs-2109-09833
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-09833
Yixin Wu, Rui Luo, Chen Zhang, Jun Wang, Yaodong Yang:
Revisiting the Characteristics of Stochastic Gradient Noise and Dynamics. CoRR abs/2109.09833 (2021)
[i24]
  dblp key:
  - journals/corr/abs-2109-11251
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-11251
Jakub Grudzien Kuba, Ruiqing Chen, Muning Wen, Ying Wen, Fanglei Sun, Jun Wang, Yaodong Yang:
Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning. CoRR abs/2109.11251 (2021)
[i23]
  dblp key:
  - journals/corr/abs-2110-02793
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-02793
Shangding Gu, Jakub Grudzien Kuba, Muning Wen, Ruiqing Chen, Ziyan Wang, Zheng Tian, Jun Wang, Alois C. Knoll, Yaodong Yang:
Multi-Agent Constrained Policy Optimisation. CoRR abs/2110.02793 (2021)
[i22]
  dblp key:
  - journals/corr/abs-2110-03604
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-03604
Le Cong Dinh, David Henry Mguni, Long Tran-Thanh, Jun Wang, Yaodong Yang:
Online Markov Decision Processes with Non-oblivious Strategic Adversary. CoRR abs/2110.03604 (2021)
[i21]
dblp key: journals/corr/abs-2110-11737
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-11737
Ricky Sanjaya, Jun Wang, Yaodong Yang:
Measuring the Non-Transitivity in Chess. CoRR abs/2110.11737 (2021)
[i20]
dblp key: journals/corr/abs-2110-14468
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-14468
David Mguni, Joel Jennings, Taher Jafferjee, Aivar Sootla, Yaodong Yang, Changmin Yu, Usman Islam, Ziyan Wang, Jun Wang:
DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention. CoRR abs/2110.14468 (2021)
[i19]
dblp key: journals/corr/abs-2110-15105
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-15105
Chenguang Wang, Yaodong Yang, Oliver Slumbers, Congying Han, Tiande Guo, Haifeng Zhang, Jun Wang:
A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers. CoRR abs/2110.15105 (2021)
[i18]
dblp key: journals/corr/abs-2112-02618
persistent URL: https://dblp.org/rec/journals/corr/abs-2112-02618
David Henry Mguni, Taher Jafferjee, Jianhong Wang, Nicolas Perez Nieves, Oliver Slumbers, Feifei Tong, Yang Li, Jiangcheng Zhu, Yaodong Yang, Jun Wang:
LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning. CoRR abs/2112.02618 (2021)
[i17]
dblp key: journals/corr/abs-2112-02845
persistent URL: https://dblp.org/rec/journals/corr/abs-2112-02845
Linghui Meng, Muning Wen, Yaodong Yang, Chenyang Le, Xiyun Li, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Bo Xu:
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks. CoRR abs/2112.02845 (2021)
[i16]
dblp key: journals/corr/abs-2112-15400
persistent URL: https://dblp.org/rec/journals/corr/abs-2112-15400
Bo Liu, Xidong Feng, Haifeng Zhang, Jun Wang, Yaodong Yang:
Settling the Bias and Variance of Meta-Gradient Estimation for Meta-Reinforcement Learning. CoRR abs/2112.15400 (2021)
[i15]
dblp key: journals/eccc/DengLMWY21
persistent URL: https://dblp.org/rec/journals/eccc/DengLMWY21
Xiaotie Deng, Yuhao Li, David Mguni, Jun Wang, Yaodong Yang:
On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games. Electron. Colloquium Comput. Complex. TR21 (2021)
2020
[j2]
dblp key: journals/eor/KimYLMSJ20
persistent URL: https://dblp.org/rec/journals/eor/KimYLMSJ20
Alisa Kim, Yaodong Yang, Stefan Lessmann, Tiejun Ma, Ming-Chien Sung, Johnnie E. V. Johnson:
Can deep learning predict risky retail investors? A case study in financial risk behavior forecasting. Eur. J. Oper. Res. 283(1): 217-234 (2020)
[j1]
dblp key: journals/jossac/ZhangWLY20
persistent URL: https://dblp.org/rec/journals/jossac/ZhangWLY20
Qiang Zhang, Chao Wang, Shancun Liu, Yaodong Yang:
Order Execution Probability and Order Queue in Limit Order Markets. J. Syst. Sci. Complex. 33(5): 1545-1557 (2020)
[c15]
dblp key: conf/aaai/Zhang0HLY0W20
persistent URL: https://dblp.org/rec/conf/aaai/Zhang0HLY0W20
Haifeng Zhang, Weizhe Chen, Zeren Huang, Minne Li, Yaodong Yang, Weinan Zhang, Jun Wang:
Bi-Level Actor-Critic for Multi-Agent Coordination. AAAI 2020: 7325-7332
[c14]
dblp key: conf/atal/YangTSB20
persistent URL: https://dblp.org/rec/conf/atal/YangTSB20
Yaodong Yang, Rasul Tutunov, Phu Sakulwongtana, Haitham Bou-Ammar:
α^α-Rank: Practically Scaling α-Rank through Stochastic Optimisation. AAMAS 2020: 1575-1583
[c13]
dblp key: conf/atal/PengJLYLWZXYLLX20
persistent URL: https://dblp.org/rec/conf/atal/PengJLYLWZXYLLX20
Zhaoqing Peng, Junqi Jin, Lan Luo, Yaodong Yang, Rui Luo, Jun Wang, Weinan Zhang, Miao Xu, Chuan Yu, Tiejian Luo, Han Li, Jian Xu, Kun Gai:
Sequential Advertising Agent with Interpretable User Hidden Intents. AAMAS 2020: 1966-1968
[c12]
dblp key: conf/cikm/PengJLYLWZXXYLL20
persistent URL: https://dblp.org/rec/conf/cikm/PengJLYLWZXXYLL20
Zhaoqing Peng, Junqi Jin, Lan Luo, Yaodong Yang, Rui Luo, Jun Wang, Weinan Zhang, Haiyang Xu, Miao Xu, Chuan Yu, Tiejian Luo, Han Li, Jian Xu, Kun Gai:
Learning to Infer User Hidden States for Online Sequential Advertising. CIKM 2020: 2677-2684
[c11]
dblp key: conf/corl/ZhouLVYRM0AFCHW20
persistent URL: https://dblp.org/rec/conf/corl/ZhouLVYRM0AFCHW20
Ming Zhou, Jun Luo, Julian Villela, Yaodong Yang, David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, Iman Fadakar, Zheng Chen, Chongxi Huang, Ying Wen, Kimia Hassanzadeh, Daniel Graves, Zhengbang Zhu, Yihan Ni, Nhat M. Nguyen, Mohamed Elsayed, Haitham Ammar, Alexander I. Cowen-Rivers, Sanjeevan Ahilan, Zheng Tian, Daniel Palenicek, Kasra Rezaee, Peyman Yadmellat, Kun Shao, Dong Chen, Baokuan Zhang, Hongbo Zhang, Jianye Hao, Wulong Liu, Jun Wang:
SMARTS: An Open-Source Scalable Multi-Agent RL Training School for Autonomous Driving. CoRL 2020: 264-285
[c10]
dblp key: conf/icml/YangW0CSM020
persistent URL: https://dblp.org/rec/conf/icml/YangW0CSM020
Yaodong Yang, Ying Wen, Jun Wang, Liheng Chen, Kun Shao, David Mguni, Weinan Zhang:
Multi-Agent Determinantal Q-Learning. ICML 2020: 10757-10766
[c9]
dblp key: conf/ijcai/WenYW20
persistent URL: https://dblp.org/rec/conf/ijcai/WenYW20
Ying Wen, Yaodong Yang, Jun Wang:
Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning. IJCAI 2020: 414-421
[c8]
dblp key: conf/nips/LuoZY020
persistent URL: https://dblp.org/rec/conf/nips/LuoZY020
Rui Luo, Qiang Zhang, Yaodong Yang, Jun Wang:
Replica-Exchange Nosé-Hoover Dynamics for Bayesian Learning on Large Datasets. NeurIPS 2020
[i14]
dblp key: journals/corr/abs-2006-01482
persistent URL: https://dblp.org/rec/journals/corr/abs-2006-01482
Yaodong Yang, Ying Wen, Liheng Chen, Jun Wang, Kun Shao, David Mguni, Weinan Zhang:
Multi-Agent Determinantal Q-Learning. CoRR abs/2006.01482 (2020)
[i13]
dblp key: journals/corr/abs-2009-01453
persistent URL: https://dblp.org/rec/journals/corr/abs-2009-01453
Zhaoqing Peng, Junqi Jin, Lan Luo, Yaodong Yang, Rui Luo, Jun Wang, Weinan Zhang, Haiyang Xu, Miao Xu, Chuan Yu, Tiejian Luo, Han Li, Jian Xu, Kun Gai:
Learning to Infer User Hidden States for Online Sequential Advertising. CoRR abs/2009.01453 (2020)
[i12]
dblp key: journals/corr/abs-2010-09776
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-09776
Ming Zhou, Jun Luo, Julian Villela, Yaodong Yang, David Rusu, Jiayu Miao, Weinan Zhang, Montgomery Alban, Iman Fadakar, Zheng Chen, Aurora Chongxi Huang, Ying Wen, Kimia Hassanzadeh, Daniel Graves, Dong Chen, Zhengbang Zhu, Nhat M. Nguyen, Mohamed Elsayed, Kun Shao, Sanjeevan Ahilan, Baokuan Zhang, Jiannan Wu, Zhengang Fu, Kasra Rezaee, Peyman Yadmellat, Mohsen Rohani, Nicolas Perez Nieves, Yihan Ni, Seyedershad Banijamali, Alexander I. Cowen-Rivers, Zheng Tian, Daniel Palenicek, Haitham Bou-Ammar, Hongbo Zhang, Wulong Liu, Jianye Hao, Jun Wang:
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving. CoRR abs/2010.09776 (2020)
[i11]
dblp key: journals/corr/abs-2011-00583
persistent URL: https://dblp.org/rec/journals/corr/abs-2011-00583
Yaodong Yang, Jun Wang:
An Overview of Multi-Agent Reinforcement Learning from Game Theoretical Perspective. CoRR abs/2011.00583 (2020)

2010 – 2019

2019
[c7]
dblp key: conf/dai2/ZhouCWYS0ZW19
persistent URL: https://dblp.org/rec/conf/dai2/ZhouCWYS0ZW19
Ming Zhou, Yong Chen, Ying Wen, Yaodong Yang, Yufeng Su, Weinan Zhang, Dell Zhang, Jun Wang:
Factorized Q-learning for large-scale multi-agent systems. DAI 2019: 7:1-7:7
[c6]
dblp key: conf/icassp/YangLL19
persistent URL: https://dblp.org/rec/conf/icassp/YangLL19
Yaodong Yang, Rui Luo, Yuanyuan Liu:
Adversarial Variational Bayes Methods for Tweedie Compound Poisson Mixed Models. ICASSP 2019: 3377-3381
[c5]
dblp key: conf/iclr/WenYLWP19
persistent URL: https://dblp.org/rec/conf/iclr/WenYLWP19
Ying Wen, Yaodong Yang, Rui Luo, Jun Wang, Wei Pan:
Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning. ICLR (Poster) 2019
[c4]
dblp key: conf/www/LiQJYWWWY19
persistent URL: https://dblp.org/rec/conf/www/LiQJYWWWY19
Minne Li, Zhiwei (Tony) Qin, Yan Jiao, Yaodong Yang, Jun Wang, Chenxi Wang, Guobin Wu, Jieping Ye:
Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning. WWW 2019: 983-994
[i10]
dblp key: journals/corr/abs-1901-09207
persistent URL: https://dblp.org/rec/journals/corr/abs-1901-09207
Ying Wen, Yaodong Yang, Rui Luo, Jun Wang, Wei Pan:
Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning. CoRR abs/1901.09207 (2019)
[i9]
dblp key: journals/corr/abs-1901-09216
persistent URL: https://dblp.org/rec/journals/corr/abs-1901-09216
Ying Wen, Yaodong Yang, Rui Lu, Jun Wang:
Multi-Agent Generalized Recursive Reasoning. CoRR abs/1901.09216 (2019)
[i8]
dblp key: journals/corr/abs-1901-11454
persistent URL: https://dblp.org/rec/journals/corr/abs-1901-11454
Minne Li, Zhiwei (Tony) Qin, Yan Jiao, Yaodong Yang, Zhichen Gong, Jun Wang, Chenxi Wang, Guobin Wu, Jieping Ye:
Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning. CoRR abs/1901.11454 (2019)
[i7]
dblp key: journals/corr/abs-1905-12569
persistent URL: https://dblp.org/rec/journals/corr/abs-1905-12569
Rui Luo, Qiang Zhang, Yaodong Yang, Jun Wang:
Replica-exchange Nosé-Hoover dynamics for Bayesian learning on large datasets. CoRR abs/1905.12569 (2019)
[i6]
dblp key: journals/corr/abs-1909-03510
persistent URL: https://dblp.org/rec/journals/corr/abs-1909-03510
Haifeng Zhang, Weizhe Chen, Zeren Huang, Minne Li, Yaodong Yang, Weinan Zhang, Jun Wang:
Bi-level Actor-Critic for Multi-agent Coordination. CoRR abs/1909.03510 (2019)
2018
[c3]
dblp key: conf/atal/YangYBWZW18
persistent URL: https://dblp.org/rec/conf/atal/YangYBWZW18
Yaodong Yang, Lantao Yu, Yiwei Bai, Ying Wen, Weinan Zhang, Jun Wang:
A Study of AI Population Dynamics with Million-agent Reinforcement Learning. AAMAS 2018: 2133-2135
[c2]
dblp key: conf/icml/YangLLZZW18
persistent URL: https://dblp.org/rec/conf/icml/YangLLZZW18
Yaodong Yang, Rui Luo, Minne Li, Ming Zhou, Weinan Zhang, Jun Wang:
Mean Field Multi-Agent Reinforcement Learning. ICML 2018: 5567-5576
[c1]
dblp key: conf/nips/LuoWY0Z18
persistent URL: https://dblp.org/rec/conf/nips/LuoWY0Z18
Rui Luo, Jianhong Wang, Yaodong Yang, Jun Wang, Zhanxing Zhu:
Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning. NeurIPS 2018: 10696-10705
[i5]
dblp key: journals/corr/abs-1802-05438
persistent URL: https://dblp.org/rec/journals/corr/abs-1802-05438
Yaodong Yang, Rui Luo, Minne Li, Ming Zhou, Weinan Zhang, Jun Wang:
Mean Field Multi-Agent Reinforcement Learning. CoRR abs/1802.05438 (2018)
[i4]
dblp key: journals/corr/abs-1809-03738
persistent URL: https://dblp.org/rec/journals/corr/abs-1809-03738
Yong Chen, Ming Zhou, Ying Wen, Yaodong Yang, Yufeng Su, Weinan Zhang, Dell Zhang, Jun Wang, Han Liu:
Factorized Q-Learning for Large-Scale Multi-Agent Systems. CoRR abs/1809.03738 (2018)
[i3]
dblp key: journals/corr/abs-1811-03711
persistent URL: https://dblp.org/rec/journals/corr/abs-1811-03711
Qiang Zhang, Rui Luo, Yaodong Yang, Yuanyuan Liu:
Benchmarking Deep Sequential Models on Volatility Predictions for Financial Time Series. CoRR abs/1811.03711 (2018)
2017
[i2]
dblp key: journals/corr/PengYWYTLW17
persistent URL: https://dblp.org/rec/journals/corr/PengYWYTLW17
Peng Peng, Quan Yuan, Ying Wen, Yaodong Yang, Zhenkun Tang, Haitao Long, Jun Wang:
Multiagent Bidirectionally-Coordinated Nets for Learning to Play StarCraft Combat Games. CoRR abs/1703.10069 (2017)
[i1]
dblp key: journals/corr/abs-1709-04511
persistent URL: https://dblp.org/rec/journals/corr/abs-1709-04511
Yaodong Yang, Lantao Yu, Yiwei Bai, Jun Wang, Weinan Zhang, Ying Wen, Yong Yu:
An Empirical Study of AI Population Dynamics with Million-agent Reinforcement Learning. CoRR abs/1709.04511 (2017)
