default search action
Bo An 0001
Person information
- affiliation: Nanyang Technological University, Singapore
Other persons with the same name
- Bo An
- Bo An 0002 — Chinese Academy of Sciences, Institute of Software, Beijing, China
- Bo An 0003 — Peking University, Beijing, China
- Bo An 0004 — North China Electric Power University, Beijing, China
- Bo An 0005 — Universitat Politécnica de Catalunya, Barcelona, Spain
- Bo An 0006 — Beihang University, Beijing, China
- Bo An 0007 — Heilongjiang Institute of Technology, Harbin, China
- Bo An 0008 — Southeast University, School of Computer Science and Engineering, China
- Bo An 0009 — Southeast University, National Mobile Communications Research Laboratory, Nanjing, China
- Bo An 0010 — Chinese Academy of Social Sciences, Institute of Ethnology and Anthropology, CASS Research Center for Ethnic Minority languages, Beijing, China
- Bo An 0011 — Lanzhou Jiaotong University, School of Automation and Electrical Engineering, China
- Bo An 0012 — Hebei Chemical and Pharmaceutical Vocational and Technical College, Shijiazhuang, China
- Bo An 0013 — Shijiazhuang University of Applied Technology, Department of Economy and Trade, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j68]Wanyuan Wang, Qian Che, Yifeng Zhou, Weiwei Wu, Bo An, Yichuan Jiang:
Offline policy reuse-guided anytime online collective multiagent planning and its application to mobility-on-demand systems. Auton. Agents Multi Agent Syst. 38(1): 19 (2024) - [j67]Ruhao Jiang, Yanchen Deng, Yingying Chen, He Luo, Bo An:
Deep reinforcement learning for multi-objective game strategy selection. Comput. Oper. Res. 168: 106683 (2024) - [j66]Weihao Tan, Wentao Zhang, Shanqi Liu, Longtao Zheng, Xinrun Wang, Bo An:
TWOSOME: An Efficient Online Framework to Align LLMs with Embodied Environments via Reinforcement Learning. Int. J. Artif. Intell. Robotics Res. 1(2): 2450004:1-2450004:21 (2024) - [j65]Senlin Shu, Haobo Wang, Zhuowei Wang, Bo Han, Tao Xiang, Bo An, Lei Feng:
Online binary classification from similar and dissimilar data. Mach. Learn. 113(6): 3463-3484 (2024) - [j64]Shaokang Dong, Chao Li, Shangdong Yang, Bo An, Wenbin Li, Yang Gao:
Egoism, utilitarianism and egalitarianism in multi-agent reinforcement learning. Neural Networks 178: 106544 (2024) - [j63]Jiaqi Lv, Biao Liu, Lei Feng, Ning Xu, Miao Xu, Bo An, Gang Niu, Xin Geng, Masashi Sugiyama:
On the Robustness of Average Losses for Partial-Label Learning. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 2569-2583 (2024) - [j62]Shifei Ding, Wei Du, Ling Ding, Jian Zhang, Lili Guo, Bo An:
Robust Multi-Agent Communication With Graph Information Bottleneck Optimization. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 3096-3107 (2024) - [c218]Molei Qin, Shuo Sun, Wentao Zhang, Haochong Xia, Xinrun Wang, Bo An:
EarnHFT: Efficient Hierarchical Reinforcement Learning for High Frequency Trading. AAAI 2024: 14669-14676 - [c217]Haochong Xia, Shuo Sun, Xinrun Wang, Bo An:
Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context. AAAI 2024: 15996-16004 - [c216]Pengdeng Li, Runsheng Yu, Xinrun Wang, Bo An:
Transition-Informed Reinforcement Learning for Large-Scale Stackelberg Mean-Field Games. AAAI 2024: 17469-17476 - [c215]Shuqi Liu, Yuzhou Cao, Qiaozhen Zhang, Lei Feng, Bo An:
Mitigating Underfitting in Learning to Defer with Consistent Losses. AISTATS 2024: 4816-4824 - [c214]Yuzhou Cao, Lei Feng, Bo An:
Consistent Hierarchical Classification with A Generalized Metric. AISTATS 2024: 4825-4833 - [c213]Pengdeng Li, Shuxin Li, Xinrun Wang, Jakub Cerný, Youzhi Zhang, Stephen McAleer, Hau Chan, Bo An:
Grasper: A Generalist Pursuer for Pursuit-Evasion Problems. AAMAS 2024: 1147-1155 - [c212]Xinrun Wang, Chang Yang, Shuxin Li, Pengdeng Li, Xiao Huang, Hau Chan, Bo An:
Reinforcement Nash Equilibrium Solver. AAMAS 2024: 2552-2554 - [c211]Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan:
vMFER: von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement of Actor-Critic Algorithms. AAMAS 2024: 2621-2623 - [c210]Ruyi An, Yewen Li, Xu He, Pengjie Gu, Mengchen Zhao, Dong Li, Jianye Hao, Chaojie Wang, Bo An, Mingyuan Zhou:
Improving Unsupervised Hierarchical Representation With Reinforcement Learning. CVPR 2024: 22946-22956 - [c209]Shanqi Liu, Dong Xing, Pengjie Gu, Xinrun Wang, Bo An, Yong Liu:
Solving Homogeneous and Heterogeneous Cooperative Tasks with Greedy Sequential Execution. ICLR 2024 - [c208]Safa Messaoud, Billel Mokeddem, Zhenghai Xue, Linsey Pang, Bo An, Haipeng Chen, Sanjay Chawla:
S2AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic. ICLR 2024 - [c207]Weihao Tan, Wentao Zhang, Shanqi Liu, Longtao Zheng, Xinrun Wang, Bo An:
True Knowledge Comes from Practice: Aligning Large Language Models with Embodied Environments via Reinforcement Learning. ICLR 2024 - [c206]Zixi Wei, Senlin Shu, Yuzhou Cao, Hongxin Wei, Bo An, Lei Feng:
Consistent Multi-Class Classification from Multiple Unlabeled Datasets. ICLR 2024 - [c205]Longtao Zheng, Rundong Wang, Xinrun Wang, Bo An:
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control. ICLR 2024 - [c204]Shengjie Zhou, Lue Tao, Yuzhou Cao, Tao Xiang, Bo An, Lei Feng:
On the Vulnerability of Adversarially Trained Models Against Two-faced Attacks. ICLR 2024 - [c203]Youzhi Zhang, Bo An, Daniel Dajun Zeng:
DAG-Based Column Generation for Adversarial Team Games. ICML 2024 - [c202]Lang Feng, Pengjie Gu, Bo An, Gang Pan:
Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree. ICML 2024 - [c201]Zhenxing Ge, Zheng Xu, Tianyu Ding, Linjian Meng, Bo An, Wenbin Li, Yang Gao:
Safe and Robust Subgame Exploitation in Imperfect Information Games. ICML 2024 - [c200]Pengdeng Li, Shuxin Li, Chang Yang, Xinrun Wang, Shuyue Hu, Xiao Huang, Hau Chan, Bo An:
Configurable Mirror Descent: Towards a Unification of Decision Making. ICML 2024 - [c199]Zitao Song, Chao Yang, Chaojie Wang, Bo An, Shuang Li:
Latent Logic Tree Extraction for Event Sequence Explanation from LLMs. ICML 2024 - [c198]Pengdeng Li, Shuxin Li, Chang Yang, Xinrun Wang, Xiao Huang, Hau Chan, Bo An:
Self-adaptive PSRO: Towards an Automatic Population-based Game Solver. IJCAI 2024: 139-147 - [c197]Xinrun Wang, Chang Yang, Shuxin Li, Pengdeng Li, Xiao Huang, Hau Chan, Bo An:
Reinforcement Nash Equilibrium Solver. IJCAI 2024: 265-273 - [c196]Pengjie Gu, Mengchen Zhao, Xu He, Yi Cai, Bo An:
PoRank: A Practical Framework for Learning to Rank Policies. IJCAI 2024: 4044-4052 - [c195]Wanqi Xue, Bo An, Shuicheng Yan, Zhongwen Xu:
Reinforcement Learning from Diverse Human Preferences. IJCAI 2024: 5298-5306 - [c194]Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan:
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement. IJCAI 2024: 5725-5733 - [c193]Hui Niu, Siyuan Li, Jiahao Zheng, Zhouchi Lin, Bo An, Jian Li, Jian Guo:
IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making. IJCAI 2024: 5999-6007 - [c192]Ying Zheng, Lei Jiao, Yuedong Xu, Bo An, Xin Wang, Zongpeng Li:
Scheduling Generative-AI Job DAGs with Model Serving in Data Centers. IWQoS 2024: 1-6 - [c191]Wentao Zhang, Lingxuan Zhao, Haochong Xia, Shuo Sun, Jiaze Sun, Molei Qin, Xinyi Li, Yuqing Zhao, Yilei Zhao, Xinyu Cai, Longtao Zheng, Xinrun Wang, Bo An:
A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist. KDD 2024: 4314-4325 - [c190]Chuqiao Zong, Chaojie Wang, Molei Qin, Lei Feng, Xinrun Wang, Bo An:
MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading. KDD 2024: 4712-4721 - [c189]Wentao Zhang, Yilei Zhao, Shuo Sun, Jie Ying, Yonggang Xie, Zitao Song, Xinrun Wang, Bo An:
Reinforcement Learning with Maskable Stock Representation for Portfolio Management in Customizable Stock Pools. WWW 2024: 187-198 - [i93]Chaojie Wang, Yishi Xu, Zhong Peng, Chenxi Zhang, Bo Chen, Xinrun Wang, Lei Feng, Bo An:
keqing: knowledge-based question answering is a nature chain-of-thought mentor of LLM. CoRR abs/2401.00426 (2024) - [i92]Renchunzi Xie, Ambroise Odonnat, Vasilii Feofanov, Ievgen Redko, Jianfeng Zhang, Bo An:
Characterising Gradients for Unsupervised Accuracy Estimation under Distribution Shift. CoRR abs/2401.08909 (2024) - [i91]Qi Wei, Lei Feng, Haobo Wang, Bo An:
Debiased Sample Selection for Combating Noisy Labels. CoRR abs/2401.13360 (2024) - [i90]Weihao Tan, Wentao Zhang, Shanqi Liu, Longtao Zheng, Xinrun Wang, Bo An:
True Knowledge Comes from Practice: Aligning LLMs with Embodied Environments via Reinforcement Learning. CoRR abs/2401.14151 (2024) - [i89]Wentao Zhang, Lingxuan Zhao, Haochong Xia, Shuo Sun, Jiaze Sun, Molei Qin, Xinyi Li, Yuqing Zhao, Yilei Zhao, Xinyu Cai, Longtao Zheng, Xinrun Wang, Bo An:
A Multimodal Foundation Agent for Financial Trading: Tool-Augmented, Diversified, and Generalist. CoRR abs/2402.18485 (2024) - [i88]Naming Liu, Mingzhi Wang, Youzhi Zhang, Yaodong Yang, Bo An, Ying Wen:
Leveraging Team Correlation for Approximating Equilibrium in Two-Team Zero-Sum Games. CoRR abs/2403.00255 (2024) - [i87]Weihao Tan, Ziluo Ding, Wentao Zhang, Boyu Li, Bohan Zhou, Junpeng Yue, Haochong Xia, Jiechuan Jiang, Longtao Zheng, Xinrun Xu, Yifei Bi, Pengjie Gu, Xinrun Wang, Börje F. Karlsson, Bo An, Zongqing Lu:
Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study. CoRR abs/2403.03186 (2024) - [i86]Longtao Zheng, Zhiyuan Huang, Zhenghai Xue, Xinrun Wang, Bo An, Shuicheng Yan:
AgentStudio: A Toolkit for Building General Virtual Agents. CoRR abs/2403.17918 (2024) - [i85]Pengdeng Li, Shuxin Li, Chang Yang, Xinrun Wang, Xiao Huang, Hau Chan, Bo An:
Self-adaptive PSRO: Towards an Automatic Population-based Game Solver. CoRR abs/2404.11144 (2024) - [i84]Pengdeng Li, Shuxin Li, Xinrun Wang, Jakub Cerný, Youzhi Zhang, Stephen McAleer, Hau Chan, Bo An:
Grasper: A Generalist Pursuer for Pursuit-Evasion Problems. CoRR abs/2404.12626 (2024) - [i83]Safa Messaoud, Billel Mokeddem, Zhenghai Xue, Linsey Pang, Bo An, Haipeng Chen, Sanjay Chawla:
S2AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic. CoRR abs/2405.00987 (2024) - [i82]Xinrun Wang, Chang Yang, Shuxin Li, Pengdeng Li, Xiao Huang, Hau Chan, Bo An:
Reinforcement Nash Equilibrium Solver. CoRR abs/2405.03518 (2024) - [i81]Yiwen Zhu, Jinyi Liu, Wenya Wei, Qianyi Fu, Yujing Hu, Zhou Fang, Bo An, Jianye Hao, Tangjie Lv, Changjie Fan:
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement. CoRR abs/2405.08638 (2024) - [i80]Pengdeng Li, Shuxin Li, Chang Yang, Xinrun Wang, Shuyue Hu, Xiao Huang, Hau Chan, Bo An:
Configurable Mirror Descent: Towards a Unification of Decision Making. CoRR abs/2405.11746 (2024) - [i79]Lang Feng, Pengjie Gu, Bo An, Gang Pan:
Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree. CoRR abs/2405.17879 (2024) - [i78]Renchunzi Xie, Ambroise Odonnat, Vasilii Feofanov, Weijian Deng, Jianfeng Zhang, Bo An:
MANO: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution Shifts. CoRR abs/2405.18979 (2024) - [i77]Zitao Song, Chao Yang, Chaojie Wang, Bo An, Shuang Li:
Latent Logic Tree Extraction for Event Sequence Explanation from LLMs. CoRR abs/2406.01124 (2024) - [i76]Chaojie Wang, Yanchen Deng, Zhiyi Lv, Zeng Liang, Jujie He, Shuicheng Yan, Bo An:
Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning. CoRR abs/2406.14283 (2024) - [i75]Chuqiao Zong, Chaojie Wang, Molei Qin, Lei Feng, Xinrun Wang, Bo An:
MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading. CoRR abs/2406.14537 (2024) - [i74]Shuxin Li, Chang Yang, Youzhi Zhang, Pengdeng Li, Xinrun Wang, Xiao Huang, Hau Chan, Bo An:
In-Context Exploiter for Extensive-Form Games. CoRR abs/2408.05575 (2024) - [i73]Yewen Li, Chaojie Wang, Xiaobo Xia, Xu He, Ruyi An, Dong Li, Tongliang Liu, Bo An, Xinrun Wang:
Resultant: Incremental Effectiveness on Likelihood for Unsupervised Out-of-Distribution Detection. CoRR abs/2409.03801 (2024) - [i72]Hezhe Qiao, Hanghang Tong, Bo An, Irwin King, Charu C. Aggarwal, Guansong Pang:
Deep Graph Anomaly Detection: A Survey and New Perspectives. CoRR abs/2409.09957 (2024) - [i71]Naming Liu, Mingzhi Wang, Xihuai Wang, Weinan Zhang, Yaodong Yang, Youzhi Zhang, Bo An, Ying Wen:
Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games. CoRR abs/2410.01575 (2024) - [i70]Aye Phyu Phyu Aung, Xinrun Wang, Ruiyu Wang, Hau Chan, Bo An, Xiaoli Li, J. Senthilnath:
Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models. CoRR abs/2410.04764 (2024) - 2023
- [j61]Shijie Han, Siyuan Li, Bo An, Wei Zhao, Peng Liu:
Classifying ambiguous identities in hidden-role Stochastic games with multi-agent reinforcement learning. Auton. Agents Multi Agent Syst. 37(2): 35 (2023) - [j60]Xiao Liu, Shuyang Liu, Bo An, Yang Gao, Shangdong Yang, Wenbin Li:
Effective Interpretable Policy Distillation via Critical Experience Point Identification. IEEE Intell. Syst. 38(5): 28-36 (2023) - [j59]Shifei Ding, Wei Du, Ling Ding, Lili Guo, Jian Zhang, Bo An:
Multi-agent dueling Q-learning with mean field and value decomposition. Pattern Recognit. 139: 109436 (2023) - [j58]Shuo Sun, Rundong Wang, Bo An:
Reinforcement Learning for Quantitative Trading. ACM Trans. Intell. Syst. Technol. 14(3): 44:1-44:29 (2023) - [j57]Lei Feng, Senlin Shu, Yuzhou Cao, Lue Tao, Hongxin Wei, Tao Xiang, Bo An, Gang Niu:
Multiple-Instance Learning From Unlabeled Bags With Pairwise Similarity. IEEE Trans. Knowl. Data Eng. 35(11): 11599-11609 (2023) - [j56]Shuo Sun, Molei Qin, Xinrun Wang, Bo An:
PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets. Trans. Mach. Learn. Res. 2023 (2023) - [j55]Hongxin Wei, Renchunzi Xie, Lei Feng, Bo Han, Bo An:
Deep Learning From Multiple Noisy Annotators as A Union. IEEE Trans. Neural Networks Learn. Syst. 34(12): 10552-10562 (2023) - [c188]Linjian Meng, Zhenxing Ge, Pinzhuo Tian, Bo An, Yang Gao:
An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Games. AAAI 2023: 5823-5831 - [c187]Xin Cheng, Deng-Bao Wang, Lei Feng, Min-Ling Zhang, Bo An:
Partial-Label Regression. AAAI 2023: 7140-7147 - [c186]Shuxin Li, Xinrun Wang, Youzhi Zhang, Wanqi Xue, Jakub Cerný, Bo An:
Solving Large-Scale Pursuit-Evasion Games Using Pre-trained Strategies. AAAI 2023: 11586-11594 - [c185]Shuqi Liu, Yuzhou Cao, Qiaozhen Zhang, Lei Feng, Bo An:
Consistent Complementary-Label Learning via Order-Preserving Losses. AISTATS 2023: 8734-8748 - [c184]Qian Che, Wanyuan Wang, Fengchen Wang, Tianchi Qiao, Xiang Liu, Jiuchuan Jiang, Bo An, Yichuan Jiang:
Structural Credit Assignment-Guided Coordinated MCTS: An Efficient and Scalable Method for Online Multiagent Planning. AAMAS 2023: 543-551 - [c183]Wei Qiu, Weixun Wang, Rundong Wang, Bo An, Yujing Hu, Svetlana Obraztsova, Zinovi Rabinovich, Jianye Hao, Yingfeng Chen, Changjie Fan:
Off-Beat Multi-Agent Reinforcement Learning. AAMAS 2023: 2424-2426 - [c182]Haipeng Chen, Bryan Wilder, Wei Qiu, Bo An, Eric Rice, Milind Tambe:
A Learning Approach to Complex Contagion Influence Maximization. AAMAS 2023: 2622-2624 - [c181]Youzhi Zhang, Bo An, V. S. Subrahmanian:
Finding Optimal Nash Equilibria in Multiplayer Games via Correlation Plans. AAMAS 2023: 2712-2714 - [c180]Wei Qiu, Xiao Ma, Bo An, Svetlana Obraztsova, Shuicheng Yan, Zhongwen Xu:
RPM: Generalizable Multi-Agent Policies for Multi-Agent Reinforcement Learning. ICLR 2023 - [c179]Pengdeng Li, Xinrun Wang, Shuxin Li, Hau Chan, Bo An:
Population-size-Aware Policy Optimization for Mean-Field Games. ICLR 2023 - [c178]Wanqi Xue, Qingpeng Cai, Ruohan Zhan, Dong Zheng, Peng Jiang, Kun Gai, Bo An:
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor. ICLR 2023 - [c177]Xin Cheng, Yuzhou Cao, Ximing Li, Bo An, Lei Feng:
Weakly Supervised Regression with Interval Targets. ICML 2023: 5428-5448 - [c176]Hongxin Wei, Huiping Zhuang, Renchunzi Xie, Lei Feng, Gang Niu, Bo An, Yixuan Li:
Mitigating Memorization of Noisy Labels by Clipping the Model Prediction. ICML 2023: 36868-36886 - [c175]Dong Xing, Pengjie Gu, Qian Zheng, Xinrun Wang, Shanqi Liu, Longtao Zheng, Bo An, Gang Pan:
Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification. ICML 2023: 38272-38285 - [c174]Hao Cheng, Shufeng Kong, Yanchen Deng, Caihua Liu, Xiaohu Wu, Bo An, Chongjun Wang:
Exploring Leximin Principle for Fair Core-Selecting Combinatorial Auctions: Payment Rule Design and Implementation. IJCAI 2023: 2581-2588 - [c173]Haipeng Chen, Bryan Wilder, Wei Qiu, Bo An, Eric Rice, Milind Tambe:
Complex Contagion Influence Maximization: A Reinforcement Learning Approach. IJCAI 2023: 5531-5540 - [c172]Shuo Sun, Xinrun Wang, Wanqi Xue, Xiaoxuan Lou, Bo An:
Mastering Stock Markets with Efficient Mixture of Diversified Trading Experts. KDD 2023: 2109-2119 - [c171]Wanqi Xue, Qingpeng Cai, Zhenghai Xue, Shuo Sun, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An:
PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement. KDD 2023: 2874-2884 - [c170]Youzhi Zhang, Bo An, Venkatramanan Siva Subrahmanian:
Computing Optimal Nash Equilibria in Multiplayer Games. NeurIPS 2023 - [c169]Yuzhou Cao, Hussein Mozannar, Lei Feng, Hongxin Wei, Bo An:
In Defense of Softmax Parametrization for Calibrated and Consistent Learning to Defer. NeurIPS 2023 - [c168]Xin Cheng, Yuzhou Cao, Haobo Wang, Hongxin Wei, Bo An, Lei Feng:
Regression with Cost-based Rejection. NeurIPS 2023 - [c167]Zhibin Duan, Zhiyi Lv, Chaojie Wang, Bo Chen, Bo An, Mingyuan Zhou:
Few-shot Generation via Recalling Brain-Inspired Episodic-Semantic Memory. NeurIPS 2023 - [c166]Pengjie Gu, Xinyu Cai, Dong Xing, Xinrun Wang, Mengchen Zhao, Bo An:
Offline RL with Discrete Proxy Representations for Generalizability in POMDPs. NeurIPS 2023 - [c165]Shuo Sun, Molei Qin, Wentao Zhang, Haochong Xia, Chuqiao Zong, Jie Ying, Yonggang Xie, Lingxuan Zhao, Xinrun Wang, Bo An:
TradeMaster: A Holistic Quantitative Trading Platform Empowered by Reinforcement Learning. NeurIPS 2023 - [c164]Renchunzi Xie, Hongxin Wei, Lei Feng, Yuzhou Cao, Bo An:
On the Importance of Feature Separability in Predicting Out-Of-Distribution Error. NeurIPS 2023 - [c163]Zhenghai Xue, Qingpeng Cai, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An:
State Regularized Policy Optimization on Data with Dynamics Shift. NeurIPS 2023 - [e7]Noa Agmon, Bo An, Alessandro Ricci, William Yeoh:
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2023, London, United Kingdom, 29 May 2023 - 2 June 2023. ACM 2023, ISBN 978-1-4503-9432-1 [contents] - [i69]Wanqi Xue, Bo An, Shuicheng Yan, Zhongwen Xu:
Reinforcement Learning from Diverse Human Preferences. CoRR abs/2301.11774 (2023) - [i68]Shuo Sun, Molei Qin, Xinrun Wang, Bo An:
PRUDEX-Compass: Towards Systematic Evaluation of Reinforcement Learning in Financial Markets. CoRR abs/2302.00586 (2023) - [i67]Pengdeng Li, Xinrun Wang, Shuxin Li, Hau Chan, Bo An:
Population-size-Aware Policy Optimization for Mean-Field Games. CoRR abs/2302.03364 (2023) - [i66]Rundong Wang, Longtao Zheng, Wei Qiu, Bowei He, Bo An, Zinovi Rabinovich, Yujing Hu, Yingfeng Chen, Tangjie Lv, Changjie Fan:
Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning. CoRR abs/2302.03429 (2023) - [i65]Renchunzi Xie, Hongxin Wei, Yuzhou Cao, Lei Feng, Bo An:
On the Importance of Feature Separability in Predicting Out-Of-Distribution Error. CoRR abs/2303.15488 (2023) - [i64]Zhenghai Xue, Qingpeng Cai, Shuchang Liu, Dong Zheng, Peng Jiang, Kun Gai, Bo An:
State Regularized Policy Optimization on Data with Dynamics Shift. CoRR abs/2306.03552 (2023) - [i63]Longtao Zheng, Rundong Wang, Bo An:
Synapse: Leveraging Few-Shot Exemplars for Human-Level Computer Control. CoRR abs/2306.07863 (2023) - [i62]Xin Cheng, Deng-Bao Wang, Lei Feng, Min-Ling Zhang, Bo An:
Partial-Label Regression. CoRR abs/2306.08968 (2023) - [i61]Xin Cheng, Yuzhou Cao, Ximing Li, Bo An, Lei Feng:
Weakly Supervised Regression with Interval Targets. CoRR abs/2306.10458 (2023) - [i60]Dong Xing, Pengjie Gu, Qian Zheng, Xinrun Wang, Shanqi Liu, Longtao Zheng, Bo An, Gang Pan:
Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification. CoRR abs/2306.10944 (2023) - [i59]Hui Niu, Siyuan Li, Jiahao Zheng, Zhouchi Lin, Jian Li, Jian Guo, Bo An:
IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making. CoRR abs/2308.08918 (2023) - [i58]Linjian Meng, Zhenxing Ge, Wenbin Li, Bo An, Yang Gao:
Efficient Last-iterate Convergence Algorithms in Solving Games. CoRR abs/2308.11256 (2023) - [i57]Haochong Xia, Shuo Sun, Xinrun Wang, Bo An:
Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context. CoRR abs/2309.07708 (2023) - [i56]Zhenghai Xue, Qingpeng Cai, Tianyou Zuo, Bin Yang, Lantao Hu, Peng Jiang, Kun Gai, Bo An:
AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement. CoRR abs/2310.03984 (2023) - [i55]