default search action
Jiafei Lyu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j6]Aicheng Gong, Kai Yang, Jiafei Lyu, Xiu Li:
A two-stage reinforcement learning-based approach for multi-entity task allocation. Eng. Appl. Artif. Intell. 136: 108906 (2024) - [j5]Jiafei Lyu, Le Wan, Xiu Li, Zongqing Lu:
Off-policy RL algorithms can be sample-efficient for continuous control via sample multiple reuse. Inf. Sci. 666: 120371 (2024) - [j4]Jiafei Lyu, Le Wan, Xiu Li, Zongqing Lu:
Understanding What Affects the Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence. J. Artif. Intell. Res. 81: 1-42 (2024) - [j3]Mengbei Yan, Jiafei Lyu, Xiu Li:
Enhancing visual reinforcement learning with State-Action Representation. Knowl. Based Syst. 304: 112487 (2024) - [c15]Lu Li, Jiafei Lyu, Guozheng Ma, Zilin Wang, Zhenjie Yang, Xiu Li, Zhiheng Li:
Normalization Enhances Generalization in Visual Reinforcement Learning. AAMAS 2024: 1137-1146 - [c14]Jiafei Lyu, Le Wan, Xiu Li, Zongqing Lu:
Towards Understanding How to Reduce Generalization Gap in Visual Reinforcement Learning. AAMAS 2024: 2369-2371 - [c13]Kai Yang, Jian Tao, Jiafei Lyu, Chunjiang Ge, Jiaxin Chen, Weihan Shen, Xiaolong Zhu, Xiu Li:
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model. CVPR 2024: 8941-8951 - [c12]Zhongjian Qiao, Jiafei Lyu, Xiu Li:
Mind the Model, Not the Agent: The Primacy Bias in Model-Based RL. ECAI 2024: 1824-1831 - [c11]Shengjie Sun, Jiafei Lyu, Lu Li, Jiazhe Guo, Mengbei Yan, Runze Liu, Xiu Li:
Enhancing Visual Generalization in Reinforcement Learning with Cycling Augmentation. ICANN (4) 2024: 397-411 - [c10]Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Li Xiu, Zongqing Lu:
SEABO: A Simple Search-Based Method for Offline Imitation Learning. ICLR 2024 - [c9]Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li:
PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation. ICML 2024 - [c8]Jiafei Lyu, Chenjia Bai, Jingwen Yang, Zongqing Lu, Xiu Li:
Cross-Domain Policy Adaptation by Capturing Representation Mismatch. ICML 2024 - [c7]Kai Yang, Jian Tao, Jiafei Lyu, Xiu Li:
Exploration and Anti-Exploration with Distributional Random Network Distillation. ICML 2024 - [i19]Kai Yang, Jian Tao, Jiafei Lyu, Xiu Li:
Exploration and Anti-Exploration with Distributional Random Network Distillation. CoRR abs/2401.09750 (2024) - [i18]Jiafei Lyu, Le Wan, Xiu Li, Zongqing Lu:
Understanding What Affects Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence. CoRR abs/2402.02701 (2024) - [i17]Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Xiu Li, Zongqing Lu:
SEABO: A Simple Search-Based Method for Offline Imitation Learning. CoRR abs/2402.03807 (2024) - [i16]Jiafei Lyu, Chenjia Bai, Jingwen Yang, Zongqing Lu, Xiu Li:
Cross-Domain Policy Adaptation by Capturing Representation Mismatch. CoRR abs/2405.15369 (2024) - [i15]Zeyuan Liu, Ziyu Huan, Xiyao Wang, Jiafei Lyu, Jian Tao, Xiu Li, Furong Huang, Huazhe Xu:
World Models with Hints of Large Language Models for Goal Achieving. CoRR abs/2406.07381 (2024) - [i14]Aicheng Gong, Kai Yang, Jiafei Lyu, Xiu Li:
A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation. CoRR abs/2407.00496 (2024) - [i13]Zhongjian Qiao, Jiafei Lyu, Kechen Jiao, Qi Liu, Xiu Li:
SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning. CoRR abs/2408.12970 (2024) - 2023
- [j2]Jiafei Lyu, Yu Yang, Jiangpeng Yan, Xiu Li:
Value activation for bias alleviation: Generalized-activated deep double deterministic policy gradients. Neurocomputing 518: 70-81 (2023) - [c6]Junjie Zhang, Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Jun Yang, Le Wan, Xiu Li:
Uncertainty-Driven Trajectory Truncation for Data Augmentation in Offline Reinforcement Learning. ECAI 2023: 3018-3025 - [c5]Jiafei Lyu, Aicheng Gong, Le Wan, Zongqing Lu, Xiu Li:
State Advantage Weighting for Offline RL. Tiny Papers @ ICLR 2023 - [i12]Junjie Zhang, Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Jun Yang, Le Wan, Xiu Li:
Uncertainty-driven Trajectory Truncation for Model-based Offline Reinforcement Learning. CoRR abs/2304.04660 (2023) - [i11]Jiafei Lyu, Le Wan, Zongqing Lu, Xiu Li:
Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse. CoRR abs/2305.18443 (2023) - [i10]Lu Li, Jiafei Lyu, Guozheng Ma, Zilin Wang, Zhenjie Yang, Xiu Li, Zhiheng Li:
Normalization Enhances Generalization in Visual Reinforcement Learning. CoRR abs/2306.00656 (2023) - [i9]Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li:
Zero-shot Preference Learning for Offline RL via Optimal Transport. CoRR abs/2306.03615 (2023) - [i8]Zhongjian Qiao, Jiafei Lyu, Xiu Li:
The primacy bias in Model-based RL. CoRR abs/2310.15017 (2023) - [i7]Kai Yang, Jian Tao, Jiafei Lyu, Chunjiang Ge, Jiaxin Chen, Qimai Li, Weihan Shen, Xiaolong Zhu, Xiu Li:
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model. CoRR abs/2311.13231 (2023) - 2022
- [c4]Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Xiu Li:
Efficient Continuous Control with Double Actors and Regularized Critics. AAAI 2022: 7655-7663 - [c3]Jiafei Lyu, Xiu Li, Zongqing Lu:
Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination. NeurIPS 2022 - [c2]Jiafei Lyu, Xiaoteng Ma, Xiu Li, Zongqing Lu:
Mildly Conservative Q-Learning for Offline Reinforcement Learning. NeurIPS 2022 - [c1]Xihui Li, Zhongjian Qiao, Aicheng Gong, Jiafei Lyu, ChengHui Yu, Jiangpeng Yan, Xiu Li:
PRAG: Periodic Regularized Action Gradient for Efficient Continuous Control. PRICAI (3) 2022: 106-119 - [i6]Jiafei Lyu, Xiaoteng Ma, Xiu Li, Zongqing Lu:
Mildly Conservative Q-Learning for Offline Reinforcement Learning. CoRR abs/2206.04745 (2022) - [i5]Jiafei Lyu, Xiu Li, Zongqing Lu:
Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination. CoRR abs/2206.07989 (2022) - [i4]Jiafei Lyu, Aicheng Gong, Le Wan, Zongqing Lu, Xiu Li:
State Advantage Weighting for Offline RL. CoRR abs/2210.04251 (2022) - 2021
- [i3]Rui Yang, Jiafei Lyu, Yu Yang, Jiangpeng Yan, Feng Luo, Dijun Luo, Lanqing Li, Xiu Li:
Bias-reduced multi-step hindsight experience replay. CoRR abs/2102.12962 (2021) - [i2]Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Xiu Li:
Efficient Continuous Control with Double Actors and Regularized Critics. CoRR abs/2106.03050 (2021) - [i1]Jiafei Lyu, Yu Yang, Jiangpeng Yan, Xiu Li:
Value Activation for Bias Alleviation: Generalized-activated Deep Double Deterministic Policy Gradients. CoRR abs/2112.11216 (2021) - 2020
- [j1]Chao Lu, Jiafei Lyu, Liming Zhang, Aicheng Gong, Yipeng Fan, Jiangpeng Yan, Xiu Li:
Nuclear Power Plants With Artificial Intelligence in Industry 4.0 Era: Top-Level Design and Current Applications - A Systemic Review. IEEE Access 8: 194315-194332 (2020)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-28 21:13 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint