Han Zhong 0001

钟涵

Person information

unicode name: 钟涵
affiliation: Peking University, Center for Data Science, Beijing, China

2020 – today

2024
[c14]
persistent URL: https://dblp.org/rec/conf/aistats/Huang0W024
Jiayi Huang, Han Zhong, Liwei Wang, Lin Yang:
Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation. AISTATS 2024: 3673-3681
[i19]
persistent URL: https://dblp.org/rec/journals/corr/abs-2404-03578
Miao Lu, Han Zhong, Tong Zhang, Jose H. Blanchet:
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm. CoRR abs/2404.03578 (2024)
2023
[j1]
persistent URL: https://dblp.org/rec/journals/jmlr/0001YWJ23
Han Zhong, Zhuoran Yang, Zhaoran Wang, Michael I. Jordan:
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopically Rational Followers? J. Mach. Learn. Res. 24: 35:1-35:52 (2023)
[c13]
persistent URL: https://dblp.org/rec/conf/iclr/00150S00023
Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Liwei Wang, Tong Zhang:
Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game. ICLR 2023
[c12]
persistent URL: https://dblp.org/rec/conf/iclr/Hu0J023
Jiachen Hu, Han Zhong, Chi Jin, Liwei Wang:
Provable Sim-to-real Transfer in Continuous Domain with Partial Observations. ICLR 2023
[c11]
persistent URL: https://dblp.org/rec/conf/nips/0001023
Han Zhong, Tong Zhang:
A Theoretical Analysis of Optimistic Proximal Policy Optimization in Linear Markov Decision Processes. NeurIPS 2023
[c10]
persistent URL: https://dblp.org/rec/conf/nips/Huang00023
Jiayi Huang, Han Zhong, Liwei Wang, Lin Yang:
Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds. NeurIPS 2023
[c9]
persistent URL: https://dblp.org/rec/conf/nips/QiuD00YZ23
Shuang Qiu, Ziyu Dai, Han Zhong, Zhaoran Wang, Zhuoran Yang, Tong Zhang:
Posterior Sampling for Competitive RL: Function Approximation and Partial Observation. NeurIPS 2023
[c8]
persistent URL: https://dblp.org/rec/conf/nips/Yang0WLWD23
Yunchang Yang, Han Zhong, Tianhao Wu, Bin Liu, Liwei Wang, Simon S. Du:
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback. NeurIPS 2023
[i18]
persistent URL: https://dblp.org/rec/journals/corr/abs-2302-01477
Yunchang Yang, Han Zhong, Tianhao Wu, Bin Liu, Liwei Wang, Simon S. Du:
A Reduction-based Framework for Sequential Decision Making with Delayed Feedback. CoRR abs/2302.01477 (2023)
[i17]
persistent URL: https://dblp.org/rec/journals/corr/abs-2302-10796
Han Zhong, Jiachen Hu, Yecheng Xue, Tongyang Li, Liwei Wang:
Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret. CoRR abs/2302.10796 (2023)
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-18258
Zhihan Liu, Miao Lu, Wei Xiong, Han Zhong, Hao Hu, Shenao Zhang, Sirui Zheng, Zhuoran Yang, Zhaoran Wang:
One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration. CoRR abs/2305.18258 (2023)
[i15]
persistent URL: https://dblp.org/rec/journals/corr/abs-2306-06836
Jiayi Huang, Han Zhong, Liwei Wang, Lin F. Yang:
Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds. CoRR abs/2306.06836 (2023)
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2312-04464
Jiayi Huang, Han Zhong, Liwei Wang, Lin F. Yang:
Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation. CoRR abs/2312.04464 (2023)
2022
[c7]
persistent URL: https://dblp.org/rec/conf/iclr/YangWZGPLWD22
Yunchang Yang, Tianhao Wu, Han Zhong, Evrard Garcelon, Matteo Pirotta, Alessandro Lazaric, Liwei Wang, Simon Shaolei Du:
A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning. ICLR 2022
[c6]
persistent URL: https://dblp.org/rec/conf/icml/ChenZYWW22
Xiaoyu Chen, Han Zhong, Zhuoran Yang, Zhaoran Wang, Liwei Wang:
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation. ICML 2022: 3773-3793
[c5]
persistent URL: https://dblp.org/rec/conf/icml/WuYZ0DJ22
Tianhao Wu, Yunchang Yang, Han Zhong, Liwei Wang, Simon S. Du, Jiantao Jiao:
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee. ICML 2022: 24243-24265
[c4]
persistent URL: https://dblp.org/rec/conf/icml/XiongZSSZ22
Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Tong Zhang:
A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games. ICML 2022: 24496-24523
[c3]
persistent URL: https://dblp.org/rec/conf/icml/ZhongXTWZWY22
Han Zhong, Wei Xiong, Jiyuan Tan, Liwei Wang, Tong Zhang, Zhaoran Wang, Zhuoran Yang:
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets. ICML 2022: 27117-27142
[c2]
persistent URL: https://dblp.org/rec/conf/nips/LiJZHW22
Binghui Li, Jikai Jin, Han Zhong, John E. Hopcroft, Liwei Wang:
Why Robust Generalization in Deep Learning is Difficult: Perspective of Expressive Power. NeurIPS 2022
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2202-07511
Han Zhong, Wei Xiong, Jiyuan Tan, Liwei Wang, Tong Zhang, Zhaoran Wang, Zhuoran Yang:
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets. CoRR abs/2202.07511 (2022)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2205-11140
Xiaoyu Chen, Han Zhong, Zhuoran Yang, Zhaoran Wang, Liwei Wang:
Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation. CoRR abs/2205.11140 (2022)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2205-13863
Binghui Li, Jikai Jin, Han Zhong, John E. Hopcroft, Liwei Wang:
Why Robust Generalization in Deep Learning is Difficult: Perspective of Expressive Power. CoRR abs/2205.13863 (2022)
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2205-15512
Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Liwei Wang, Tong Zhang:
Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game. CoRR abs/2205.15512 (2022)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-01907
Wei Xiong, Han Zhong, Chengshuai Shi, Cong Shen, Tong Zhang:
A Self-Play Posterior Sampling Algorithm for Zero-Sum Markov Games. CoRR abs/2210.01907 (2022)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-15598
Jiachen Hu, Han Zhong, Chi Jin, Liwei Wang:
Provable Sim-to-real Transfer in Continuous Domain with Partial Observations. CoRR abs/2210.15598 (2022)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2211-01962
Han Zhong, Wei Xiong, Sirui Zheng, Liwei Wang, Zhaoran Wang, Zhuoran Yang, Tong Zhang:
GEC: A Unified Framework for Interactive Decision Making in MDP, POMDP, and Beyond. CoRR abs/2211.01962 (2022)
2021
[c1]
persistent URL: https://dblp.org/rec/conf/nips/ZhongHYW21
Han Zhong, Jiayi Huang, Lin Yang, Liwei Wang:
Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs. NeurIPS 2021: 15710-15720
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2106-11692
Yunchang Yang, Tianhao Wu, Han Zhong, Evrard Garcelon, Matteo Pirotta, Alessandro Lazaric, Liwei Wang, Simon S. Du:
A Unified Framework for Conservative Exploration. CoRR abs/2106.11692 (2021)
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-08984
Han Zhong, Zhuoran Yang, Zhaoran Wang, Csaba Szepesvári:
Optimistic Policy Optimization is Provably Efficient in Non-stationary MDPs. CoRR abs/2110.08984 (2021)
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-13876
Han Zhong, Jiayi Huang, Lin F. Yang, Liwei Wang:
Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs. CoRR abs/2110.13876 (2021)
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-2112-10935
Tianhao Wu, Yunchang Yang, Han Zhong, Liwei Wang, Simon S. Du, Jiantao Jiao:
Nearly Optimal Policy Optimization with Stable at Any Time Guarantee. CoRR abs/2112.10935 (2021)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-2112-13521
Han Zhong, Zhuoran Yang, Zhaoran Wang, Michael I. Jordan:
Can Reinforcement Learning Find Stackelberg-Nash Equilibria in General-Sum Markov Games with Myopic Followers? CoRR abs/2112.13521 (2021)
2020
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-2012-14098
Han Zhong, Ethan X. Fang, Zhuoran Yang, Zhaoran Wang:
Risk-Sensitive Deep RL: Variance-Constrained Actor-Critic Provably Finds Globally Optimal Policy. CoRR abs/2012.14098 (2020)
