default search action
Hengyuan Hu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c18]Samuel Sokota, Gabriele Farina, David J. Wu, Hengyuan Hu, Kevin A. Wang, J. Zico Kolter, Noam Brown:
The Update-Equivalence Framework for Decision-Time Planning. ICLR 2024 - [c17]Minae Kwon, Hengyuan Hu, Vivek Myers, Siddharth Karamcheti, Anca D. Dragan, Dorsa Sadigh:
Toward Grounded Commonsense Reasoning. ICRA 2024: 5463-5470 - 2023
- [c16]Brandon Cui, Andrei Lupu, Samuel Sokota, Hengyuan Hu, David J. Wu, Jakob Nicolaus Foerster:
Adversarial Diversity in Hanabi. ICLR 2023 - [c15]Hengyuan Hu, Dorsa Sadigh:
Language Instructed Reinforcement Learning for Human-AI Coordination. ICML 2023: 13584-13598 - [i18]Hengyuan Hu, Dorsa Sadigh:
Language Instructed Reinforcement Learning for Human-AI Coordination. CoRR abs/2304.07297 (2023) - [i17]Samuel Sokota, Gabriele Farina, David J. Wu, Hengyuan Hu, Kevin A. Wang, J. Zico Kolter, Noam Brown:
The Update Equivalence Framework for Decision-Time Planning. CoRR abs/2304.13138 (2023) - [i16]Minae Kwon, Hengyuan Hu, Vivek Myers, Siddharth Karamcheti, Anca D. Dragan, Dorsa Sadigh:
Toward Grounded Social Reasoning. CoRR abs/2306.08651 (2023) - [i15]Hengyuan Hu, Suvir Mirchandani, Dorsa Sadigh:
Imitation Bootstrapped Reinforcement Learning. CoRR abs/2311.02198 (2023) - 2022
- [c14]Samuel Sokota, Hengyuan Hu, David J. Wu, J. Zico Kolter, Jakob Nicolaus Foerster, Noam Brown:
A Fine-Tuning Approach to Belief State Modeling. ICLR 2022 - [c13]Athul Paul Jacob, David J. Wu, Gabriele Farina, Adam Lerer, Hengyuan Hu, Anton Bakhtin, Jacob Andreas, Noam Brown:
Modeling Strong and Human-Like Gameplay with KL-Regularized Search. ICML 2022: 9695-9728 - [c12]Brandon Cui, Hengyuan Hu, Andrei Lupu, Samuel Sokota, Jakob N. Foerster:
Off-Team Learning. NeurIPS 2022 - [c11]Hengyuan Hu, Samuel Sokota, David J. Wu, Anton Bakhtin, Andrei Lupu, Brandon Cui, Jakob N. Foerster:
Self-Explaining Deviations for Coordination. NeurIPS 2022 - [i14]Brandon Cui, Hengyuan Hu, Luis Pineda, Jakob N. Foerster:
K-level Reasoning for Zero-Shot Coordination in Hanabi. CoRR abs/2207.07166 (2022) - [i13]Hengyuan Hu, Samuel Sokota, David J. Wu, Anton Bakhtin, Andrei Lupu, Brandon Cui, Jakob N. Foerster:
Self-Explaining Deviations for Coordination. CoRR abs/2207.12322 (2022) - [i12]Hengyuan Hu, David J. Wu, Adam Lerer, Jakob N. Foerster, Noam Brown:
Human-AI Coordination via Human-Regularized Search and Learning. CoRR abs/2210.05125 (2022) - 2021
- [c10]Andrei Lupu, Hengyuan Hu, Jakob N. Foerster:
Trajectory Diversity for Zero-Shot Coordination. AAMAS 2021: 1593-1595 - [c9]Hengyuan Hu, Adam Lerer, Brandon Cui, Luis Pineda, Noam Brown, Jakob N. Foerster:
Off-Belief Learning. ICML 2021: 4369-4379 - [c8]Andrei Lupu, Brandon Cui, Hengyuan Hu, Jakob N. Foerster:
Trajectory Diversity for Zero-Shot Coordination. ICML 2021: 7204-7213 - [c7]Brandon Cui, Hengyuan Hu, Luis Pineda, Jakob N. Foerster:
K-level Reasoning for Zero-Shot Coordination in Hanabi. NeurIPS 2021: 8215-8228 - [c6]Arnaud Fickinger, Hengyuan Hu, Brandon Amos, Stuart J. Russell, Noam Brown:
Scalable Online Planning via Reinforcement Learning Fine-Tuning. NeurIPS 2021: 16951-16963 - [i11]Hengyuan Hu, Adam Lerer, Brandon Cui, Luis Pineda, David J. Wu, Noam Brown, Jakob N. Foerster:
Off-Belief Learning. CoRR abs/2103.04000 (2021) - [i10]Hengyuan Hu, Adam Lerer, Noam Brown, Jakob N. Foerster:
Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings. CoRR abs/2106.09086 (2021) - [i9]Arnaud Fickinger, Hengyuan Hu, Brandon Amos, Stuart Russell, Noam Brown:
Scalable Online Planning via Reinforcement Learning Fine-Tuning. CoRR abs/2109.15316 (2021) - 2020
- [j1]Tristan Cazenave, Yen-Chi Chen, Guan-Wei Chen, Shi-Yu Chen, Xian-Dong Chiu, Julien Dehos, Maria Elsa, Qucheng Gong, Hengyuan Hu, Vasil Khalidov, Cheng-Ling Li, Hsin-I Lin, Yu-Jin Lin, Xavier Martinet, Vegard Mella, Jérémy Rapin, Baptiste Rozière, Gabriel Synnaeve, Fabien Teytaud, Olivier Teytaud, Shi-Cheng Ye, Yi-Jun Ye, Shi-Jim Yen, Sergey Zagoruyko:
Polygames: Improved zero learning. J. Int. Comput. Games Assoc. 42(4): 244-256 (2020) - [c5]Adam Lerer, Hengyuan Hu, Jakob N. Foerster, Noam Brown:
Improving Policies via Search in Cooperative Partially Observable Games. AAAI 2020: 7187-7194 - [c4]Hengyuan Hu, Jakob N. Foerster:
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning. ICLR 2020 - [c3]Hengyuan Hu, Adam Lerer, Alex Peysakhovich, Jakob N. Foerster:
"Other-Play" for Zero-Shot Coordination. ICML 2020: 4399-4410 - [c2]Jack Parker-Holder, Luke Metz, Cinjon Resnick, Hengyuan Hu, Adam Lerer, Alistair Letcher, Alexander Peysakhovich, Aldo Pacchiano, Jakob N. Foerster:
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian. NeurIPS 2020 - [i8]Tristan Cazenave, Yen-Chi Chen, Guan-Wei Chen, Shi-Yu Chen, Xian-Dong Chiu, Julien Dehos, Maria Elsa, Qucheng Gong, Hengyuan Hu, Vasil Khalidov, Cheng-Ling Li, Hsin-I Lin, Yu-Jin Lin, Xavier Martinet, Vegard Mella, Jérémy Rapin, Baptiste Rozière, Gabriel Synnaeve, Fabien Teytaud, Olivier Teytaud, Shi-Cheng Ye, Yi-Jun Ye, Shi-Jim Yen, Sergey Zagoruyko:
Polygames: Improved Zero Learning. CoRR abs/2001.09832 (2020) - [i7]Hengyuan Hu, Adam Lerer, Alex Peysakhovich, Jakob N. Foerster:
"Other-Play" for Zero-Shot Coordination. CoRR abs/2003.02979 (2020) - [i6]Jack Parker-Holder, Luke Metz, Cinjon Resnick, Hengyuan Hu, Adam Lerer, Alistair Letcher, Alex Peysakhovich, Aldo Pacchiano, Jakob N. Foerster:
Ridge Rider: Finding Diverse Solutions by Following Eigenvectors of the Hessian. CoRR abs/2011.06505 (2020)
2010 – 2019
- 2019
- [c1]Hengyuan Hu, Denis Yarats, Qucheng Gong, Yuandong Tian, Mike Lewis:
Hierarchical Decision Making by Generating and Following Natural Language Instructions. NeurIPS 2019: 10025-10034 - [i5]Hengyuan Hu, Denis Yarats, Qucheng Gong, Yuandong Tian, Mike Lewis:
Hierarchical Decision Making by Generating and Following Natural Language Instructions. CoRR abs/1906.00744 (2019) - [i4]Hengyuan Hu, Jakob N. Foerster:
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning. CoRR abs/1912.02288 (2019) - [i3]Adam Lerer, Hengyuan Hu, Jakob N. Foerster, Noam Brown:
Improving Policies via Search in Cooperative Partially Observable Games. CoRR abs/1912.02318 (2019) - 2016
- [i2]Hengyuan Hu, Rui Peng, Yu-Wing Tai, Chi-Keung Tang:
Network Trimming: A Data-Driven Neuron Pruning Approach towards Efficient Deep Architectures. CoRR abs/1607.03250 (2016) - [i1]Hengyuan Hu, Lisheng Gao, Quanbin Ma:
Deep Restricted Boltzmann Networks. CoRR abs/1611.07917 (2016)
Coauthor Index
aka: Jakob Nicolaus Foerster
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-20 22:55 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint